将原始文本数据文件转换为单个Excel文件。
项目描述
txt2xls将原始文本数据文件转换为单个Excel文件。它使用maidenhair来读取原始文本文件,因此如果有maidenhair插件,任何类型的原始文本文件都可以使用。
安装
使用pip如下
$ pip install txt2xls
快速使用
假设有几个原始文本数据文件如下
# Sample1.txt 0 10 1 20 2 30 3 40 4 50 5 60 # Sample2.txt 0 15 1 25 2 35 3 45 4 55 5 65 # Sample3.txt 0 12 1 22 2 32 3 42 4 52 5 62
然后运行txt2xls
% txt2xls -o output Sample*.txt
它将生成output.xls文件。Excel文件包含Sample1、Sample2和Sample3工作表。
使用方法
usage: txt2xls [-h] [-v] [-p PARSER] [-l LOADER] [-u USING] [--unite]
[--unite-basecolumn UNITE_BASECOLUMN]
[--unite-function UNITE_FUNCTION] [--classify]
[--classify-function CLASSIFY_FUNCTION] [--relative]
[--relative-origin RELATIVE_ORIGIN]
[--relative-basecolumn RELATIVE_BASECOLUMN] [--baseline]
[--baseline-basecolumn BASELINE_BASECOLUMN]
[--baseline-function BASELINE_FUNCTION]
[--peakset-method {argmax,argmin}]
[--peakset-basecolumn PEAKSET_BASECOLUMN]
[--peakset-where-function PEAKSET_WHERE_FUNCTION]
[--raise-exception] [-o OUTFILE]
infiles [infiles ...]
positional arguments:
infiles Path list of data files or directories which have data
files.
optional arguments:
-h, --help show this help message and exit
-v, --version show program's version number and exit
--raise-exception If it is specified, raise exceptions.
-o OUTFILE, --outfile OUTFILE
An output filename without extensions. The required
filename extension will be automatically determined
from an output format.
Reading options:
-p PARSER, --parser PARSER
A maidenhair parser name which will be used to parse
the raw text data.
-l LOADER, --loader LOADER
A maidenhair loader name which will be used to load
the raw text data.
-u USING, --using USING
A colon (:) separated column indexes. It is used for
limiting the reading columns.
Unite options:
--unite Join the columns of classified dataset with respecting
--unite-basecolumn.The dataset is classified with
--unite-function.
--unite-basecolumn UNITE_BASECOLUMN
An index of columns which will be used as a base
column for regulating data point region.
--unite-function UNITE_FUNCTION
A python script file path or a content of python
lambda expression which will be used for classifing
dataset. If it is not spcified, a filename character
before period (.) will be used to classify.
Classify options:
--classify Classify dataset with --classify-function. It will
influence the results of --relative and --baseline.
--classify-function CLASSIFY_FUNCTION
A python script file path or a content of python
lambda expression which will be used for classifing
dataset. If it is not specified, a filename character
before the last underscore (_) will be used to
classify.
Relative options:
--relative If it is True, the raw data will be converted to
relative data from the specified origin, based on the
specified column. See `--relative-origin` and
`--relative-basecolumn` also.
--relative-origin RELATIVE_ORIGIN
A dataset number which will be used as an orign of the
relative data. It is used with `--relative` option.
--relative-basecolumn RELATIVE_BASECOLUMN
A column number which will be used as a base column to
make the data relative. It is used with `--relative`
option.
Baseline options:
--baseline If it is specified, the specified data file is used as
a baseline of the dataset. See `--baseline-basecolumn`
and `--baseline-function` also.
--baseline-basecolumn BASELINE_BASECOLUMN
A column index which will be proceeded for baseline
regulation. It is used with `--baseline` option.
--baseline-function BASELINE_FUNCTION
A python script file path or a content of python
lambda expression which will be used to determine the
baseline value from the data. `columns` and `column`
variables are available in the lambda expression.
Peakset options:
--peakset-method {argmax,argmin}
A method to find peak data point.
--peakset-basecolumn PEAKSET_BASECOLUMN
A column index which will be used for finding peak
data point.
--peakset-where-function PEAKSET_WHERE_FUNCTION
A python script file path or a content of python
lambda expression which will be used to limit the
range of data points for finding. peak data point.
`data` is available in the lambda expression.
首选项
您可以在Linux中创建配置文件为~/.config/txt2xls/txt2xls.cfg、Mac中为~/.txt2xls.cfg或Windows中为%APPDATA%\txt2xls\txt2xls.cfg。
默认首选项等于以下配置文件
[default]
raise_exception = False
[reader]
parser = 'parsers.PlainParser'
loader = 'loaders.PlainLoader'
using = None
[[classify]]
enabled = False
function = 'builtin:classify_function'
[[unite]]
enabled = False
function = 'builtin:unite_function'
basecolumn = 0
[[relative]]
enabled = False
origin = 0
basecolumn = 1
[[baseline]]
enabled = False
function = 'builtin:baseline_function'
basecolumn = 1
[writer]
default_filename = 'output.xls'
[[peakset]]
method = 'argmax'
basecolumn = -1
where_function = 'builtin:where_function'
我不使用Microsoft Windows,因此Windows中配置文件的路径可能不正确。如果有任何错误,请告诉我。
项目详情
关闭
txt2xls-0.2.2.tar.gz 的哈希值
| 算法 | 哈希摘要 | |
|---|---|---|
| SHA256 | 540925d69432b5bdd4bd6fe954948a4799a99c07d708b84c4fa4b7b2968e4b08 |
|
| MD5 | bc4f5e71364bfc5754ff8941cb6b4be8 |
|
| BLAKE2b-256 | 92d9e014899397f0259595a760a5acb4b1164ecd0599f9a63fe1a5f4d5c744e6 |