aakbar——氨基酸k-mer标记工具
aakbar的Python项目详细描述
氨基酸k-mer工具,用于创建、搜索和分析来自基因组或dna读取的系统发育特征。
先决条件
需要64位Python3.4或更高版本。建议使用8 GB或更大的内存。
aakbar的python依赖项是:biopython,click>;=5.0,click_plugins numpy,pandas,pyfaidx, 还有皮亚姆。运行这些示例还需要pyfastaq https://pypi.python.org/pypi/pyfastaq 包裹。
如果您没有安装满足这些要求的python,我建议您 Python<;https://www.continum.io/downloads>;在MacOSX和Windows上实现平滑 以及预安装的软件包。一旦安装了Python, 您可以在macosx上获得这样的依赖项:
export PATH=~/anaconda/bin:${PATH} # you might want to put this in your .profile conda install click conda install --channel https://conda.anaconda.org/IOOS click-plugins conda install --channel https://conda.anaconda.org/bioconda pyfaidx conda install --channel https://conda.anaconda.org/bioconda pyfastaq # only required for examples
安装
这个软件包在Linux和MacOS下使用Python3.5进行测试,可从PyPI获得。到 通过pip(或某些发行版下的pip3)安装:
pip install aakbar
如果您希望开发aakbar,请下载release 在顶级目录中:
pip install --editable .
如果希望直接从git安装pip,请使用以下命令:
pip install git+https://github.com/ncgr/aakbar.git
用法
安装程序将一个名为aakbar的脚本放在您的路径中。使用格式为:
aakbar [GLOBALOPTIONS] COMMAND [COMMANDOPTIONS] [ARGS]
命令列表可通过aakbar --help获得。当前可用的命令是:
calculate_peptide_terms | Write peptide terms and histograms. |
conserved_signature_stats | Stats on signatures found in all input genomes. |
define_set | Define an identifier and directory for a set. |
define_summary | Define summary directory and label. |
demo_simplicity | Demo self-provided simplicity outputs. |
filter_peptide_terms | Remove high-simplicity terms. |
init_config_file | Initialize a configuration file. |
install_demo_scripts | Copy demo scripts to the current directory. |
intersect_peptide_terms | Find intersecting terms from multiple sets. |
label_set | Define label associated with a set. |
peptide_simplicity_mask | Lower-case high-simplicity regions in FASTA. |
search_peptide_occurrances | Find signatures in peptide space. |
set_letterfreq_window | Define size of letterfreq window. |
set_plot_type | Define label associated with a set. |
set_simplicity_object | Select simplicity-calculation object. |
show_config | Print location and contents of config file. |
show_context_object | Print the global context object. |
test_logging | Logs at different severity levels. |
示例
bash脚本,实现计算和使用 硬壁菌和链球菌,连同从genbank下载的数据,将 在发出命令时在(空)当前工作目录中创建:
aakbar install_demo_files
在Linux和MacOS上,按照说明运行演示。在Windows上,您将 需要安装bash,脚本才能工作。
许可证
aakbar是在bsd许可证下发布的。