parris:利用iso-seq分析和注释圆形rna

parris的Python项目详细描述


H1> PARRIS:用Iso Seq 分析和注释循环RNA

开始

安装

^ STR 1 } $ PARRIS<<强>是用^ {CD1>}编写的,请使用^ {CD2>}来安装^ {STR 1 } $ PARRIS:

pip install parris

或,您可以按照指令安装^ {STR 1 } $PARRIS从源:

git clone https://github.com/yangao07/PARRIS.git
cd PARRIS
python setup.py install          # install main package
pip install -r requirements.txt  # install dependencies

另外,请确保系统中安装了^{}(>=v1.6)^{}(>=v2.26.0)

< H3>运行^ {STR 1 } PARRIS <强/> EH3>

命令示例1:

parris -t 8 long_circRNA.fa reference.fa gene_anno.gtf circRNA.bed output_folder

命令示例2:

parris -t 8 long_circRNA.fa reference.fa gene_anno.gtf circRNA.bed output_folder \
    --short-read short_read.fa \
    --Alu ./anno/hg19/alu.bed   \
    --all-repeat ./anno/hg19/all_repeat.bed

详细参数:

parris -h
usage: parris [-h] [-v] [-t THREADS] [--short-read short.fa] [--lordec LORDEC]
              [--kmer KMER] [--solid SOLID] [--trf TRF] [--match MATCH]
              [--mismatch MISMATCH] [--indel INDEL] [--match-frac MATCH_FRAC]
              [--indel-frac INDEL_FRAC] [--min-score MIN_SCORE]
              [--max-period MAX_PERIOD] [--fxtools FXTOOLS]
              [--min-len MIN_LEN] [--min-copy MIN_COPY] [--min-frac MIN_FRAC]
              [--minimap MINIMAP] [-f] [--high-max-ratio HIGH_MAX_RATIO]
              [--high-min-ratio HIGH_MIN_RATIO]
              [--high-iden-ratio HIGH_IDEN_RATIO]
              [--high-repeat-ratio HIGH_REPEAT_RATIO]
              [--low-repeat-ratio LOW_REPEAT_RATIO] [--Alu ALU]
              [--flank-len FLANK_LEN] [--all-repeat ALL_REPEAT] [-s SITE_DIS]
              [-S END_DIS]
              long.fa ref.fa anno.gtf circRNA.bed/gtf output

PARRIS: Profiling and Annotating ciRcular RNA with Iso-Seq

positional arguments:
  long.fa               Long read data generated from long-read circRNA
                        sequencing technique.
  ref.fa                Reference genome sequence file.
  anno.gtf              Whole gene annotation file in GTF format.
  circRNA.bed/gtf       circRNA annotation file in BED12 or GTF format.
  output                Output directory for final result and temporary files.

optional arguments:
  -h, --help            show this help message and exit
  -v, --version         show program's version number and exit

General options:
  -t THREADS, --threads THREADS
                        Number of thread to use. (default: 8)

Hybrid error-correction with short-read data (LoRDEC):
  --short-read short.fa
                        Short-read data for error correction. Use ',' to
                        connect multiple or paired-end short read data.
                        (default: )
  --lordec LORDEC       Path to lordec-correct. (default: lordec-correct)
  --kmer KMER           k-mer size. (default: 21)
  --solid SOLID         Solid k-mer abundance threshold. (default: 3)

Detecting tandem-repeat with TRF(Tandem Repeat Finder):
  --trf TRF             Path to trf program. (default: trf409.legacylinux64)
  --match MATCH         Match score. (default: 2)
  --mismatch MISMATCH   Mismatch penalty. (default: 7)
  --indel INDEL         Indel penalty. (default: 7)
  --match-frac MATCH_FRAC
                        Match probability. (default: 80)
  --indel-frac INDEL_FRAC
                        Indel probability. (default: 10)
  --min-score MIN_SCORE
                        Minimum alignment score to report. (default: 100)
  --max-period MAX_PERIOD
                        Maximum period size to report. (default: 2000)

Extracting and aligning consensus sequence to genome (minimap2):
  --fxtools FXTOOLS     Path to fxtools. (default: fxtools)
  --min-len MIN_LEN     Minimum consensus length to keep. (default: 30)
  --min-copy MIN_COPY   Minimum copy number of consensus to keep. (default:
                        2.0)
  --min-frac MIN_FRAC   Minimum fraction of original long read to keep.
                        (default: 0.0)
  --minimap MINIMAP     Path to minimap2. (default: minimap2)
  -f, --do-classify     Classify circRNA alignment into high-quality and low-
                        quality. (default: False)
  --high-max-ratio HIGH_MAX_RATIO
                        Maximum mappedLen / consLen ratio for high-quality
                        alignment. (default: 1.1)
  --high-min-ratio HIGH_MIN_RATIO
                        Minimum mappedLen /consLen ratio for high-quality
                        alignment. (default: 0.9)
  --high-iden-ratio HIGH_IDEN_RATIO
                        Minimum identicalBases/ consLen ratio for high-quality
                        alignment. (default: 0.75)
  --high-repeat-ratio HIGH_REPEAT_RATIO
                        Maximum mappedLen / consLen ratio for high-quality
                        self-tandem consensus. (default: 0.6)
  --low-repeat-ratio LOW_REPEAT_RATIO
                        Minimum mappedLen / consLen ratio for low-quality
                        self-tandem alignment. (default: 1.9)

Evaluating circRNA with annotation:
  --Alu ALU             Alu repetitive element annotation in BED format.
                        (default: )
  --flank-len FLANK_LEN
                        Length of upstream and downstream flanking sequence to
                        search for Alu. (default: 500)
  --all-repeat ALL_REPEAT
                        All repetitive element annotation in BED format.
                        (default: )
  -s SITE_DIS, --site-dis SITE_DIS
                        Allowed distance between circRNA internal-splice-site
                        and annoated splice-site. (default: 0)
  -S END_DIS, --end-dis END_DIS
                        Allowed distance between circRNA back-splice-site and
                        annoated splice-site. (default: 10)

变更日志(v1.5.9)

  1. 修复搜索已知拼接接头时出现的错误。
  2. 使用已知和规范的内部拼接链信息来指导反向拼接连接的搜索。

欢迎加入QQ群-->: 979659372 Python中文网_新手群

推荐PyPI第三方库


热门话题
ArrayList中实体对象上的JAVA泛型   带Redis的爪哇芹菜vs单用Redis   java在设备面向横向时隐藏标题栏/通知栏   java JXTreeTable:如何使用ComponentProvider为一列设置渲染器   java创建异常的成本与记录异常的成本相比   java在方法参数中使用setter传递新对象   java在一个类中的方法与另一个类中的方法交互时遇到问题   java如何迭代2个大小相等的ArrayList   Java getDesktop()。open在Windows中工作,但在Mac中不工作   从tomcat切换到jetty后的java“无法启动嵌入式容器”,Spring引导   java如何使用void方法   java如何在解组时在JAXB的ValidationEventHandler中获取节点值?   如何使用Akka Java API创建不响应的TCP接收器   JavaScriptjQuery在java中将记录上传到数据库时的进度条   如何在重新绘制时在java小程序中显示上一个图像