Python等价于将zcat结果管道化到P中的filehandle

#!/usr/bin/perl my $SRG = $ARGV[0]; # reads.fastq.gz open($fh, sprintf("zcat %s |", $SRG)) or die "Broken gunzip $!\n"; # -i: input -n: db name -p: program open ($fh2, "| formatdb -i stdin -n $SRG -p F") or die "no piping formatdb!, $!\n"; #Fastq => Fasta sub my $localcounter = 0; while (my $line = <$fh>){ if ($. % 4==1){ print $fh2 "\>" . substr($line, 1); $localcounter++; } elsif ($localcounter == 1){ print $fh2 "$line"; $localcounter = 0; } else{ } } close $fh; close $fh2; exit;

2条回答

网友

1楼 · 编辑于 2024-05-28 18:16:43

您可以使用以下函数解析整个文件并将其作为行列表加载：

    def convert_gz_to_list_of_lines(filepath):
     """Parse gz file and convert it into a list of lines."""
     file_as_list = list()
     with gzip.open(filepath, 'rt', encoding='utf-8') as f:
      try:
       for line in f:
        file_as_list.append(line)
      except EOFError:
        file_as_list = file_as_list
      return file_as_list

网友

2楼 · 编辑于 2024-05-28 18:16:43

首先，在Perl和Python中有一个更好的解决方案：只需使用gzip库。在Python中，有一个in the stdlib；在Perl中，您可以在CPAN上找到一个。例如：

with gzip.open(path, 'r', encoding='utf-8') as f:
    for line in f:
        do_stuff(line)

比花时间去zcat要简单得多，效率更高，可移植性更强。在

但是如果您真的想用Python启动子进程并控制它的管道，那么可以使用^{}模块来实现。而且，与Perl不同，Python可以做到这一点而不必在中间插入一个shell。在Replacing Older Functions with the ^{} Module的文档中甚至有一个很好的部分给你食谱。在

所以：

^{pr2}$

现在，zcat.stdout是一个类似文件的对象，使用通常的read方法等，将管道包装到zcat子进程。在

例如，要在Python 3.x中一次读取一个二进制文件8K：

zcat = subprocess.Popen(['zcat', path], stdout=subprocess.PIPE)
for chunk in iter(functools.partial(zcat.stdout.read, 8192), b''):
    do_stuff(chunk)
zcat.wait()

（如果您想在Python2.x中执行此操作，或者一次读取一个文本文件，而不是一次读取一个二进制文件8K，或者其他任何方法，那么这些更改与任何其他处理文件编码的更改相同。）

相关问题更多 >

编程相关推荐

热门问题

热门文章