Python：一种用read（）忽略/解释换行符的方法

>header1 hereComesTextWithNewlineAtPosition_80 hereComesTextWithNewlineAtPosition_80 hereComesTextWithNewlineAtPosition_80 andEnds >header2 hereComesTextWithNewlineAtPosition_80 hereComesTextWithNewlineAtPosAAAAAAAA AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA AAAAAAAAAAAAAAAAAAAAlineAtPosition_80 MaybeAnotherTargetBBBBBBBBBBBrestText andEndsSomewhereHere

for s in SeqIO.parse(sys.argv[2], "fasta"): #foundClusters stores the information for substrings I want extracted currentCluster = foundClusters.get(s.id) if(currentCluster is not None): for i in range(len(currentCluster)): outputFile.write(">"+s.id+"|cluster"+str(i)+"\n") flanking = 25 start = currentCluster[i][0] end = currentCluster[i][1] left = currentCluster[i][2] if(start - flanking < 0): start = 0 else: start = start - flanking if(end + flanking > end + left): end = end + left else: end = end + flanking #for debugging only print(currentCluster) print(start) print(end) outputFile.write(s.seq[start, end+1])

[[1, 55, 2782]] 0 80 Traceback (most recent call last): File "findClaClusters.py", line 92, in <module> outputFile.write(s.seq[start, end+1]) File "/usr/local/lib/python3.4/dist-packages/Bio/Seq.py", line 236, in __getitem__ return Seq(self._data[index], self.alphabet) TypeError: string indices must be integers

1条回答

网友

1楼 · 发布于 2024-04-25 18:57:42

使用Biopython：

from Bio import SeqIO
X = 66
Y = 130
for s in in SeqIO.parse("test.fst", "fasta"):
    if "header2" == s.id:
         print s.seq[X: Y+1]
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA

Biopython让您解析一个文件并轻松访问其id、描述和序列。然后就有了一个Seq对象，可以方便地对它进行操作，而无需重新编码所有内容（如反向补码等）。你知道吗

相关问题更多 >

编程相关推荐

热门问题

热门文章