Rosalind: Overlap Graphs

listTitle = []
listContent = []
#SPLIT is the parsed list of DNA strings
#here I create two new lists, one (listTitle) containing the four numbers identifying a particular string, and the second (listContent) containing the actual strings ('>Rosalind_' has been removed, because it is what I split the file with)
while i < len(SPLIT):
    curr = SPLIT[i]
    title = curr[0:4:1]
    listTitle.append(title)
    content = curr[4::1]
    listContent.append(content)
    i+=1
start = []
end = []
#now I create two new lists, one containing the first three chars of the string and the second containing the last three chars, a particular string's index will be the same in both lists, as well as in the title list
for item in listContent:
    start.append(item[0:3:1])
    end.append(item[len(item)-3:len(item):1])
list = []
#then I iterate through both lists, checking if the suffix and prefix are equal, but not originating from the same string, and append their titles to a last list
p=0
while p<len(end):
    iterator=0
    while iterator<len(start):
        if p!=iterator:
            if end[p] == start[iterator]:
                one=listTitle[p]
                two=listTitle[iterator]
                list.append(one)
                list.append(two)
        iterator+=1
    p+=1
#finally I print the list in the format that they require for the answer
listInc=0
while listInc < len(list):
    print "Rosalind_"+list[listInc]+' '+"Rosalind_"+list[listInc+1]
    listInc+=2

1 answer

User

#1 · Posted 2024-05-15 03:34:44

I'm not sure what is wrong with your code, but here is an approach that might be considered more "Pythonic".

I'll assume you have already read your data into a dictionary that maps names to DNA strings:

{'Rosalind_0442': 'AAATCCC',
 'Rosalind_0498': 'AAATAAA',
 'Rosalind_2323': 'TTTTCCC',
 'Rosalind_2391': 'AAATTTT',
 'Rosalind_5013': 'GGGTGGG'}
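If the data is still sitting in a FASTA-style file, a sketch along these lines could build such a dictionary. The function name parse_fasta is hypothetical, and the layout (a '>' header line followed by one or more sequence lines per record) is an assumption about the input rather than something stated in the question:

```python
def parse_fasta(text):
    """Map each record name (the header without '>') to its DNA string."""
    data = {}
    name = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue  # skip blank lines
        if line.startswith('>'):
            # A header line starts a new record.
            name = line[1:]
            data[name] = ''
        elif name is not None:
            # A sequence may span several lines; join them without newlines.
            data[name] += line
    return data


# Small hypothetical sample, using two of the records shown above.
sample = """>Rosalind_0498
AAATAAA
>Rosalind_2391
AAATTTT
"""
data = parse_fasta(sample)
```

Stripping each line keeps stray newlines out of the sequences, which matters when comparing the last few characters of a string.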

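We define a simple function, is_k_overlap(s1, s2, k), that checks whether the last k characters of one string match the first k characters of another.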
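A minimal sketch of the is_k_overlap helper that k_edges below relies on. The signature is taken from how it is called there; the body is an assumption, inferred from the example output, that an overlap means the k-character suffix of the first string equals the k-character prefix of the second:

```python
def is_k_overlap(s1, s2, k):
    """Return True if the last k characters of s1 equal the first k of s2."""
    # Assumption: k must be at least 1, and strings shorter than k
    # never count as overlapping.
    if k < 1 or len(s1) < k or len(s2) < k:
        return False
    return s1[-k:] == s2[:k]
```

The explicit guard on k avoids a slicing pitfall: s1[-0:] is the whole string, not an empty one.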

Then we look at every pair of DNA sequences and pick out the ones that match. itertools.combinations makes this simpler:

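import itertools

def k_edges(data, k):
    # Directed edges (u, v): the last k characters of u match the first k of v.
    edges = []
    # combinations yields each unordered pair once, so test both directions.
    for u, v in itertools.combinations(data, 2):
        u_dna, v_dna = data[u], data[v]

        if is_k_overlap(u_dna, v_dna, k):
            edges.append((u, v))

        if is_k_overlap(v_dna, u_dna, k):
            edges.append((v, u))

    return edges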

For example, on the data above we get:

>>> k_edges(data, 3)
[('Rosalind_2391', 'Rosalind_2323'),
 ('Rosalind_0498', 'Rosalind_2391'),
 ('Rosalind_0498', 'Rosalind_0442')]
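To print the edges one pair per line, as the code in the question does, a sketch like the following could work. It swaps in itertools.permutations, which yields every ordered pair, instead of checking both directions of each combination. The name overlap_edges is hypothetical, and the sort is only there to make the output order deterministic:

```python
import itertools


def overlap_edges(data, k):
    """List (u, v) pairs where the k-suffix of data[u] equals the k-prefix of data[v]."""
    # permutations(data, 2) never pairs a name with itself, so self-loops are excluded.
    return [(u, v)
            for u, v in itertools.permutations(data, 2)
            if data[u][-k:] == data[v][:k]]


data = {'Rosalind_0442': 'AAATCCC',
        'Rosalind_0498': 'AAATAAA',
        'Rosalind_2323': 'TTTTCCC',
        'Rosalind_2391': 'AAATTTT',
        'Rosalind_5013': 'GGGTGGG'}

edges = sorted(overlap_edges(data, 3))
for u, v in edges:
    print(u, v)
# Prints:
# Rosalind_0498 Rosalind_0442
# Rosalind_0498 Rosalind_2391
# Rosalind_2391 Rosalind_2323
```

This assumes k is at least 1 and every string has at least k characters; both approaches compare all ordered pairs, so the running time grows quadratically with the number of strings.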
