Trying to find a specific character in a string in consecutive windows of 10 using Python

window_size = 10
windows_length = len(data) // window_size
windows = [data[i:i+windows_length] for i in range(0, len(data), windows_length)]
result = sum(1 if 't' in (x) else 0 for x in windows)

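3 Answers

User

#1 · Edited 2024-04-20 02:27:21

If I understand correctly, you want to count how many windows contain 't'. My approach is to split data into windows of 10 characters and count how many of them contain 't'.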

window_size = 10
# Slice data into consecutive windows of window_size characters (the last may be shorter)
windows = [data[i:i + window_size] for i in range(0, len(data), window_size)]
# Count the windows that contain at least one 't'
result = sum(1 if 't' in x else 0 for x in windows)
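
As a self-contained sketch of this window approach: the function name `count_windows_containing` and the sample strings are made up for illustration, and the code assumes fixed-size windows of 10 characters with a possibly shorter final window.

```python
def count_windows_containing(data: str, char: str, window_size: int = 10) -> int:
    """Return how many consecutive windows of `data` contain `char` at least once."""
    # Step through the string window_size characters at a time.
    windows = [data[i:i + window_size] for i in range(0, len(data), window_size)]
    # `char in window` is a bool, and bools sum as 0 or 1.
    return sum(char in window for window in windows)


# Hypothetical sample: only the first window of 10 contains a 't'.
print(count_windows_containing("atgcttgcatgcccgcaaagg", "t"))  # 1
```

Stepping `range` by the window size avoids computing the number of windows up front, and an empty string simply yields zero windows.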

User

#2 · Edited 2024-04-20 02:27:21

You can use a list comprehension to split the data into a list of "windows":

from typing import List
# Slicing a str yields str, so each window is a str of up to 10 characters
windows: List[str] = [data[i * 10:(i + 1) * 10]
                      for i in range((len(data) + 10 - 1) // 10)]

Then count the occurrences in each window the same way:

counts: List[int] = [window.count('t') 
                     for window in windows]

You did not specify exactly how the output should be printed, so I will leave the rest to you, but try print(counts) to see whether that format works for you.
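
The expression `(len(data) + 10 - 1) // 10` is a ceiling division: it rounds the number of windows up so that a shorter final window is still included. A minimal runnable sketch of the two steps together, where the function name `window_counts` is hypothetical and the sample sequence is an assumption chosen for illustration:

```python
import math
from typing import List


def window_counts(data: str, char: str, size: int = 10) -> List[int]:
    """Count occurrences of `char` in each consecutive window of `size` characters."""
    # Ceiling division: 37 characters with size 10 gives (37 + 9) // 10 = 4 windows.
    n_windows = (len(data) + size - 1) // size
    return [data[i * size:(i + 1) * size].count(char) for i in range(n_windows)]


dna = 'atgcttgcatgcttgcaaatgcatgcttgcattgcaa'  # 37 characters
print((len(dna) + 10 - 1) // 10 == math.ceil(len(dna) / 10))  # True
print(window_counts(dna, 't'))  # [4, 3, 3, 2]
```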

User

#3 · Edited 2024-04-20 02:27:21

If the DNA sequence is a string, textwrap.wrap returns it as a list of wrapped lines (though there may be memory considerations). So you can write:

>>> from textwrap import wrap
>>> dna = 'atgcttgcatgcttgcaaatgcatgcttgcattgcaa'
>>> [chunk.count('t') for chunk in wrap(dna, 10)]
[4, 3, 3, 2]

To get the chunk number, you can use enumerate:

>>> print(*(f'On row #{i} "t" occurred {chunk.count("t")} times' for i, chunk in enumerate(wrap(dna, 10), start=1)), sep='\n')
On row #1 "t" occurred 4 times
On row #2 "t" occurred 3 times
On row #3 "t" occurred 3 times
On row #4 "t" occurred 2 times
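
If memory is a concern, one possible alternative (a sketch, not part of the answer above) is to skip building the list of chunks and count directly inside the original string. It relies on the optional start and end arguments of str.count; the generator name `iter_chunk_counts` is hypothetical. Note also that textwrap.wrap breaks lines at whitespace, so it only yields exact 10-character chunks when the sequence contains none, whereas this version counts by position.

```python
from typing import Iterator, Tuple


def iter_chunk_counts(seq: str, char: str, size: int = 10) -> Iterator[Tuple[int, int]]:
    """Lazily yield (row number, count of `char`) for each chunk of `size` characters."""
    for row, start in enumerate(range(0, len(seq), size), start=1):
        # str.count(sub, start, end) searches seq[start:end] without copying it.
        yield row, seq.count(char, start, start + size)


dna = 'atgcttgcatgcttgcaaatgcatgcttgcattgcaa'
for row, count in iter_chunk_counts(dna, 't'):
    print(f'On row #{row} "t" occurred {count} times')
```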
