擅长:python、mysql、java
<p>为了解决性能问题,您可能希望尽可能多地使用预编译的C模块。使用regex有一个解决方案:</p>
<pre><code>seq = 'CCTGCGGAAGATCGGCACTAGAATAGCCAGAACCGTTTCTCTGAGGCTTCCGGCCTTCCCTCCCACTAATAATTCTGAGG'
import re
perc = re.subn(r'[GC]', '', seq) / len(seq)
</code></pre>
<p>并处理“>;”行:</p>
<pre><code>seq = []
name = ''
for line in open('Rosalind_gc.txt'):
if not line.startswith('>'):
seq.append(line.strip())
else:
if seq:
seq = ''.join(seq)
perc = re.subn(r'[GC]', '', seq) / len(seq)
print('{} has GC percent: {}'.format(name, perc * 100))
seq = []
name = line.strip()
</code></pre>