我有这样的输入文件
@a
CTCTCTTTAGAACAATCATCACATACCCCTGGTT
+
1>1A>FDF3113B11BB1BAED1D11BAEE0ABG
@b
CAATGCAGCGAACAATGTTCTGGTGGTTGAATTT
+
111>1@11C?1AE?BFDGGGGGBGCCGGGGHHHH
....
我计算了这个文件中的平均质量分数,但需要在计算标准的帮助 分数的偏差
这就是我所做的
import sys,csv
import numpy as np
import math
r = open(sys.argv[1],"r")
length =200
a = np.zeros(length)
b = np.zeros(length)
av = np.zeros(length)
while True:
id = reads.readline().rstrip()
if id == "":
break
seq= r.readline().rstrip()
p = r.readline().rstrip()
qua = r.readline().rstrip()
l= len(qua)
q = [ord(character) - 33 for character in qua]
for i in range(l):
a[i] += q[i]
b[i] += 1
for j in range(length):
av[j] = float(a[j]) / b[j]
s2 = sum((x - av[j])**2 for x in q[i])/b[j] # error in this
standard_deviation = s2**0.5
print standard_deviation
这个怎么样?你知道吗
相关问题 更多 >
编程相关推荐