将Numpy Lstsq残差值转换为R^2
我正在进行一个最小二乘回归分析,主要是处理一个变量的数据。我想用R^2这个值来表达结果的重要性。Numpy给出的结果是未缩放的残差,我该如何合理地对这个值进行标准化呢?
field_clean,back_clean = rid_zeros(backscatter,field_data)
num_vals = len(field_clean)
x = field_clean[:,row:row+1]
y = 10*log10(back_clean)
A = hstack([x, ones((num_vals,1))])
soln = lstsq(A, y )
m, c = soln [0]
residues = soln [1]
print residues
1 个回答
23
请查看这个链接:http://en.wikipedia.org/wiki/Coefficient_of_determination
你的R2值等于:
1 - residual / sum((y - y.mean())**2)
这等同于:
1 - residual / (n * y.var())
举个例子:
import numpy as np
# Make some data...
n = 10
x = np.arange(n)
y = 3 * x + 5 + np.random.random(n)
# Note that polyfit is an easier way to do this...
# It would just be "model, resid = np.polyfit(x,y,1,full=True)[:2]"
A = np.vstack((x, np.ones(n))).T
model, resid = np.linalg.lstsq(A, y)[:2]
r2 = 1 - resid / (y.size * y.var())
print r2