我的代码中有什么错误是错误的，错误会随着梯度下降的每次迭代而不断增加？

# Imports ```python import numpy as np import pandas as pd import matplotlib.pyplot as plt ``` # Model Preparation ## Gradient descent ```python def gradient_descent(m, theta, alpha, num_of_iterations, X, Y): # print(m, theta, alpha, num_of_iterations) for i in range(num_of_iterations): htheta_vector = np.dot(X,theta) # print(X.shape, theta.shape, htheta_vector.shape) error_vector = htheta_vector - Y gradient_vector = (1/m) * (np.dot(X.T, error_vector)) # each element in gradient_vector corresponds to each theta theta = theta - alpha * gradient_vector return theta ``` # Main ```python def main(): df = pd.read_csv('data2.csv', header = None) #loading data data = df.values # converting dataframe to numpy array X = data[:, 0:2] # print(X.shape) Y = data[:, -1] m = (X.shape)[0] # number of training examples Y = Y.reshape(m, 1) ones = np.ones(shape = (m,1)) X_with_bias = np.concatenate([ones, X], axis = 1) theta = np.zeros(shape = (3,1)) # two features, so three parameters alpha = 0.001 num_of_iterations = 400 theta = gradient_descent(m, theta, alpha, num_of_iterations, X_with_bias, Y) # calling gradient descent # print('Parameters learned: ' + str(theta)) if __name__ == '__main__': main() ```

2条回答

网友

1楼 · 编辑于 2024-04-23 21:07:51

请尝试使用特征规范化来解决此问题。只是特征值是一个很大的数字，当值很大时，代价函数（平方误差）会以很快的速度增加。一般来说，当您试图最小化非线性代价函数时，执行平均标准化和特征缩放。你知道吗

网友

2楼 · 编辑于 2024-04-23 21:07:51

进行特征规范化。Asummingthis是您的数据集，X的第一个维度以千为单位，第二个维度以万为单位，Y以十万为单位。使用sklearn.preprocessing.scale将所有数据列和目标设置为[0,1]，也可以使用此选项脏标准化：

 X[:,0] = X[:,0] / np.max( X[:,0])

 X[:,1] = X[:,1] / np.max( X[:,1])

 Y = Y / np.max(Y)

我用这些规范化程序重新运行你的代码。θ收敛到 [ 0.81705857], [ 0.98398577], [ 0.98398577]

为将来的问题提供数据文件或数据框摘要的链接。你知道吗

相关问题更多 >

编程相关推荐

热门问题

热门文章