<p>Looking at the <a href="http://devdocs.io/tensorflow~python/tf/global_variables_initializer" rel="nofollow noreferrer">documentation</a>, <code>init = tf.global_variables_initializer()</code> is the same as <code>init = tf.variables_initializer(tf.global_variables())</code>.</p>
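<p>For reference, a minimal sketch of that equivalence (the full example at the bottom uses the longer spelling):</p>
<pre><code>init = tf.global_variables_initializer()
# per the docs, the line above is shorthand for:
init = tf.variables_initializer(tf.global_variables())
</code></pre>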
<p><a href="http://devdocs.io/tensorflow~python/tf/train/adamoptimizer" rel="nofollow noreferrer">^{<cd3>}</a>需要初始化一些内部变量(平均值统计等)</p>
<pre><code><tf.Variable 'beta1_power:0' shape=() dtype=float32_ref>
<tf.Variable 'beta2_power:0' shape=() dtype=float32_ref>
<tf.Variable 'x/Adam:0' shape=(2, 1) dtype=float32_ref> # 1st moment vector
<tf.Variable 'x/Adam_1:0' shape=(2, 1) dtype=float32_ref> # 2nd moment vector
</code></pre>
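<p>If you want to inspect the per-variable state Adam creates, here is a small sketch using the TF1 slot API (assuming the <code>x</code> and <code>cost_function</code> from the example further below; <code>beta1_power</code>/<code>beta2_power</code> are separate variables, not slots):</p>
<pre><code>opt = tf.train.AdamOptimizer(0.1)
optimize_op = opt.minimize(cost_function)
print(opt.get_slot_names())  # ['m', 'v'] -- the two moment vectors
print(opt.get_slot(x, 'm'))  # <tf.Variable 'x/Adam:0' shape=(2, 1) dtype=float32_ref>
print(opt.get_slot(x, 'v'))  # <tf.Variable 'x/Adam_1:0' shape=(2, 1) dtype=float32_ref>
</code></pre>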
<p><a href="http://devdocs.io/tensorflow~python/tf/train/adamoptimizer" rel="nofollow noreferrer">documentation</a>告诉您如何应用更新。在</p>
<p>In contrast, the vanilla gradient descent optimizer <a href="http://devdocs.io/tensorflow~python/tf/train/gradientdescentoptimizer" rel="nofollow noreferrer"><code>GradientDescentOptimizer</code></a> does not rely on any variables. That is the difference.
Now, before <a href="http://devdocs.io/tensorflow~python/tf/train/adamoptimizer" rel="nofollow noreferrer"><code>AdamOptimizer</code></a> can use its variables, these variables need to be initialized at some point.</p>
<p>要创建初始化所有所需变量的操作<code>init</code>,此操作<code>init</code>需要知道运行程序所需的变量。因此,它需要放在</em><a href="http://devdocs.io/tensorflow~python/tf/train/adamoptimizer" rel="nofollow noreferrer">^{<cd3>}</a>之后。在</p>
<p>If you place <code>init = tf.global_variables_initializer()</code> <em>before</em> <a href="http://devdocs.io/tensorflow~python/tf/train/adamoptimizer" rel="nofollow noreferrer"><code>tf.train.AdamOptimizer</code></a>, as in</p>
<pre><code># ...
init = tf.global_variables_initializer()
# ...
... = tf.train.AdamOptimizer(0.1).minimize(cost_function)
</code></pre>
<p>you will get</p>
<pre><code>Attempting to use uninitialized value beta1_power
</code></pre>
<p>This tells you that <a href="http://devdocs.io/tensorflow~python/tf/train/adamoptimizer" rel="nofollow noreferrer"><code>AdamOptimizer</code></a> tried to access <code><tf.Variable 'beta1_power:0' shape=() dtype=float32_ref></code>, which has not been initialized yet.</p>
<p>Hence,</p>
<pre><code># ...
... = tf.train.AdamOptimizer(0.1).minimize(cost_function)
# ...
init = tf.global_variables_initializer()
</code></pre>
<p>is the only correct order. You can check which variables exist at any given point by placing</p>
<pre><code>for variable in tf.global_variables():
    print(variable)
</code></pre>
<p>at that spot in your source code.</p>
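<p>At runtime you can also ask the session directly which variables are still uninitialized; a minimal sketch using <code>tf.report_uninitialized_variables()</code>:</p>
<pre><code>with tf.Session() as sess:
    # prints the names of all variables that have not been initialized yet
    print(sess.run(tf.report_uninitialized_variables()))
</code></pre>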
<p>Consider the example of minimizing the quadratic function <code>0.5*x'Ax - b'x + c</code>. In TensorFlow:</p>
<pre><code>import tensorflow as tf
import numpy as np

x = tf.Variable(np.random.rand(2, 1), dtype=tf.float32, name="x")
# b and A are constants -- we make clear that we are not going to optimize them
b = tf.constant([[5], [6]], dtype=tf.float32, name="b")
A = tf.constant([[9, 2], [2, 10]], dtype=tf.float32, name="A")
cost_function = 0.5 * tf.matmul(tf.matmul(tf.transpose(x), A), x) - tf.matmul(tf.transpose(b), x) + 42

for variable in tf.global_variables():
    print('before ADAM: global_variables_initializer would init {}'.format(variable))

optimize_op = tf.train.AdamOptimizer(0.1).minimize(cost_function)

for variable in tf.global_variables():
    print('after ADAM: global_variables_initializer would init {}'.format(variable))

init_op = tf.variables_initializer(tf.global_variables())
with tf.Session() as sess:
    sess.run(init_op)
    for i in range(5):
        loss, _ = sess.run([cost_function, optimize_op])
        print(loss)
</code></pre>
<p>The output is</p>
<pre><code>before ADAM: global_variables_initializer would init <tf.Variable 'x:0' shape=(2, 1) dtype=float32_ref>
after ADAM: global_variables_initializer would init <tf.Variable 'x:0' shape=(2, 1) dtype=float32_ref>
after ADAM: global_variables_initializer would init <tf.Variable 'beta1_power:0' shape=() dtype=float32_ref>
after ADAM: global_variables_initializer would init <tf.Variable 'beta2_power:0' shape=() dtype=float32_ref>
after ADAM: global_variables_initializer would init <tf.Variable 'x/Adam:0' shape=(2, 1) dtype=float32_ref>
after ADAM: global_variables_initializer would init <tf.Variable 'x/Adam_1:0' shape=(2, 1) dtype=float32_ref>
</code></pre>
<p>So when you place <code>tf.global_variables_initializer()</code> before defining ADAM via <code>tf.train.AdamOptimizer</code>, it cannot see the variables that ADAM requires. When using the <code>GradientDescentOptimizer</code> instead, the output is</p>
<pre><code>before ADAM: global_variables_initializer would init <tf.Variable 'x:0' shape=(2, 1) dtype=float32_ref>
after ADAM: global_variables_initializer would init <tf.Variable 'x:0' shape=(2, 1) dtype=float32_ref>
</code></pre>
<p>So nothing changes before and after defining the optimizer.</p>
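<p>You can verify this yourself with a small sketch: swap the optimizer line in the example above and re-run the inspection loop; the vanilla optimizer adds no variables of its own.</p>
<pre><code>optimize_op = tf.train.GradientDescentOptimizer(0.1).minimize(cost_function)
for variable in tf.global_variables():
    print(variable)  # only <tf.Variable 'x:0' shape=(2, 1) dtype=float32_ref>
</code></pre>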