
Why does the accuracy of the training model not change in this TensorFlow code?

I am a novice with TensorFlow and Python. I modified a sample TensorFlow program by adding one hidden layer with 50 units, but the accuracy turned out to be wrong and it did not change no matter how many times the model trained. I cannot find any problem with the code. The dataset is MNIST:

import tensorflow as tf
from tensorflow.examples.tutorials.mnist import input_data

mnist = input_data.read_data_sets("MNIST_data", one_hot = True)

batch_size = 100
n_batch = mnist.train.num_examples // batch_size

x = tf.placeholder(tf.float32, [None, 784])
y = tf.placeholder(tf.float32, [None, 10])


W = tf.Variable(tf.zeros([784, 50]))
b = tf.Variable(tf.zeros([50]))

Wx_plus_b_L1 = tf.matmul(x,W) + b
L1 = tf.nn.relu(Wx_plus_b_L1)

W_2 = tf.Variable(tf.zeros([50, 10]))
b_2 = tf.Variable(tf.zeros([10]))

prediction = tf.nn.softmax(tf.matmul(L1, W_2) + b_2)


loss = tf.reduce_mean(tf.square(y - prediction))
train_step = tf.train.GradientDescentOptimizer(0.2).minimize(loss)


correct_prediction = tf.equal(tf.argmax(y,1), tf.argmax(prediction,1))

accuracy = tf.reduce_mean(tf.cast(correct_prediction, tf.float32))

init = tf.global_variables_initializer()


with tf.Session() as sess:
    sess.run(init)
    for epoch in range(21):
        for batch in range(n_batch):
            batch_xs, batch_ys = mnist.train.next_batch(batch_size)
            sess.run(train_step, feed_dict={x: batch_xs, y: batch_ys})
        acc = sess.run(accuracy, feed_dict={x: mnist.test.images, y: mnist.test.labels})
        print("Iter:" + str(epoch) + ", Testing Accuray:" + str(acc))

The output is always the same accuracy:

Iter:0, Testing Accuray:0.1135
2018-05-31 18:05:21.039188: W tensorflow/core/framework/allocator.cc:101] Allocation of 31360000 exceeds 10% of system memory.
Iter:1, Testing Accuray:0.1135
2018-05-31 18:05:22.551525: W tensorflow/core/framework/allocator.cc:101] Allocation of 31360000 exceeds 10% of system memory.
Iter:2, Testing Accuray:0.1135
2018-05-31 18:05:24.070686: W tensorflow/core/framework/allocator.cc:101] Allocation of 31360000 exceeds 10% of system memory.

What's wrong in this code? Thank you~~

I think it has to do with the graph. accuracy is never updated, because train_step is the only op you are calling during training. Change this code to:

with tf.Session() as sess:
    sess.run(init)
    for epoch in range(21):
        for batch in range(n_batch):
            batch_xs, batch_ys = mnist.train.next_batch(batch_size)
            sess.run([train_step, accuracy], feed_dict={x: batch_xs, y: batch_ys})
        acc = sess.run(accuracy, feed_dict={x: mnist.test.images, y: mnist.test.labels})
        print("Iter:" + str(epoch) + ", Testing Accuray:" + str(acc))

The reason is that I initialized all the weights and biases to zero. If so, all of the neurons' outputs will be the same, and the back-propagation behavior of every neuron within the same layer is identical: the same gradient, hence the same weight update. This is clearly an unacceptable result.
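
A minimal sketch of the fix, assuming TensorFlow 1.x: give the weight matrices small random initial values (tf.truncated_normal here) so that neurons in the same layer start out different and the symmetry is broken; the zero initialization for the biases can stay.

import tensorflow as tf

# Random initial weights break the symmetry between hidden units;
# with all zeros, every unit computes the same output and gets the same gradient.
W = tf.Variable(tf.truncated_normal([784, 50], stddev=0.1))
b = tf.Variable(tf.zeros([50]))  # zero biases are fine once the weights differ

W_2 = tf.Variable(tf.truncated_normal([50, 10], stddev=0.1))
b_2 = tf.Variable(tf.zeros([10]))

With this change, the rest of the training loop can stay exactly as written.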

I have had the same problem with the Titanic dataset. What helped was changing the learning rate:

optimize = tf.train.AdamOptimizer(learning_rate=0.000001).minimize(mean_loss)

When I changed it from 0.001, the accuracy finally started changing. Before that, I had tried playing with the number of layers, the batch size, and the hidden layer size, but nothing helped.
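
A quick way to check whether the learning rate is the culprit is a small sweep. This is only a toy sketch: a one-parameter quadratic loss stands in for the real model's mean_loss, just to compare how different rates converge:

import tensorflow as tf

# Toy stand-in for the model: one weight, squared-error loss with minimum at w = 3.
for lr in [0.001, 0.0001, 0.000001]:
    tf.reset_default_graph()  # build a fresh graph for each candidate rate
    w = tf.Variable(0.0)
    mean_loss = tf.square(w - 3.0)
    optimize = tf.train.AdamOptimizer(learning_rate=lr).minimize(mean_loss)
    with tf.Session() as sess:
        sess.run(tf.global_variables_initializer())
        for _ in range(1000):
            sess.run(optimize)
        print("lr=%g, final loss=%.6f" % (lr, sess.run(mean_loss)))

Whichever rate actually drives the loss down is the one worth trying on the full model.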
