简体   繁体   English

如何在Tensorflow中使用预训练模型?

[英]How to use a pretrained model with Tensorflow?

I know that the following is an already answered question, but even though i tried and tried all the proposed solutions, none of them solved my problem. 我知道以下是已经回答的问题,但是即使我尝试了所有建议的解决方案,也没有一个解决了我的问题。 I made this net for training over MNIST dataset. 我制作了这个网络,用于训练MNIST数据集。 At the beginning it was deeper, but in order to focus on the problem i simplified it. 在一开始它比较深入,但是为了专注于问题,我简化了它。

mnist = mnist_data.read_data_sets('MNIST_data', one_hot=True)

# train the net
def train():
    for i in range(1000):
        batch_xs, batch_ys = mnist.train.next_batch(100)
        sess.run(train_step, feed_dict={x: batch_xs, y_: batch_ys})
        print("accuracy", sess.run(accuracy, feed_dict={x: mnist.test.images, y_: mnist.test.labels}))
        if i%100==0:
            save_path = saver.save(sess, "./tmp/model.ckpt", global_step = i, write_meta_graph=True)    
            print("Model saved in file: %s" % save_path)

# evaluate the net
def test(image, label):
    true_value = tf.argmax(label, 1)
    prediction = tf.argmax(y, 1)
    print("true value:", sess.run(true_value))
    print("predictions", sess.run(prediction, feed_dict={x:image}))

sess = tf.InteractiveSession()

x = tf.placeholder("float", shape=[None, 784])
W = tf.Variable(tf.zeros([784,10]), name = "W1")
b = tf.Variable(tf.zeros([10]), name = "B1")
y = tf.nn.softmax(tf.matmul(x,W) + b, name ="Y")
y_ = tf.placeholder("float", shape=[None, 10])
cross_entropy = -tf.reduce_sum(y_*tf.log(y))
train_step = tf.train.GradientDescentOptimizer(0.01).minimize(cross_entropy)
correct_prediction = tf.equal(tf.argmax(y,1), tf.argmax(y_,1))
accuracy = tf.reduce_mean(tf.cast(correct_prediction, "float"))

saver = tf.train.Saver()
model_to_restore="./tmp/model.ckpt-100.meta"
if os.path.isfile(model_to_restore):
    #what i have to do here?????#
else:
#this part works!#
    print("Model does not exist: training")
    train()

Thanks everybody for the answers! 谢谢大家的回答!

Regards, 问候,

Silvio 西尔维奥

UPDATE UPDATE

  • I tried both 我都尝试过

     saver.restore(sess, model_to_restore) 

    and

     saver = tf.train.import_meta_graph(model_to_restore) saver.restore(sess, model_to_restore) 

    but in both cases i had this error from terminal: 但在两种情况下,我都从终端收到此错误:

     DataLossError (see above for traceback): Unable to open table file ./tmp/model.ckpt.meta: Data loss: not an sstable (bad magic number): perhaps your file is in a different file format and you need to use a different restore operator? [[Node: save/RestoreV2 = RestoreV2[dtypes=[DT_FLOAT], _device="/job:localhost/replica:0/task:0/cpu:0"](_recv_save/Const_0, save/RestoreV2/tensor_names, save/RestoreV2/shape_and_slices)]] 

I think your location to the model might be wrong and i would advise you to give the following workflow a try. 我认为您在模型上的位置可能有误,我建议您尝试以下工作流程。

Since the saved models comprise several files i usually save them to a folder after training: 由于保存的模型包含多个文件,因此我通常在训练后将它们保存到文件夹中:

modelPath = "myMNIST/model"
saved_path = saver.save(sess, os.path.join(modelPath, "model.ckpt"))
print("Model saved in file: ", saved_path)

This will also tell you the exact location where it has been saved. 这还将告诉您确切的保存位置。

Then i can start my predictor inside the saved location (cd into myMNIST) and restore the model by: 然后,我可以在保存的位置(将CD放入myMNIST)中启动我的预测变量,并通过以下方式恢复模型:

ckpt = tf.train.get_checkpoint_state("./model")
if ckpt and ckpt.model_checkpoint_path:
    print("Restored Model")
    saver.restore(sess, ckpt.model_checkpoint_path)
else:
    print("Could not restore model!")

声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.

 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM