简体   繁体   English

Tensorflow,在另一个tf.estimator model_fn中使用经过tf.estimator训练的模型

[英]Tensorflow, use a tf.estimator trained model within another tf.estimator model_fn

Is there a way to use tf.estimator trained model A in another model B? 有没有办法在另一个模型B中使用tf.estimator训练的模型A?

Here is situation, Let say I have a trained 'Model A' with model_a_fn(). 这是一种情况,假设我有一个受过训练的“ model A”,带有model_a_fn()。 'Model A' gets images as input, and outputs some vector floating values similar to MNIST classifier. “模型A”获取图像作为输入,并输出一些类似于MNIST分类器的矢量浮点值。 And there is another 'Model B' which is defined in model_b_fn(). 在model_b_fn()中定义了另一个“模型B”。 It also gets images as input, and needs vector output of 'Model A' while training 'Model B'. 它也获取图像作为输入,并在训练“模型B”时需要“模型A”的矢量输出。

So basically I want to train 'Model B' that need inputs as images & prediction output of 'Model A'. 因此,基本上我想训练需要输入作为“模型A”的图像和预测输出的“模型B”。 (No need to train 'Model A' anymore, only to get prediction output while training 'Model B') (不再需要训练“模型A”,只需要在训练“模型B”时获得预测输出即可)

I've tried three cases: 我尝试了三种情况:

  1. Use estimator object('Model A') inside model_b_fn() 在model_b_fn()中使用估算器对象(“ Model A”)
  2. Exported 'Model A' with tf.estimator.export_savedmodel(), and create prediction function. 使用tf.estimator.export_savedmodel()导出了“模型A”,并创建了预测函数。 Passed it to model_b_fn() with params dict. 使用paras dict将其传递给model_b_fn()。
  3. Same as 2, but restore 'Model A' inside model_b_fn() 与2相同,但在model_b_fn()中还原“模型A”

But all cases shows errors: 但是所有情况都显示错误:

  1. ... must be from the same graph as ... ...必须与...具有相同的图形
  2. TypeError: can't pickle _thread.RLock objects TypeError:无法腌制_thread.RLock对象
  3. TypeError: The value of a feed cannot be a tf.Tensor object. TypeError:供稿的值不能是tf.Tensor对象。

And here is my code I used... only attaching important parts 这是我使用的代码...仅附加重要部分

train_model_a.py train_model_a.py

def model_a_fn(features, labels, mode, params):
    # ...
    # ...
    # ...
    return

def main():
    # model checkpoint location
    model_a_dir = './model_a'

    # create estimator for Model A
    model_a = tf.estimator.Estimator(model_fn=model_a_fn, model_dir=model_a_dir)

    # train Model A
    model_a.train(input_fn=lambda : input_fn_a)
    # ...
    # ...
    # ...

    # export model a
    model_a.export_savedmodel(model_a_dir, serving_input_receiver_fn=serving_input_receiver_fn)
    # exported to ./model_a/123456789
    return

if __name__ == '__main__':
    main()

train_model_b_case_1.py train_model_b_case_1.py

# follows model_a's input format
def bypass_input_fn(x):
    features = {
        'x': x,
    }
    return features

def model_b_fn(features, labels, mode, params):
    # parse input
    inputs = tf.reshape(features['x'], shape=[-1, 28, 28, 1])

    # get Model A's response
    model_a = params['model_a']
    predictions = model_a.predict(
        input_fn=lambda: bypass_input_fn(inputs)
    )
    for results in predictions:
        # Error occurs!!!
        model_a_output = results['class_id']

    # build Model B
    layer1 = tf.layers.conv2d(inputs, 32, 5, same, activation=tf.nn.relu)
    layer1 = tf.layers.max_pooling2d(layer1, pool_size=[2, 2], strides=2)

    # ...
    # some layers added...
    # ...

    flatten = tf.layers.flatten(prev_layer)
    layern = tf.layers.dense(10)

    # let say layern's output shape and model_a_output's output shape is same
    add_layer = tf.add(flatten, model_a_output)

    # ...
    # do more... stuff
    # ...
    return

def main():
    # load pretrained model A
    model_a_dir = './model_a'
    model_a = tf.estimator.Estimator(model_fn=model_a_fn, model_dir=model_a_dir)

    # model checkpoint location
    model_b_dir = './model_b/'

    # create estimator for Model A
    model_b = tf.estimator.Estimator(
        model_fn=model_b_fn,
        model_dir=model_b_dir,
        params={
            'model_a': model_a,
        }
    )

    # train Model B
    model_b.train(input_fn=lambda : input_fn_b)
    return

if __name__ == '__main__':
    main()

train_model_b_case_2.py train_model_b_case_2.py

def model_b_fn(features, labels, mode, params):
    # parse input
    inputs = tf.reshape(features['x'], shape=[-1, 28, 28, 1])

    # get Model A's response
    model_a_predict_fn = params['model_a_predict_fn']
    model_a_prediction = model_a_predict_fn(
        {
            'x': inputs
        }
    )
    model_a_output = model_a_prediction['output']

    # build Model B
    layer1 = tf.layers.conv2d(inputs, 32, 5, same, activation=tf.nn.relu)
    layer1 = tf.layers.max_pooling2d(layer1, pool_size=[2, 2], strides=2)

    # ...
    # some layers added...
    # ...

    flatten = tf.layers.flatten(prev_layer)
    layern = tf.layers.dense(10)

    # let say layern's output shape and model_a_output's output shape is same
    add_layer = tf.add(flatten, model_a_output)

    # ...
    # do more... stuff
    # ...
    return

def main():
    # load pretrained model A
    model_a_dir = './model_a/123456789'
    model_a_predict_fn = tf.contrib.predictor.from_saved_model(export_dir=model_a_dir)

    # model checkpoint location
    model_b_dir = './model_b/'

    # create estimator for Model A
    # Error occurs!!!
    model_b = tf.estimator.Estimator(
        model_fn=model_b_fn,
        model_dir=model_b_dir,
        params={
            'model_a_predict_fn': model_a_predict_fn,
        }
    )

    # train Model B
    model_b.train(input_fn=lambda : input_fn_b)
    return

if __name__ == '__main__':
    main()

train_model_b_case_3.py train_model_b_case_3.py

def model_b_fn(features, labels, mode, params):
    # parse input
    inputs = tf.reshape(features['x'], shape=[-1, 28, 28, 1])

    # get Model A's response
    model_a_predict_fn = tf.contrib.predictor.from_saved_model(export_dir=params['model_a_dir'])
    # Error occurs!!!
    model_a_prediction = model_a_predict_fn(
        {
            'x': inputs
        }
    )
    model_a_output = model_a_prediction['output']

    # build Model B
    layer1 = tf.layers.conv2d(inputs, 32, 5, same, activation=tf.nn.relu)
    layer1 = tf.layers.max_pooling2d(layer1, pool_size=[2, 2], strides=2)

    # ...
    # some layers added...
    # ...

    flatten = tf.layers.flatten(prev_layer)
    layern = tf.layers.dense(10)

    # let say layern's output shape and model_a_output's output shape is same
    add_layer = tf.add(flatten, model_a_output)

    # ...
    # do more... stuff
    # ...
    return

def main():
    # load pretrained model A
    model_a_dir = './model_a/123456789'

    # model checkpoint location
    model_b_dir = './model_b/'

    # create estimator for Model A
    # Error occurs!!!
    model_b = tf.estimator.Estimator(
        model_fn=model_b_fn,
        model_dir=model_b_dir,
        params={
            'model_a_dir': model_a_dir,
        }
    )

    # train Model B
    model_b.train(input_fn=lambda : input_fn_b)
    return

if __name__ == '__main__':
    main()

So any idea on using trained custom tf.estimator in another tf.estimator please?? 因此,关于在另一个tf.estimator中使用训练有素的自定义tf.estimator的任何想法请吗?

I've figured out one solution to this problem. 我已经找到解决这个问题的一种方法。

One can use this method if struggling with same problem. 如果遇到同样的问题,可以使用这种方法。

  1. create a function which runs tensorflow.contrib.predictor.from_saved_model() -> call it 'pretrained_predictor()' 创建一个运行tensorflow.contrib.predictor.from_saved_model()的函数->将其称为'pretrained_predictor()'
  2. inside Model B's model_fn(), call above predefined 'pretrained_predictor()' wrap it with tensorflow.py_func() 在模型B的model_fn()中,在预定义的'pretrained_predictor()'上方调用tensorflow.py_func()进行包装

For example case, see https://github.com/moono/tf-cnn-mnist/blob/master/4_3_estimator_within_estimator.py for simple use case. 例如,请参阅https://github.com/moono/tf-cnn-mnist/blob/master/4_3_estimator_within_estimator.py了解简单用例。

声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.

 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM