tensorflow：使用tf.estimator和keras进行词汇查询

Question

I have the following dataset 我有以下数据集

username,itemname,value
"carl","socks",12.50
"john doe","shirts",30.00
...

I also have the following vocabulary lookup files 我也有以下词汇查询文件

usernames.txt usernames.txt

carl
john doe
bob smith
...

itemnames.txt itemnames.txt

socks
shirts
shoes
...

I will be receiving the strings at prediction time. 我将在预测时间接收字符串。 There is no way around that. 没有办法解决。 In order to make training similar I am using tf.contrib.lookup 为了使训练相似，我正在使用tf.contrib.lookup

import tf.contrib.lookup

user_lookup = tf.contrib.lookup.index_table_from_file(
    vocabulary_file='usernames.txt'
)

item_lookup = tf.contrib.lookup.index_table_from_file(
    vocabulary_file='itemnames.txt'
)

Now I have the following model defined using the keras api 现在，我使用keras api定义了以下模型

import tensorflow as tf

user_input = tf.keras.layers.Input(shape=(1,), dtype=tf.int32)
item_input = tf.keras.layers.Input(shape=(1,), dtype=tf.int32)

user_embedding = tf.keras.layers.Embedding(input_dim=num_users, output_dim=10)(user_input)
item_embedding = tf.keras.layers.Embedding(input_dim=num_items, output_dim=10)(item_input)

...
output = ...
model = tf.keras.Model([user_input, item_input], output)
model.compile(...)

I am using tf.estimator for training and prediction. 我正在使用tf.estimator进行训练和预测。 So my first instinct is to do the following: 因此，我的第一个直觉是执行以下操作：

my_estimator = tf.keras.estimator.model_to_estimator(keras_model=model)

tf.tables_initializer()

def train_fn(dataset_iterator):
     (username, itemname), value = dataset_iterator.get_next()
     userid = user_lookup.lookup(username)
     itemid = item_lookup.lookup(itemname)
     return (username, itemname), value

my_train_spec = tf.estimator.TrainSpec(
   input_fn=train_fn(train_data)
)

my_eval_spec = tf.estimator.EvalSpec(
   input_fn=train_fn(validation_data)
)

tf.estimator.train_and_evaluate(
    estimator=my_estimator,
    train_spec=my_train_spec,
    eval_spec=my_eval_spec
)

When I run this I get the follow error: 运行此命令时，出现以下错误：

ValueError: Tensor("Cast_2:0", shape=(), dtype=int32) must be from the same graph as Tensor("Item-Embedding-LMF/embeddings/Read/ReadVariableOp:0", shape=(429099, 10), dtype=float32, device=/job:ps/task:1).

Can anyone recommend a solution to this problem? 谁能推荐解决此问题的方法？ Or maybe even a different approach to handling this lookup? 也许甚至还有其他方法来处理此查找？

Answer 1

Mostly, no issue with the lookup. 通常，查找没有问题。 The all the variaables should be asscociated with the same graphs May be write the model in the scope eg 所有变量都应使用相同的图进行关联。可以将模型写入范围内，例如

def model_fn: 
    with tf.variable_scope('my_model', reuse=tf.AUTO_REUSE):
    ....
    ..
     return estimator

tensorflow：使用tf.estimator和keras进行词汇查询

问题描述

1 个解决方案

解决方案1
0 2019-08-22 19:36:47

tensorflow：使用tf.estimator和keras进行词汇查询

问题描述

1 个解决方案

解决方案1 0 2019-08-22 19:36:47

解决方案1
0 2019-08-22 19:36:47