在 Tensorflow 2.0 中使用 GradientTape() 和 jacobian() 时出错

Question

I am working with GradientTape() and jacobian() in Tensorflow 2.0 in Python.我正在 Python 中的 Tensorflow 2.0 中使用 GradientTape() 和 jacobian()。

This code executes fine:这段代码执行得很好：

x = tf.Variable(2.0, dtype=tf.float32)
with tf.GradientTape() as gT:
    gT.watch(x)
    g = tf.convert_to_tensor([x, 0.0], dtype=tf.float32)
dg = gT.jacobian(g, x)

But this code breaks:但是这段代码中断了：

x = tf.Variable(2.0, dtype=tf.float32)
with tf.GradientTape() as gT:
    gT.watch(x)
    gv = tf.Variable([x, 0.0], dtype=tf.float32)
    g = tf.convert_to_tensor(gv , dtype=tf.float32)
dg = gT.jacobian(g, x)

and throws the error:并抛出错误：

InvalidArgumentError: You must feed a value for placeholder tensor 'loop_body/Placeholder' with dtype int32 [[node loop_body/Placeholder (defined at ...Anaconda3\\lib\\site-packages\\tensorflow_core\\python\\framework\\ops.py:1751) ]] [Op:__inference_f_995] InvalidArgumentError：您必须使用 dtype int32 [[node loop_body/Placeholder（定义于 ...Anaconda3\\lib\\site-packages\\tensorflow_core\\python\\framework\\ops.py:1751）为占位符张量“loop_body/Placeholder”提供一个值]] [操作：__inference_f_995]

Traceback (most recent call last) ipython-input-32-686c8a0d6e95 in module模块中的回溯（最近一次调用） ipython-input-32-686c8a0d6e95
4 gv = tf.Variable([x, 0.0], dtype=tf.float32) 4 gv = tf.Variable([x, 0.0], dtype=tf.float32)
5 g = tf.convert_to_tensor(gv , dtype=tf.float32) 5 g = tf.convert_to_tensor(gv, dtype=tf.float32)
----> 6 dg = gT.jacobian(g, x) ----> 6 dg = gT.jacobian(g, x)

Why does the first code work, but the second code doesn't?为什么第一个代码有效，但第二个代码不起作用？

Answer 1

The reason is pretty simple,原因很简单，

In the first example, you got在第一个例子中，你得到

g = tf.convert_to_tensor([x, 0.0], dtype=tf.float32)

and compute dg/dx and g has a direct relationship with x and works fine.并计算dg/dx和g与x有直接关系并且工作正常。

But in the second example,但在第二个例子中，

gv = tf.Variable([x, 0.0], dtype=tf.float32)
g = tf.convert_to_tensor(gv , dtype=tf.float32)

There's no connection between g and x any more, because when you call, g和x之间不再有联系，因为当你打电话时，

gv = tf.Variable([x, 0.0], dtype=tf.float32)

it just copies the values from x and doesn't carry a reference to x , so you cannot get derivative dg/dx .它只是从x复制值并且不携带对x的引用，因此您无法获得导数dg/dx 。 But if you try dg/d(gv) it'll work.但是如果你尝试dg/d(gv)它会起作用。

PS : I didn't get an error though (for your second example). PS ：虽然我没有收到错误（对于你的第二个例子）。 I just got None .我刚刚得到None 。

在 Tensorflow 2.0 中使用 GradientTape() 和 jacobian() 时出错

问题描述

1 个解决方案

解决方案1
1 2020-01-05 19:22:23

在 Tensorflow 2.0 中使用 GradientTape() 和 jacobian() 时出错

问题描述

1 个解决方案

解决方案1 1 2020-01-05 19:22:23

解决方案1
1 2020-01-05 19:22:23