在 Tensorflow 2.0 中使用 GradientTape() 和 jacobian() 時出錯

Question

我正在 Python 中的 Tensorflow 2.0 中使用 GradientTape() 和 jacobian()。

這段代碼執行得很好：

x = tf.Variable(2.0, dtype=tf.float32)
with tf.GradientTape() as gT:
    gT.watch(x)
    g = tf.convert_to_tensor([x, 0.0], dtype=tf.float32)
dg = gT.jacobian(g, x)

但是這段代碼中斷了：

x = tf.Variable(2.0, dtype=tf.float32)
with tf.GradientTape() as gT:
    gT.watch(x)
    gv = tf.Variable([x, 0.0], dtype=tf.float32)
    g = tf.convert_to_tensor(gv , dtype=tf.float32)
dg = gT.jacobian(g, x)

並拋出錯誤：

InvalidArgumentError：您必須使用 dtype int32 [[node loop_body/Placeholder（定義於 ...Anaconda3\\lib\\site-packages\\tensorflow_core\\python\\framework\\ops.py:1751）為占位符張量“loop_body/Placeholder”提供一個值]] [操作：__inference_f_995]

模塊中的回溯（最近一次調用） ipython-input-32-686c8a0d6e95
4 gv = tf.Variable([x, 0.0], dtype=tf.float32)
5 g = tf.convert_to_tensor(gv, dtype=tf.float32)
----> 6 dg = gT.jacobian(g, x)

為什么第一個代碼有效，但第二個代碼不起作用？

Answer 1

原因很簡單，

在第一個例子中，你得到

g = tf.convert_to_tensor([x, 0.0], dtype=tf.float32)

並計算dg/dx和g與x有直接關系並且工作正常。

但在第二個例子中，

gv = tf.Variable([x, 0.0], dtype=tf.float32)
g = tf.convert_to_tensor(gv , dtype=tf.float32)

g和x之間不再有聯系，因為當你打電話時，

gv = tf.Variable([x, 0.0], dtype=tf.float32)

它只是從x復制值並且不攜帶對x的引用，因此您無法獲得導數dg/dx 。 但是如果你嘗試dg/d(gv)它會起作用。

PS ：雖然我沒有收到錯誤（對於你的第二個例子）。 我剛剛得到None 。

在 Tensorflow 2.0 中使用 GradientTape() 和 jacobian() 時出錯

問題描述

1 個解決方案

解決方案1
1 2020-01-05 19:22:23

在 Tensorflow 2.0 中使用 GradientTape() 和 jacobian() 時出錯

問題描述

1 個解決方案

解決方案1 1 2020-01-05 19:22:23

解決方案1
1 2020-01-05 19:22:23