Converting from tf.gradients() to tf.GradientTape() returns None

Question

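I am migrating some TF1 code to TF2. For the full code, you can check lines [155-176] here. In TF1 there is a line that computes the gradients given a loss (a float value) and an (m, n) tensor.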

Edit: the problem persists.

Note: the TF2 code should be compatible with, and should work inside, a tf.function.

g = tf.gradients(-loss, f)  # loss being a float and f being a (m, n) tensor
k = -f_pol / (f + eps)  # f_pol another (m, n) tensor and eps a float
k_dot_g = tf.reduce_sum(k * g, axis=-1)
adj = tf.maximum(
    0.0,
    (tf.reduce_sum(k * g, axis=-1) - delta)
    / (tf.reduce_sum(tf.square(k), axis=-1) + eps),
)
g = g - tf.reshape(adj, [nenvs * nsteps, 1]) * k
grads_f = -g / (nenvs * nsteps)
grads_policy = tf.gradients(f, params, grads_f)  # params being the model parameters

In the TF2 code, I am trying:

with tf.GradientTape() as tape:
    f = calculate_f()
    f_pol = calculate_f_pol()
    others = do_further_calculations()
    loss = calculate_loss()
g = tape.gradient(-loss, f)

However, I keep getting g = [None], whether I use tape.watch(f), create a tf.Variable with the value of f, or even call tf.gradients() inside a tf.function (outside a tf.function it complains).
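One thing worth checking in the attempt above, offered as a possibility rather than a confirmed diagnosis: the negation in tape.gradient(-loss, f) runs after the with block has ended, so the tape never records it and the target is not connected to f. The sketch below reproduces that symptom in isolation. The variable x and the toy loss are hypothetical stand-ins, not the asker's model.

```python
import tensorflow as tf

# Hypothetical stand-in for the model parameters.
x = tf.Variable([[1.0, 2.0], [3.0, 4.0]])

# persistent=True lets us call tape.gradient() more than once.
with tf.GradientTape(persistent=True) as tape:
    f = tf.nn.softmax(x)                  # intermediate (m, n) tensor
    loss = tf.reduce_sum(tf.square(f))    # toy scalar loss built from f
    neg_loss = -loss                      # negation recorded by the tape

# The negation here happens after recording stopped, so this target
# has no recorded path back to f.
g_outside = tape.gradient(-loss, f)

# Same quantity, but the negation was recorded inside the with block.
g_inside = tape.gradient(neg_loss, f)

# Alternative: differentiate loss itself and negate the result afterwards.
g_negated = -tape.gradient(loss, f)

del tape  # release the resources held by the persistent tape

print(g_outside)          # None
print(g_inside.shape)     # (2, 2)
```

Note that f needs no tape.watch() here because it is computed inside the tape from a trainable variable. Wrapping the value of f in a fresh tf.Variable would create a new, disconnected source, which would also be expected to give None.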

Answer 1

It is most likely one of the following:

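Are you differentiating with respect to a tf.Variable inside a function decorated with @tf.function?
Some of the variables are numpy.array objects rather than tf.Tensor objects.
You modified some external variables (i.e. global variables) inside the decorated function.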
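A minimal sketch of a migration that avoids the cases listed above may help: the variable is created once outside the decorated function, every step uses TensorFlow ops rather than NumPy, and nothing external is mutated. The weight matrix w, the softmax policy, and the toy loss are hypothetical stand-ins, and the k / adj correction from the question is omitted for brevity. The TF1 call tf.gradients(f, params, grads_f) is mapped here onto the output_gradients argument of tape.gradient().

```python
import tensorflow as tf

# Hypothetical parameters, created once outside the tf.function.
w = tf.Variable([[0.5, -0.5], [0.25, 0.75]])
params = [w]


@tf.function
def policy_grads(obs, eps=1e-6):
    # A persistent tape is needed because gradient() is called twice.
    with tf.GradientTape(persistent=True) as tape:
        f = tf.nn.softmax(tf.matmul(obs, w))             # (m, n) tensor
        loss = tf.reduce_sum(f * tf.math.log(f + eps))   # toy scalar loss

    # TF1: g = tf.gradients(-loss, f)
    g = -tape.gradient(loss, f)

    # Rescale by the batch size (nenvs * nsteps in the question).
    grads_f = -g / tf.cast(tf.shape(f)[0], f.dtype)

    # TF1: tf.gradients(f, params, grads_f)
    grads_policy = tape.gradient(f, params, output_gradients=[grads_f])
    del tape
    return g, grads_policy


obs = tf.constant([[1.0, 2.0], [3.0, 4.0], [5.0, 6.0]])
g, grads_policy = policy_grads(obs)
print(g.shape, grads_policy[0].shape)
```

With the correction step left out, grads_policy should match the plain gradient of loss with respect to w divided by the batch size, which gives a quick way to sanity-check the wiring before adding the remaining steps back in.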

Solution 1
1 2021-03-02 06:23:27
