
When is Momentum Applied in Tensorflow Gradient Tape?

I've been playing around with automatic gradients in TensorFlow and I have a question. If we are using an optimizer with momentum, say Adam, when is the momentum algorithm applied to the gradient? Is it applied when we call tape.gradient(loss, model.trainable_variables) or when we call model.optimizer.apply_gradients(zip(dtf_network, model.trainable_variables))?

Thanks!

tape.gradient computes the raw gradients without reference to any optimizer. Since momentum is part of the optimizer, the tape does not include it. Momentum is usually implemented by adding extra "slot" variables to the optimizer that store a running average of the gradients. All of this is handled in optimizer.apply_gradients.
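To make the separation concrete without requiring TensorFlow, here is a minimal plain-Python sketch (all names are illustrative, not TensorFlow APIs): the gradient function is stateless, mirroring tape.gradient, while the momentum buffer lives in the optimizer and is only updated inside its apply step, mirroring optimizer.apply_gradients.

```python
def compute_gradient(w, x, y):
    # Raw gradient of the squared-error loss 0.5*(w*x - y)**2 w.r.t. w.
    # Stateless, like tape.gradient: no optimizer state is involved.
    return (w * x - y) * x

class MomentumSGD:
    """Toy optimizer holding a momentum buffer, analogous to an
    optimizer's slot variable in TensorFlow (illustrative only)."""
    def __init__(self, lr=0.1, beta=0.9):
        self.lr = lr
        self.beta = beta
        self.velocity = 0.0  # extra state stored in the optimizer

    def apply_gradients(self, grad, w):
        # Momentum is applied HERE, not during gradient computation.
        self.velocity = self.beta * self.velocity + grad
        return w - self.lr * self.velocity

w = 0.0
opt = MomentumSGD()
for _ in range(3):
    g = compute_gradient(w, x=1.0, y=1.0)  # raw gradient, no momentum
    w = opt.apply_gradients(g, w)          # momentum folded in here
```

The same division holds for Adam: tape.gradient returns the plain gradient, and the first- and second-moment accumulators are only read and updated inside apply_gradients.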

