如何使用tensorflow逼近hessian矩阵的范数

Question

I wonder is there any method to recompute gradients with updated weights within a graph or if there is any better way to do this.我想知道是否有任何方法可以在图中重新计算具有更新权重的梯度，或者是否有更好的方法来做到这一点。 For example, for estimating hessian norm, we need to compute例如，为了估计 Hessian 范数，我们需要计算

delta ~ N(0, I)
hessian_norm = 1/M \sum_{1}^{M} gradient(f(x+delta))- gradient(f(x-delta))/(2*delta)

we need to gradient value on x+delta .我们需要在x+delta上梯度值。 Currently we will get None type if we use tf.gradient on var+delta directly.目前，如果我们直接在var+delta上使用tf.gradient ，我们将得到None类型。

More specifally speaking, if we define更具体地说，如果我们定义

a = tf.Variable
b = some_function(a)
grad = tf.gradients(b, a)

that's a normal gradient computation but if we do这是一个正常的梯度计算，但如果我们这样做

grad_delta = tf.gradients(b, a+delta)

it will return None .它将返回None 。 This feature seems to make it impossible to approximate the hessian norm using the above method.这个特征似乎使得使用上述方法无法近似于 hessian 范数。

Answer 1

b is not a function of a+delta , so you get None s. b不是 a a+delta的函数，所以你得到None s。 You either need to create new value b2 which depends on a+delta , or just move your a variable by delta and eval again to get second value.您可能需要创造新的价值b2依赖于a+delta ，或只动a可变delta和EVAL再次获得第二个值。

This is similar to how you do line search in TensorFlow .这类似于您在 TensorFlow 中进行线搜索的方式。

如何使用tensorflow逼近hessian矩阵的范数

问题描述

1 个解决方案

解决方案1
1 2017-10-10 17:09:54

如何使用tensorflow逼近hessian矩阵的范数

问题描述

1 个解决方案

解决方案1 1 2017-10-10 17:09:54

解决方案1
1 2017-10-10 17:09:54