
Optimize sparse softmax cross entropy with L2 regularization

I was training my network using tf.losses.sparse_softmax_cross_entropy as the classification loss in the last layer, and everything was working fine.

I have now simply added L2 regularization over my weights, and my loss is no longer being optimized. What could be happening?

import tensorflow as tf  # TF 1.x API

# Sum-of-squares L2 penalty over both weight matrices (tf.nn.l2_loss returns sum(w**2) / 2)
reg = tf.nn.l2_loss(w1) + tf.nn.l2_loss(w2)
# Mean cross-entropy plus the scaled L2 penalty
loss = tf.reduce_mean(tf.losses.sparse_softmax_cross_entropy(y, logits)) + reg * beta
train_step = tf.train.GradientDescentOptimizer(learning_rate).minimize(loss)


It is hard to answer with certainty given the provided information, but here is a possible cause:

tf.nn.l2_loss is computed as a sum over the elements, while your cross-entropy loss is reduced to its mean (cf. tf.reduce_mean), hence a numerical imbalance between the two terms. For example, with a 784×128 weight matrix the L2 term sums over roughly 100,000 entries, so it can easily dwarf the mean cross-entropy unless beta is very small.

Try, for instance, dividing each L2 loss by the number of elements it is computed over (e.g. tf.size(w1)).
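As a minimal sketch of that fix, assuming a TF 1.x graph-mode setup: the layer sizes (784/128/10), beta, and learning_rate below are illustrative placeholders, not values from the question.

import tensorflow as tf  # TF 1.x graph-mode API, as in the question

# Hypothetical shapes and hyperparameters, purely for illustration
beta = 0.01
learning_rate = 0.5
x = tf.placeholder(tf.float32, [None, 784])
y = tf.placeholder(tf.int64, [None])
w1 = tf.Variable(tf.truncated_normal([784, 128], stddev=0.1))
b1 = tf.Variable(tf.zeros([128]))
w2 = tf.Variable(tf.truncated_normal([128, 10], stddev=0.1))
b2 = tf.Variable(tf.zeros([10]))
hidden = tf.nn.relu(tf.matmul(x, w1) + b1)
logits = tf.matmul(hidden, w2) + b2

# Divide each L2 term by the number of weights it sums over so the
# regularizer's scale is comparable to the mean cross-entropy term.
reg = (tf.nn.l2_loss(w1) / tf.cast(tf.size(w1), tf.float32)
       + tf.nn.l2_loss(w2) / tf.cast(tf.size(w2), tf.float32))

loss = tf.reduce_mean(
    tf.losses.sparse_softmax_cross_entropy(labels=y, logits=logits)) + beta * reg
train_step = tf.train.GradientDescentOptimizer(learning_rate).minimize(loss)

Equivalently, you could keep tf.nn.l2_loss unscaled and simply use a much smaller beta; the point is only that the two loss terms end up on comparable scales, so the gradient is not dominated by the regularizer.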
