SparseCategoricalCrossentropy 形状不匹配

Question

I want to do a simple test of the SparseCategoricalCrossentropy function, to see what exactly it does to an output.我想对 SparseCategoricalCrossentropy 函数做一个简单的测试，看看它到底对输出做了什么。 For that I use the output of the last layer of a MobileNetV2.为此，我使用 MobileNetV2 的最后一层的输出。

    import keras.backend as K

    full_model = tf.keras.applications.MobileNetV2(
    input_shape=(224,224,3),
    alpha=1.0,
    include_top=True,
    weights="imagenet",
    input_tensor=None,
    pooling=None,
    classes=1000,
    classifier_activation="softmax",)

    func = K.function(full_model.layers[1].input, full_model.layers[155].output)
    conv_output = func([processed_image])
    y_pred = np.single(conv_output)
    
    y_true = np.zeros(1000).reshape(1,1000)
    y_true[0][282] = 1
    
    scce = tf.keras.losses.SparseCategoricalCrossentropy()
    scce(y_true, y_pred).numpy()

processed_image is a 1x224x224x3 array created previously. processed_image是先前创建的1x224x224x3阵列。

I'm getting the error ValueError: Shape mismatch: The shape of labels (received (1000,)) should equal the shape of logits except for the last dimension (received (1, 1000)).我收到错误ValueError: Shape mismatch: The shape of labels (received (1000,)) should equal the shape of logits except for the last dimension (received (1, 1000)).

I tried reshaping the arrays to match the dimensions the error mentioned, but it doesn't seem to work.我尝试重塑数组以匹配提到的错误的维度，但它似乎不起作用。 What shapes does it accept?它接受什么形状？

Answer 1

Since you are using the SparseCategoricalCrossentropy loss function, the shape of y_true should be [batch_size] and the shape of y_pred should be [batch_size, num_classes] .由于您使用的是SparseCategoricalCrossentropy损失函数，因此y_true的形状应为[batch_size] ，而y_pred的形状应为[batch_size, num_classes] 。 Furthermore, y_true should consist of integer values.此外， y_true应该由整数值组成。 See the documentation .请参阅文档。 In your concrete example, you could try something like this:在你的具体例子中，你可以尝试这样的事情：

import keras.backend as K
import tensorflow as tf
import numpy as np

full_model = tf.keras.applications.MobileNetV2(
             input_shape=(224,224,3),
             alpha=1.0,
             include_top=True,
             weights="imagenet",
             input_tensor=None,
             pooling=None,
             classes=1000,
             classifier_activation="softmax",)

batch_size = 1
processed_image = tf.random.uniform(shape=[batch_size,224,224,3])
func = K.function(full_model.layers[1].input, 
full_model.layers[155].output)
conv_output = func([processed_image])
y_pred = np.single(conv_output)

# Generates an integer between 0 and 999 representing a class index.
y_true = np.random.randint(low = 0, high = 999, size = batch_size)
# [984]
scce = tf.keras.losses.SparseCategoricalCrossentropy() 
scce(y_true, y_pred).numpy()
# y_pred encodes a probability distribution here and the calculated loss is 10.69202

You can experiment with the batch_size to see how everything works.您可以尝试使用batch_size来查看一切是如何工作的。 In the example above, I just used a batch_size of 1.在上面的例子中，我只使用了 1 的batch_size 。

SparseCategoricalCrossentropy 形状不匹配

问题描述

1 个解决方案

解决方案1
1 已采纳 2021-10-15 09:35:43

SparseCategoricalCrossentropy 形状不匹配

问题描述

1 个解决方案

解决方案1 1 已采纳 2021-10-15 09:35:43

解决方案1
1 已采纳 2021-10-15 09:35:43