Tf.keras model.predict() 返回大于 1 的 class 概率？

Question

我试图在 CNN 的 tf.keras 中调用 model.predict() 来预测单个图像的 class。 出于某种原因，class 的概率返回高于 1，这是荒谬的。 我不确定为什么会发生这种情况。 以下是我训练 CNN 的方法：

class_names = ['Angry','Disgust','Fear','Happy','Sad','Surprise','Neutral']
model = models.Sequential()
model.add(layers.Conv2D(64, (3, 3), activation='relu', input_shape=(48, 48, 1), kernel_regularizer=tf.keras.regularizers.l1(0.01)))
model.add(layers.Conv2D(128, (3, 3), padding='same', activation='relu'))
model.add(layers.MaxPooling2D((2, 2)))
model.add(tf.keras.layers.Dropout(0.5))
model.add(layers.Conv2D(64, (3, 3), activation='relu'))
model.add(layers.MaxPooling2D((2, 2)))
model.add(tf.keras.layers.Dropout(0.5))
model.add(layers.Conv2D(64, (3, 3), activation='relu'))


model.summary()

model.add(layers.Flatten())
model.add(layers.Dense(64, activation='relu'))
model.add(layers.Dense(7))


#model.summary()
model.compile(optimizer='adam',loss=tf.keras.losses.CategoricalCrossentropy(from_logits=True),metrics=['accuracy'])
lr_reducer = tf.keras.callbacks.ReduceLROnPlateau(monitor='val_loss', factor=0.9, patience=3) #monitors the validation loss for signs of a plateau and then alter the learning rate by the specified factor if a plateau is detected

early_stopper = tf.keras.callbacks.EarlyStopping(monitor='val_accuracy', min_delta=0, patience=6, mode='auto')  #This will monitor and stop the model training if it is not further converging

checkpointer = tf.keras.callbacks.ModelCheckpoint('C:\\Users\\rtlum\\Documents\\DataSci_Projects\\PythonTensorFlowProjects\\Datasets\\FER2013_Model_Weights\\Model\\weights.hd5', monitor='val_loss', verbose=1, save_best_only=True) #This allows checkpoints to be saved each epoch just in case the model stops training

epochs = 100
batch_size = 64
learning_rate = 0.001

model.fit(
          train_data,
          train_labels,
          epochs = epochs,
          batch_size = batch_size,
          validation_split = 0.2,
          shuffle = True,
          callbacks=[lr_reducer, checkpointer, early_stopper]
          )

下面是我如何调用 model.predict() 并传入单个图像进行预测：

    model = tf.keras.models.load_model('Model\\weights.hd5')
    img = Image.open(test_image).convert('L')
    img = img.resize([48, 48])
    image_data = np.asarray(img, dtype=np.uint8)
    #image_data = np.resize(img,3072)
    image_data = image_data / 255
    image_data_test = image_data.reshape((1, 48, 48, 1))
    class_names = ['Angry','Disgust','Fear','Happy','Sad','Surprise','Neutral']
    x = model.predict(image_data_test)
    app.logger.info(x)
    image_pred = np.argmax(x)
    y = round(x[0][np.argmax(x)], 2)
    confidence = y * 100
    print(class_names[image_pred], confidence)

最后，下面是我从 model.predict() 收到的 class 概率：

>>> x = model.predict(image_data_test)
>>> x
array([[ 1.0593076 , -3.5140653 ,  0.7505076 ,  2.1341033 ,  0.02394461,
        -0.08749148,  0.6640976 ]], dtype=float32)

Answer 1

您的最后一层model.add(layers.Dense(7))正在使用线性激活 function。 要获得 7 个类别的概率，您应该使用softmax激活。

将最后一层更改为

model.add(layers.Dense(7 , activation='softmax'))

Answer 2

添加激活层以将您的 output 值转换为 [0,1] 的值

Tf.keras model.predict() 返回大于 1 的 class 概率？

问题描述

2 个解决方案

解决方案1
5 已采纳 2020-05-28 02:29:22

解决方案2
-1 2020-05-28 02:27:26

Tf.keras model.predict() 返回大于 1 的 class 概率？

问题描述

2 个解决方案

解决方案1 5 已采纳 2020-05-28 02:29:22

解决方案2 -1 2020-05-28 02:27:26

解决方案1
5 已采纳 2020-05-28 02:29:22

解决方案2
-1 2020-05-28 02:27:26