Keras混淆矩陣：ValueError：分類指標無法處理多類多輸出和二進制目標的混合

Question

我敢肯定，我做錯了一些簡單的事情。 我最近在代碼中添加了一個混淆矩陣，它給出了錯誤“ ValueError：分類指標無法處理多類多輸出和二進制目標的混合情況”。 y訓練值被編碼為“基本事實”。 我以為我的目標應該是二進制輸出，還是應該一鍵編碼？

我可能使用了錯誤的損失屬性嗎？ 有什么想法嗎？


def get_model(word_length):
    dim1 = 28
    dim2 = 28
    input_signal = Input(shape=(dim1, dim2, 2))
    x = Conv2D(64, (3, 3), activation='relu', padding='same')(input_signal)
    x = MaxPooling2D((2, 2), padding='same')(x)
    x = Conv2D(32, (3, 3), activation='relu', padding='same')(x)
    x = MaxPooling2D((2, 2), padding='same')(x)
    x = Conv2D(32, (3, 3), activation='relu', padding='same')(x)
    x = MaxPooling2D((2, 2), padding='same')(x)
    x = Flatten()(x)
    x = Dense(128, activation='relu')(x)
    x = Dense(word_length, activation='softmax')(x)

    model = Model(inputs=input_signal, outputs=x)
    model.summary()
    return model


if __name__ == "__main__":

    mlflow.keras.autolog()

trainPath = "Data/"
totalsamples = 0

train = get_data(trainPath)
X = train.path
labelbinarizer = LabelBinarizer()
y = labelbinarizer.fit_transform(train.word)

X, Xt, y, yt = train_test_split(X, y, test_size=0.3, stratify=y)
batchsize = 10

train_gen = batch_generator(X, y, batchsize)
valid_gen = batch_generator(Xt, yt, batchsize)

tensorboard = TensorBoard(log_dir='./logs/{}'.format(time.time()),
                          batch_size=32)

word_length = len(train.word.unique())
model = get_model(word_length)
model.compile(loss='categorical_crossentropy',
              optimizer=Adam(), metrics=['accuracy'])


step_train_gen = X.shape[0] // batchsize
step_valid_gen = Xt.shape[0] // batchsize

steps = step_train_gen
valid_steps = step_valid_gen

history = model.fit_generator(
            generator=train_gen,
            epochs=7,
            steps_per_epoch=steps,
            validation_data=valid_gen,
            validation_steps=valid_steps,
            callbacks=[tensorboard])

history = history.history
print('Validation accuracy: {acc}, loss: {loss}'.format(
        acc=history['val_acc'][-1], loss=history['val_loss'][-1]))


predictions = model.predict_generator(valid_gen, verbose=0, steps=valid_steps)

y_pred = np.argmax(predictions, axis=1)

print(confusion_matrix(labels, y_pred))

Answer 1

由於我沒有看到完整的代碼，因此我假設以下內容：

confusion_matrix從sklearn.metrics導入
labels = yt

現在， confusion_matrix不喜歡一鍵編碼的輸入。 由於模型輸出是一鍵編碼的，因此以下方法可以解決您的問題：

y_pred_labels = y_pred.argmax(1)
confusion_matrix(yt, y_pred_labels)

請注意以下運行，指示實現支持的格式：

# 1. Test run with labels
In [1]: from sklearn.metrics import confusion_matrix                                                                                                                                                               
In [2]: y_true = [2, 0, 2, 2, 0, 1]                                                                                                                                                                                
In [3]: y_pred = [0, 0, 2, 2, 0, 2]                                                                                                                                                                                
In [4]: confusion_matrix(y_true, y_pred)                                                                                                                                                                           
Out[4]: 
array([[2, 0, 0],
       [0, 0, 1],
       [1, 0, 2]])

# 2. Test run with one-hot encoded vectors
In [5]: y_true = [[1,0],[0,0], [1,0], [1,0], [0, 0], [0, 1]]
In [6]: y_pred = [[0, 0], [0, 0], [1, 0], [1, 0], [0, 0], [1, 0]]
In [7]: confusion_matrix(y_true, y_pred)                                                                                                                                                                           
---------------------------------------------------------------------------
ValueError                                Traceback (most recent call last)
<ipython-input-7-1a5d7236d395> in <module>
----> 1 confusion_matrix(y_true, y_pred)
ValueError: multiclass-multioutput is not supported

# 3. Test run with categorical input
In [8]: y_pred = ["zero", "zero", "two", "two", "zero", "two"]    
In [9]: y_true = ["two", "zero", "two", "two", "zero", "one"]
In [10]: confusion_matrix(y_true, y_pred, labels=["zero", "one", "two"])                                                                                                                                           
Out[10]: 
array([[2, 0, 0],
       [0, 0, 1],
       [1, 0, 2]])

Keras混淆矩陣：ValueError：分類指標無法處理多類多輸出和二進制目標的混合

問題描述

1 個解決方案

解決方案1
0 2019-11-25 23:48:09

Keras混淆矩陣：ValueError：分類指標無法處理多類多輸出和二進制目標的混合

問題描述

1 個解決方案

解決方案1 0 2019-11-25 23:48:09

解決方案1
0 2019-11-25 23:48:09