为什么 sklearn.metrics.confusion_matrix 和 sklearn.metrics.plot_confusion_matrix 的 function 定义不一致？

Question

I am using sklearn and I noticed that the arguments of sklearn.metrics.plot_confusion_matrix and sklearn.metrics.confusion_matrix are inconsistent.我正在使用 sklearn，我注意到sklearn.metrics.plot_confusion_matrix和sklearn.metrics.confusion_matrix的 arguments 不一致。 plot_confusion_matrix uses estimator and X to construct y_pred , while confusion_matrix has y_pred as argument directly. plot_confusion_matrix使用estimator和X来构造y_pred ，而confusion_matrix直接将y_pred作为参数。

What may be the reason for this inconsistency?这种不一致的原因可能是什么？

Partial function definitions:部分 function 定义：

sklearn.metrics.plot_confusion_matrix(estimator, X, y_true, ...) [where X should be X_test] sklearn.metrics.plot_confusion_matrix(estimator, X, y_true, ...) [其中 X 应该是 X_test]
sklearn.metrics.confusion_matrix(y_true, y_pred, ...)

Sources:资料来源：

plot_confusion_matrix plot_confusion_matrix
confusion_matrix 混淆矩阵

Answer 1

Yes, you are right that there isn't a consistent API design for this but there is an on going discussion for this issue here .是的，你是对的，没有一致的 API 设计，但这里有一个关于这个问题的持续讨论。

One quick work around is ConfusionMatrixDisplay .一种快速的解决方法是ConfusionMatrixDisplay 。

example:例子：

from sklearn.datasets import make_classification
from sklearn.preprocessing import StandardScaler
from sklearn.pipeline import make_pipeline
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split

X, y = make_classification(random_state=1)
X_train, X_test, y_train, y_test = train_test_split(X, y, stratify=y)

clf = make_pipeline(StandardScaler(), LogisticRegression(random_state=0))
clf.fit(X_train, y_train)

from sklearn.metrics import confusion_matrix
from sklearn.metrics import ConfusionMatrixDisplay

y_pred = clf.predict(X_test)
cm = confusion_matrix(y_test, y_pred)

cm_display = ConfusionMatrixDisplay(cm, [0,1]).plot()

为什么 sklearn.metrics.confusion_matrix 和 sklearn.metrics.plot_confusion_matrix 的 function 定义不一致？

问题描述

1 个解决方案

解决方案1
2 已采纳 2020-04-18 07:34:42

为什么 sklearn.metrics.confusion_matrix 和 sklearn.metrics.plot_confusion_matrix 的 function 定义不一致？

问题描述

1 个解决方案

解决方案1 2 已采纳 2020-04-18 07:34:42

解决方案1
2 已采纳 2020-04-18 07:34:42