Sklearn：有没有办法为管道定义特定的分数类型？

Question

I can do this:我可以做这个：

model=linear_model.LogisticRegression(solver='lbfgs',max_iter=10000)
kfold = model_selection.KFold(n_splits=number_splits,shuffle=True, random_state=random_state)
scalar = StandardScaler()
pipeline = Pipeline([('transformer', scalar), ('estimator', model)])
results = model_selection.cross_validate(pipeline, X, y, cv=kfold, scoring=score_list,return_train_score=True)

where score_list can be something like ['accuracy','balanced_accuracy','precision','recall','f1'] .其中 score_list 可以类似于['accuracy','balanced_accuracy','precision','recall','f1'] 。

I also can do this:我也可以这样做：

kfold = model_selection.KFold(n_splits=number_splits,shuffle=True, random_state=random_state)
scalar = StandardScaler()
pipeline = Pipeline([('transformer', scalar), ('estimator', model)])
for i, (train, test) in enumerate(kfold.split(X, y)):
    pipeline.fit(self.X[train], self.y[train])
    pipeline.score(self.X[test], self.y[test])

However, I am not able to change the score type for pipeline in the last line.但是，我无法在最后一行更改管道的分数类型。 How can I do that?我怎样才能做到这一点？

Answer 1

score method is always accuracy for classification and r2 score for regression. score方法始终是分类的accuracy和回归的r2分数。 There is no parameter to change that.没有参数可以改变它。 It comes from the Classifiermixin and RegressorMixin .它来自Classifiermixin和RegressorMixin 。

Instead, when we need other scoring options, we have to import it from sklearn.metrics like the following.相反，当我们需要其他评分选项时，我们必须从sklearn.metrics中导入它，如下所示。

from sklearn.metrics import balanced_accuracy

y_pred=pipeline.score(self.X[test])
balanced_accuracy(self.y_test, y_pred)

Sklearn：有没有办法为管道定义特定的分数类型？

问题描述

1 个解决方案

解决方案1
3 已采纳 2020-04-29 08:07:43

Sklearn：有没有办法为管道定义特定的分数类型？

问题描述

1 个解决方案

解决方案1 3 已采纳 2020-04-29 08:07:43

解决方案1
3 已采纳 2020-04-29 08:07:43