简体繁体中英

scikit-learn: Issues on RFECV example

原文 2014-01-16 13:27:45 9 1 python/ scikit-learn/ feature-selection/ rfe

I'm having a difficulty in understanding the given RFECV example in current documentation. In the plot it's been written as "nb of misclassifications", so i expect it to be "lower the better". But in the example plot the best has been chosen as the highest cross-validation score. So i naturally expect it to be something related to accuracy (scoring says accuracy in the code anyways). But then how it becomes higher than 1?

I am a bit confused on how to interpret these results. I would appreciate any help on this.

Thanks!

1 answers

RFECV has a useful verbose option. Running with verbose=2 , you can see, that for a 2-fold cross-value check, as in example, grid_scores_ holds sum of both folds scores.

In general, for a n-fold check, grid_scores_ is sum of folds scores divided by n-1 , see in code . It seems to be a bug; see somewhat relevant issue on the tracker .

scikit-learn RFECV array with 0 samples

Score of RFECV() in python scikit-learn

How to do RFECV in scikit-learn with KFold, not StratifiedKFold?

scikit-learn denoising example in python

Example getting started with scikit-learn

weight issues in scikit-learn's adaboost

Tensorflow DNNClassifier and scikit-learn GridSearchCV issues

Memory issues using ARDRegression in scikit-learn

Scikit-Learn manually specifying .max_features in RFECV()-how many features get ranked?

Scikit-learn - feature reduction using RFECV and GridSearch. Where are the coefficients stored?

暂无

The technical post webpages of this site follow the CC BY-SA 4.0 protocol. If you need to reprint, please indicate the site URL or the original address.Any question please contact:yoyou2525@163.com.

Related Question scikit-learn RFECV array with 0 samples Score of RFECV() in python scikit-learn How to do RFECV in scikit-learn with KFold, not StratifiedKFold? scikit-learn denoising example in python Example getting started with scikit-learn weight issues in scikit-learn's adaboost Tensorflow DNNClassifier and scikit-learn GridSearchCV issues Memory issues using ARDRegression in scikit-learn Scikit-Learn manually specifying .max_features in RFECV()-how many features get ranked? Scikit-learn - feature reduction using RFECV and GridSearch. Where are the coefficients stored?

Related Tags

scikit-learn: Issues on RFECV example

Question

1 answers

solution1 1 2014-01-16 14:28:56

solution1
1 2014-01-16 14:28:56