简体繁体中英

Random Forest Feature Importance Robustness with Python

原文 2020-02-11 01:42:17 4 1 python/ random-forest

I am using Random Forest from Sklearn for feature importance. However, the importance of features may change by changing the random_state parameter in RF. I am wondering if there is any way to get robust feature importance with RF?

1 answers

it is because of the principal of Random Forest algorithm. RF finds the optimal by heuristic greedy way. And working on such heuristic way, it mitigates multiple trees with randomly sampled features and samples. And here random_state gives random numbers for sampling. If you see below documents, it says

If int, random_state is the seed used by the random number generator; If RandomState instance, random_state is the random number generator; If None, the random number generator is the RandomState instance used by np.random.

[ https://scikit-learn.org/stable/modules/generated/sklearn.tree.DecisionTreeClassifier.html][1]

So if you set random_state with fixed value, you may have fixed value for feature importance. It does not guarantee robustness because RF is not the algorithms guarantee robustness, but gives answer based on its heuristic finding.

Random Forest Feature Importance Python

Random Forest Feature Importance using Python

Random Forest Feature Importance Chart using Python

How to plot feature importance for random forest in python

Feature Importance for Random Forest Regressor in Python

Random Forest feature importance per value of a column in Python

Getting Feature importance in multioutput random forest regressor

Order of importance for each level of a feature in Random Forest

Random Forest Regressor Feature Importance all zero

Feature selection by Pearson correlation or Feature importance in Random Forest

暂无

The technical post webpages of this site follow the CC BY-SA 4.0 protocol. If you need to reprint, please indicate the site URL or the original address.Any question please contact:yoyou2525@163.com.

Related Question Random Forest Feature Importance Python Random Forest Feature Importance using Python Random Forest Feature Importance Chart using Python How to plot feature importance for random forest in python Feature Importance for Random Forest Regressor in Python Random Forest feature importance per value of a column in Python Getting Feature importance in multioutput random forest regressor Order of importance for each level of a feature in Random Forest Random Forest Regressor Feature Importance all zero Feature selection by Pearson correlation or Feature importance in Random Forest

Related Tags

Random Forest Feature Importance Robustness with Python

Question

1 answers

solution1 0 ACCPTED 2020-02-11 02:04:05

solution1
0 ACCPTED 2020-02-11 02:04:05