简体繁体中英

What does Random Forest do with unseen data?

原文 2016-08-16 16:57:42 0 1 python/ machine-learning/ scikit-learn/ random-forest

When I built my random forest model using scikit learn in python, I set a condition (where clause in sql query) so that the training data only contain values whose value is greater than 0.

I am curious to know how random forest handles test data whose value is less than 0, which the random forest model has never seen before in the training data.

1 answers

They will be treated in the same manner as the minimal value already encountered in the training set. RF is just a bunch of voting decision trees, and (basic) DTs can only form decisions in form of "if feature X is > then T go left, otherwise go right". Consequently, if you fit it to data which, for a given feature, has only values in [0, inf], it will either not use this feature at all or use it in a form given above (as decision of form "if X is > than T", where T has to be from (0, inf) to make any sense for the training data). Consequently if you simply take your new data and change negative values to "0", the result will be identical.

Random Forest does not classify

What does the “verbosity” parameter of a random forest mean? (sklearn)

How do I interpret my Random Forest Regression accuracy data?

Random Forest Classifier for Categorical Data?

Scikit Random Forest Classifier does not evaluate to True

Using Sklearn random forest for feature selection does not give me expected outcome when having categorical data

How to do cross-validation on random forest?

Recommendations for preventing data leakage in Random Forest Regressor

Dealing with big data to perform random forest classification

Random Forest on Panel Data using Python

暂无

The technical post webpages of this site follow the CC BY-SA 4.0 protocol. If you need to reprint, please indicate the site URL or the original address.Any question please contact:yoyou2525@163.com.

Related Question Random Forest does not classify What does the “verbosity” parameter of a random forest mean? (sklearn) How do I interpret my Random Forest Regression accuracy data? Random Forest Classifier for Categorical Data? Scikit Random Forest Classifier does not evaluate to True Using Sklearn random forest for feature selection does not give me expected outcome when having categorical data How to do cross-validation on random forest? Recommendations for preventing data leakage in Random Forest Regressor Dealing with big data to perform random forest classification Random Forest on Panel Data using Python

Related Tags

What does Random Forest do with unseen data?

Question

1 answers

solution1 0 ACCPTED 2016-08-16 17:06:31

solution1
0 ACCPTED 2016-08-16 17:06:31