random forest regression output calculation

Original post 2018-11-17 02:55:20 8 1 r/ random-forest

Hi, this is a purely theoretical question that I can't get my head around (and I could be completely wrong).

With random forest regression, you grow n trees. Each tree uses a subset of the data and, in some cases, a subset of the available variables to predict the dependent variable. The average of the n trees' predictions is taken as the predicted value. However, is there any need to look at the distribution of predictions at the individual tree level? Are we able to obtain a number that provides some certainty about the overall predicted value? I would assume that more consistent predictions at the individual tree level would be preferred over a wide variety of numbers?

Thanks in advance

1 answer

This method of determining variable importance has some drawbacks. For data including categorical variables with different numbers of levels, random forests are biased in favor of the attributes with more levels. Methods such as partial permutations and growing unbiased trees can be used to address the problem. If the data contain groups of correlated features of similar relevance for the output, then smaller groups are favored over larger groups.
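On the question's actual point, the spread of the per-tree predictions: here is a minimal sketch, not a definitive implementation. It assumes a toy "forest" of depth-1 regression trees (stumps) fitted on bootstrap samples of synthetic data. The helper names `fit_stump` and `predict_stump`, the data-generating function, and all sizes are made up for illustration. It shows that the forest prediction is the mean of the per-tree predictions, and that their standard deviation or empirical quantiles give a rough indication of how much the trees agree. This spread is a heuristic; it is not a calibrated prediction interval.

```python
import random
import statistics


def fit_stump(xs, ys):
    """Fit a depth-1 regression tree: one threshold on x, one mean per side."""
    best = None
    # Skip the largest x so the right-hand side is never empty.
    for threshold in sorted(set(xs))[:-1]:
        left = [y for x, y in zip(xs, ys) if x <= threshold]
        right = [y for x, y in zip(xs, ys) if x > threshold]
        left_mean = statistics.fmean(left)
        right_mean = statistics.fmean(right)
        sse = (sum((y - left_mean) ** 2 for y in left)
               + sum((y - right_mean) ** 2 for y in right))
        if best is None or sse < best[0]:
            best = (sse, threshold, left_mean, right_mean)
    if best is None:  # every x identical: fall back to a constant prediction
        mean_y = statistics.fmean(ys)
        return (xs[0], mean_y, mean_y)
    return best[1:]


def predict_stump(stump, x):
    threshold, left_mean, right_mean = stump
    return left_mean if x <= threshold else right_mean


# Synthetic data (an assumption for illustration): y = 2x + Gaussian noise.
rng = random.Random(0)
xs = [rng.uniform(0, 10) for _ in range(60)]
ys = [2.0 * x + rng.gauss(0, 1) for x in xs]

# Grow each tree on a bootstrap sample (rows drawn with replacement).
n_trees = 50
forest = []
for _ in range(n_trees):
    idx = [rng.randrange(len(xs)) for _ in range(len(xs))]
    forest.append(fit_stump([xs[i] for i in idx], [ys[i] for i in idx]))

# Per-tree predictions for one new point, then their mean and spread.
x_new = 5.0
tree_preds = [predict_stump(stump, x_new) for stump in forest]
forest_pred = statistics.fmean(tree_preds)   # the usual forest output
spread = statistics.stdev(tree_preds)        # disagreement between trees
cuts = statistics.quantiles(tree_preds, n=20)
lower, upper = cuts[0], cuts[-1]             # empirical 5% and 95% points

print(f"forest prediction: {forest_pred:.2f}")
print(f"per-tree std dev:  {spread:.2f}")
print(f"5%-95% tree range: [{lower:.2f}, {upper:.2f}]")
```

With a real library the same idea applies to the fitted trees directly: in R's randomForest package, `predict(..., predict.all = TRUE)` returns the individual tree predictions alongside the aggregate, and scikit-learn's `RandomForestRegressor` exposes its fitted trees through the `estimators_` attribute.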

regression with random forest on imbalanced data

Random forest regression - cumulative MSE?

Random Forest vs Logistic Regression

RMSE calculation for random forest in R?

Random forest output interpretation

How to get random forest regression performance output in Python like that produced in R?

Finding how variable affect output of time-series random-forest regression model

Random Forest, SVM and Multinomial Logistic Regression with R

Error using Caret Package for Random Forest (Regression)

Compute AUC of a random uniform forest in regression



solution1 0 2018-11-17 14:23:43
