python - How can I improve the accuracy of a random forest multiclass classification model?

n_estimators

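This is the number of trees to build before taking the majority vote or averaging the predictions. More trees generally give better and more stable predictions, although the gain levels off as the forest grows, and training and prediction get slower. Choose the highest value your processor can handle. Because your dataset is large, each run will take longer, but it is worth trying.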
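As a rough illustration of that trade-off, here is a minimal sketch that compares a few values of n_estimators by cross-validated accuracy. The synthetic 4-class dataset, the helper name score_for_trees, and the tree counts are assumptions for illustration only; substitute your own X and y.

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import cross_val_score

# Synthetic 4-class data standing in for the real dataset (an assumption).
X, y = make_classification(
    n_samples=600, n_features=20, n_informative=8,
    n_classes=4, random_state=0,
)

def score_for_trees(n_trees):
    """Mean 3-fold cross-validated accuracy for a forest with n_trees trees."""
    model = RandomForestClassifier(
        n_estimators=n_trees, n_jobs=-1, random_state=0
    )
    return cross_val_score(model, X, y, cv=3, scoring="accuracy").mean()

# Hypothetical candidate values; widen the range if training time allows.
tree_counts = [10, 50, 200]
scores = {n: score_for_trees(n) for n in tree_counts}

for n, acc in scores.items():
    print(f"n_estimators={n}: accuracy={acc:.3f}")
```

If the accuracy stops improving between two values, the smaller one is usually the better choice, since it trains and predicts faster.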

max_features

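This is the maximum number of features the random forest is allowed to try. In scikit-learn the limit applies each time a tree looks for the best split, not once per tree. There are several ways to set it in Python. A few of them are: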

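Auto/None : None places no restriction on the individual trees, so every feature is considered at each split. Note that in scikit-learn "auto" did not mean "all features" for a classifier: it was an alias for "sqrt", and it was removed in version 1.3, so write None or "sqrt" explicitly.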

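sqrt : This option takes the square root of the total number of features. For example, if there are 100 variables in total, only 10 of them are considered at each split. "log2" is a similar option for max_features.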

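0.2 : This option lets the random forest consider 20% of the variables at each split. Any fraction between 0 and 1 can be assigned, giving the share of the features to consider (0.5 means 50%, for example).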
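The options above can be compared side by side. The following is a minimal sketch, assuming a synthetic 3-class dataset with 100 features to match the example above; the dataset and the names options, resolved, and accuracy are illustrative assumptions. It reads the number of features each setting resolves to from the first fitted tree's max_features_ attribute and reports cross-validated accuracy for each setting.

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import cross_val_score

# Synthetic 3-class data with 100 features (an assumption for illustration).
X, y = make_classification(
    n_samples=300, n_features=100, n_informative=10,
    n_classes=3, random_state=0,
)

options = ["sqrt", "log2", 0.2, None]
resolved = {}   # features considered per split for each option
accuracy = {}   # mean 3-fold cross-validated accuracy for each option

for opt in options:
    model = RandomForestClassifier(
        n_estimators=20, max_features=opt, n_jobs=-1, random_state=0
    )
    # Cross-validate on clones of the model, then fit once to inspect a tree.
    accuracy[str(opt)] = cross_val_score(model, X, y, cv=3).mean()
    model.fit(X, y)
    resolved[str(opt)] = model.estimators_[0].max_features_

for name in resolved:
    print(f"max_features={name}: {resolved[name]} features per split, "
          f"accuracy={accuracy[name]:.3f}")
```

Which setting scores best depends on the data, so it is worth tuning max_features together with n_estimators rather than relying on a single default.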
