特征选择（如何解释结果）？

Question

Let's say i have a dataset and i want to select features which correspond to prediction result better than others. 假设我有一个数据集，并且我想选择比预测结果更好的特征。 I have implemented some Feature Ranking tests and here are the results: 我已经实施了一些功能排名测试，结果如下：

For prediction model I selected features with best "Mean" value. 对于预测模型，我选择了具有最佳“均值”值的特征。

X = oil_10[['Sidetrack Code','Well Type Code','Well Status  
Code','Producing Formation','Water Produced, bbl','County']]

Here is the prediction model result with "Best chosen features": 这是带有“最佳选择的特征”的预测模型结果：

RandomForestRegressor
0.390502562474

And here is the result of prediction model with all dataset features without any selection: 这是具有所有数据集功能而没有任何选择的预测模型的结果：

RandomForestRegressor
0.741878611892

How to use Feature Ranking results to implement best prediction result? 如何使用功能排行结果实现最佳预测结果？

Answer 1

好吧，我试图用这种方式解决我的问题：我只删除了最不重要的功能（这些功能的平均重要性值小于0.15），而准确度仍保持75％，但是预测模型现在的运行速度更快。

特征选择（如何解释结果）？

问题描述

1 个解决方案

解决方案1
0 2017-11-02 14:49:01

特征选择（如何解释结果）？

问题描述

1 个解决方案

解决方案1 0 2017-11-02 14:49:01

解决方案1
0 2017-11-02 14:49:01