[英]Feature Selection(How to interpret the results)?
Let's say i have a dataset and i want to select features which correspond to prediction result better than others. 假设我有一个数据集,并且我想选择比预测结果更好的特征。 I have implemented some Feature Ranking tests and here are the results:
我已经实施了一些功能排名测试,结果如下:
For prediction model I selected features with best "Mean" value. 对于预测模型,我选择了具有最佳“均值”值的特征。
X = oil_10[['Sidetrack Code','Well Type Code','Well Status
Code','Producing Formation','Water Produced, bbl','County']]
Here is the prediction model result with "Best chosen features": 这是带有“最佳选择的特征”的预测模型结果:
RandomForestRegressor
0.390502562474
And here is the result of prediction model with all dataset features without any selection: 这是具有所有数据集功能而没有任何选择的预测模型的结果:
RandomForestRegressor
0.741878611892
How to use Feature Ranking results to implement best prediction result? 如何使用功能排行结果实现最佳预测结果?
好吧,我试图用这种方式解决我的问题:我只删除了最不重要的功能(这些功能的平均重要性值小于0.15),而准确度仍保持75%,但是预测模型现在的运行速度更快。
声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.