简体   繁体   English

从测试数据集中获得最佳准确性

[英]To get best accuracy from Testing dataset

I am using Random forest classifier to classify data into 4 labels.我正在使用随机森林分类器将数据分类为 4 个标签。 There are a total of 20 features on which the model is being trained.训练模型的总共有 20 个特征。 I am observing an accuracy of around 45-47 % when Testing dataset is used.当使用测试数据集时,我观察到大约 45-47% 的准确度。 Although on prediction of Training dataset I'm getting an accuracy of 100%.尽管在训练数据集的预测中我得到了 100% 的准确率。 Also I'm using the best parameters extracted using Grid Search approach.我还使用了使用网格搜索方法提取的最佳参数。 Can anyone explain why such kind of biasing between training and testing prediction.谁能解释为什么训练和测试预测之间存在这种偏差。 How can I enhance the scenario?我怎样才能增强场景?

PS: I'm new to Machine Learning PS:我是机器学习的新手

There might be many Reasons 1)one might be, model is over fitting.可能有很多原因 1) 一个可能是,模型过拟合。 you can try doing hyper parameter optimization to find the optimal value where your model performs better.您可以尝试进行超参数优化以找到模型性能更好的最佳值。 2) Since you are using accuracy as a performance parameter, You can check whether data set is balanced or not. 2)由于您使用准确性作为性能参数,您可以检查数据集是否平衡。 If you are using imbalanced data set,you can use ROC,AUC Characteristics.如果你使用的是不平衡数据集,你可以使用 ROC,AUC 特征。

暂无
暂无

声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.

相关问题 如何测试不在数据集中的图像的准确性 - How to test accuracy from images that were not in the dataset 无法在自己的数字图像数据集中为 mnist 获得准确性 - Can't get accuracy in own digit image dataset for mnist Python 多次复制数据集并重新洗牌以获得更高准确度的缺点 - Python Downside of duplicating dataset several times and reshuffle it to get higher accuracy 加载保存的模型后,在同一评估数据集上获得不同的精度 - Get different accuracy on the same evaluating dataset after loading saved model 葡萄酒质量数据集的培训准确性无法达到55%以上 - Cannot get above 55% TRAINING accuracy on wine quality dataset Tensorflow 训练准确率和损失与评估相同数据集不同 - Tensorflow training accuracy and loss different from evaluation of the same dataset 在 CIFAR 10 数据集上使用 VGG16 进行迁移学习:训练和测试精度非常高但预测错误 - Transfer Learning Using VGG16 on CIFAR 10 Dataset: Very High Training and Testing Accuracy But Wrong Predictions 如何使用 Detectron2 的 tensorboard 获得测试精度? - How can I get testing accuracy using tensorboard for Detectron2? 在训练集上进行测试时,Keras Stateful LSTM的准确性较低 - Keras Stateful LSTM get low accuracy when testing on training set 为什么我的 XBGoost model 对训练和测试数据集有很好的准确性,但在预测保留数据集时却很差? - Why does my XBGoost model have a good accuracy for training and testing dataset, but poor one for predicting an held out dataset?
 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM