
Machine learning parameter tuning using partitioned benchmark dataset

I know this is very basic; however, I'm really confused and would like to understand parameter tuning better.

I'm working on a benchmark dataset that is already partitioned into three splits: training, development, and testing. I would like to tune my classifier's parameters using GridSearchCV from sklearn.

Which partition is the correct one to use for tuning the parameters: the development split or the training split?

I've seen researchers in the literature mention that they "tuned the parameters using GridSearchCV on the development split"; another example is found here.

Do they mean they trained on the training split and then tested on the development split? Or do ML practitioners usually mean they perform the GridSearchCV entirely on the development split?

I'd really appreciate a clarification. Thanks.

Usually, in a 3-way split you train a model on the training set, then validate it on the development set (also called the validation set) to tune hyperparameters, and then, once all tuning is complete, you perform a final evaluation of the model on the previously unseen testing set (also known as the evaluation set).
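With GridSearchCV specifically, you can make it respect a fixed train/development partition instead of its default k-fold cross-validation by passing a PredefinedSplit. Here is a minimal sketch; the random X_train/X_dev/X_test arrays are placeholders for your benchmark's fixed splits, and SVC with a small C grid is just an illustrative estimator:

import numpy as np
from sklearn.model_selection import GridSearchCV, PredefinedSplit
from sklearn.svm import SVC

# Placeholder arrays standing in for the benchmark's fixed splits.
rng = np.random.RandomState(0)
X_train, y_train = rng.rand(100, 5), rng.randint(0, 2, 100)
X_dev, y_dev = rng.rand(30, 5), rng.randint(0, 2, 30)
X_test, y_test = rng.rand(30, 5), rng.randint(0, 2, 30)

# Stack train + dev; -1 marks rows that are always training data,
# 0 marks rows that form the single validation "fold" (the dev split).
X = np.concatenate([X_train, X_dev])
y = np.concatenate([y_train, y_dev])
test_fold = np.concatenate([np.full(len(X_train), -1),
                            np.zeros(len(X_dev), dtype=int)])

# GridSearchCV now fits every candidate on the training rows and
# scores it on the development rows, instead of doing k-fold CV.
grid = GridSearchCV(SVC(), {"C": [0.1, 1, 10]}, cv=PredefinedSplit(test_fold))
grid.fit(X, y)

print(grid.best_params_)
# One-time final evaluation on the untouched test split.
print(grid.best_estimator_.score(X_test, y_test))

Note that with the default refit=True, best_estimator_ is refit on train and dev combined before the final test-set evaluation.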

In a two-way split you have only a train set and a test set, so tuning and the final evaluation end up being performed on the same test set.
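In the two-way case it is worth noting GridSearchCV's default behaviour: it performs k-fold cross-validation inside whatever data you pass to fit, so you can tune on the training set alone and keep the test set for the final score. A minimal sketch, with make_classification standing in for real data and the same illustrative SVC grid:

from sklearn.datasets import make_classification
from sklearn.model_selection import GridSearchCV, train_test_split
from sklearn.svm import SVC

# Toy data split two ways only: train and test.
X, y = make_classification(n_samples=200, random_state=0)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

# With no separate development split, GridSearchCV carves validation
# folds out of the training data itself (5-fold CV by default).
grid = GridSearchCV(SVC(), {"C": [0.1, 1, 10]}, cv=5)
grid.fit(X_train, y_train)

print(grid.best_params_)
print(grid.best_estimator_.score(X_test, y_test))  # final evaluation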
