简体   繁体   中英

Weka Attribute Selection

I'm trying to perform Attribute Selection in Weka. I would like to use InfoGainAttributeEval as an evaluator, because I read that it is equivalent to mutual information, and Ranker as a search method. Should I perform attribute selection to both training and test set? Also, how can I choose the correct value for the N parameter?

Thanks a lot for your time,

Nadia

Applying attribute selection separately on the train and test might result in a selection of different attributes, thereby making them incompatible. Thus to make sure that both sets have the same attributes you need to apply attribute selection on your whole dataset. Once you have selected the most useful attributes you split your data into a train and test set.

As to which value of -N to use, I would use your total amount of attributes. This will result in a ranked list of all your attributes and you can evaluate the different scores of all attributes yourself. You might then spot a clear threshold separating the attributes holding any useful information to train a classifier from attributes which add nothing. I would then set this threshold using the -T option.

The technical post webpages of this site follow the CC BY-SA 4.0 protocol. If you need to reprint, please indicate the site URL or the original address.Any question please contact:yoyou2525@163.com.

 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM