简体繁体中英

Machine learning algorithm to classify only positive and unlabeled data

原文 2014-04-04 20:59:36 6 1 algorithm/ machine-learning/ weka

I am trying to classify text with only positive features and unlabeled data. I just want the algorithm to identify the positive data and want to mark everything else as negative. What would be a good machine learning algorithm to classify such data? I tried using different algorithms in Weka but almost all classifiers give a lot of false positives.

1 answers

If you believe that the unlabelled data is mostly negatives, then probably the best thing to do is to label all unlabelled data as "negative" and run your classifier of choice. Note that if you get an unlabelled testing data point predicted to be positive, this does not mean the answer is wrong. Some of your unlabelled data could be positive. So it's hard to judge how well your classifier is doing in your setting. If you believe that your unlabelled data might be biased toward the positives then you're probably better off using so-called "one-class classifiers" on the positive data, there are popular examples including one-class SVM.

Machine Learning Algorithm for Completing Sparse Matrix Data

Machine learning algorithm

Does a machine learning algorithm copy the data it learns from?

How can we use a machine learning algorithm on this type of data?

Machine Learning/Artificial Intelligence - Classify column based on the value / pattern

Suggestions on what machine language algorithm to classify what time a user logs in

Does the dataset size influence a machine learning algorithm?

Machine learning classifying algorithm with “unknown” class

Machine learning algorithm for correlation between indicators

Machine Learning Algorithm for Peer-to-Peer Nodes

暂无

The technical post webpages of this site follow the CC BY-SA 4.0 protocol. If you need to reprint, please indicate the site URL or the original address.Any question please contact:yoyou2525@163.com.

Related Question Machine Learning Algorithm for Completing Sparse Matrix Data Machine learning algorithm Does a machine learning algorithm copy the data it learns from? How can we use a machine learning algorithm on this type of data? Machine Learning/Artificial Intelligence - Classify column based on the value / pattern Suggestions on what machine language algorithm to classify what time a user logs in Does the dataset size influence a machine learning algorithm? Machine learning classifying algorithm with “unknown” class Machine learning algorithm for correlation between indicators Machine Learning Algorithm for Peer-to-Peer Nodes

Related Tags

Machine learning algorithm to classify only positive and unlabeled data

Question

1 answers

solution1 3 2014-04-04 21:10:26

solution1
3 2014-04-04 21:10:26