
How to use feature selection and dimensionality reduction in unsupervised learning?

I've been working on classifying emails from two authors. I was able to do this with supervised learning, using TF-IDF vectorization of the text, PCA, and SelectPercentile feature selection, all with the scikit-learn package.

Now I want to try the same task with unsupervised learning, using the KMeans algorithm to cluster the emails into two groups. I have created a dataset in which each data point is a single line in a Python list. Since I am a newbie to unsupervised learning, I wanted to ask whether I can apply the same dimensionality reduction tools as in the supervised case (TF-IDF, PCA, and SelectPercentile). If not, what are their counterparts? I am using scikit-learn for the code.
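For reference, a minimal sketch of the unsupervised pipeline I have in mind (the toy emails and parameter choices are placeholders):

```python
# Minimal sketch of the clustering pipeline; the toy emails are placeholders.
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.cluster import KMeans

emails = [
    "meeting agenda and budget report",
    "quarterly budget report for the meeting",
    "photos from the beach vacation",
    "plans for the beach trip",
]

# TF-IDF is already unsupervised: it only looks at the texts, never at labels.
vectorizer = TfidfVectorizer(stop_words="english")
X = vectorizer.fit_transform(emails)  # sparse matrix, n_samples x n_features

# Cluster into two groups, one per (hoped-for) author.
kmeans = KMeans(n_clusters=2, n_init=10, random_state=42)
labels = kmeans.fit_predict(X)
print(labels)  # e.g. [0 0 1 1]
```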

I looked around on Stack Overflow but couldn't find a satisfactory answer. I am really stuck at this point.

Please help!

The following dimensionality reduction techniques can be applied in the case of unsupervised learning (a short sketch applying one technique from each family follows the list):

  1. PCA: principal component analysis
    • Exact PCA
    • Incremental PCA
    • Approximate PCA
    • Kernel PCA
    • SparsePCA and MiniBatchSparsePCA
  2. Random projections
    • Gaussian random projection
    • Sparse random projection
  3. Feature agglomeration (cluster.FeatureAgglomeration)
    • Feature scaling: if the features have very different scales or statistical properties, applying preprocessing.StandardScaler beforehand helps feature agglomeration capture the links between related features
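
To make the list concrete, below is a minimal sketch that applies one technique from each family to a TF-IDF matrix before KMeans. One caveat worth flagging: scikit-learn's PCA does not accept sparse input, so for sparse TF-IDF matrices TruncatedSVD (latent semantic analysis) is the usual stand-in. The toy corpus and all parameter choices here are illustrative.

```python
# Sketch: unsupervised dimensionality reduction on TF-IDF features, then KMeans.
# One reducer per family from the list above; the toy corpus is a placeholder.
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.decomposition import TruncatedSVD
from sklearn.random_projection import SparseRandomProjection
from sklearn.cluster import FeatureAgglomeration, KMeans

emails = [
    "meeting agenda and budget report",
    "quarterly budget report for the meeting",
    "photos from the beach vacation",
    "plans for the beach trip",
]
X = TfidfVectorizer(stop_words="english").fit_transform(emails)

# 1. PCA family: TruncatedSVD works directly on the sparse TF-IDF matrix
#    (plain PCA would require densifying it first).
X_svd = TruncatedSVD(n_components=2, random_state=42).fit_transform(X)

# 2. Random projections: project onto a randomly chosen low-dimensional subspace.
#    n_components is set explicitly; the default 'auto' needs far more samples.
X_rp = SparseRandomProjection(n_components=2, random_state=42).fit_transform(X)

# 3. Feature agglomeration: hierarchically merge features that behave similarly.
#    It expects a dense array, hence .toarray().
X_agg = FeatureAgglomeration(n_clusters=2).fit_transform(X.toarray())

# Any of the reduced matrices can then be clustered:
labels = KMeans(n_clusters=2, n_init=10, random_state=42).fit_predict(X_svd)
print(labels)
```

As for SelectPercentile: its usual scoring functions (chi2, f_classif) need class labels, so it has no direct unsupervised equivalent; the closest label-free feature selector in scikit-learn is VarianceThreshold.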

The above are some of the approaches that can be used for dimensionality reduction of large datasets in the case of unsupervised learning. You can read more about the details in the scikit-learn documentation on unsupervised dimensionality reduction.
