简体繁体中英

clustering of tweets using k means algorithm as positive or negative

原文 2016-08-16 11:03:07 2 2 python/ twitter/ machine-learning/ cluster-analysis/ k-means

i have some movie reviews, i need to cluster them on the basis of positive or negative clusters. Using Kmeans is possible. Can anyone give me basic outline of how to start with it. In Python is preferable.

2 answers

you cannot cluster "as positive or negative"

You have labels. Use classification .

k-means will not be able to identify what is "positive". It may find any pattern, eg short vs. long, english vs. spanish tweets etc. - if you are lucky you can identify what it did.

You can start with sklearn package, one of well-known machine learning package. There you can use sklearn.cluster.KMeans.

Here is an exmaple from scikit-learn website .

Though you prefer python, R is also a good statistical tool that can do this. There is a function kmeans(x, centers) . It is builtin function, hence You donot need to import any package. What you need to do are read data and run it:

x = read.table(file,sep='\\t')

y = keman(x, centers=2)

K means Clustering using PySpark

K Means Clustering Algorithm Python Explanation needed

Predicting Values with k-Means Clustering Algorithm

K-Means Clustering Algorithm implementation

How can I use the k-means clustering algorithm using manhattan distance?

Am i clustering users correctly by using sklearn's cosine similarity method and K-means algorithm?

Python: how to compare the similarity between clustering using k-means algorithm?

When using the K-Means Clustering Algorithm, is it possible to have a set of data which results in an Infinite Loop?

K means clustering using weka python

Clustering using k-means in python

暂无

The technical post webpages of this site follow the CC BY-SA 4.0 protocol. If you need to reprint, please indicate the site URL or the original address.Any question please contact:yoyou2525@163.com.

Related Question K means Clustering using PySpark K Means Clustering Algorithm Python Explanation needed Predicting Values with k-Means Clustering Algorithm K-Means Clustering Algorithm implementation How can I use the k-means clustering algorithm using manhattan distance? Am i clustering users correctly by using sklearn's cosine similarity method and K-means algorithm? Python: how to compare the similarity between clustering using k-means algorithm? When using the K-Means Clustering Algorithm, is it possible to have a set of data which results in an Infinite Loop? K means clustering using weka python Clustering using k-means in python

Related Tags

clustering of tweets using k means algorithm as positive or negative

Question

2 answers

solution1
3 2016-08-16 11:31:47

you cannot cluster "as positive or negative"

solution2
-1 2016-08-16 11:13:31

clustering of tweets using k means algorithm as positive or negative

Question

2 answers

solution1 3 2016-08-16 11:31:47

you cannot cluster "as positive or negative"

solution2 -1 2016-08-16 11:13:31

solution1
3 2016-08-16 11:31:47

solution2
-1 2016-08-16 11:13:31