
How to apply TF-IDF to find important words in a CSV file using PyCharm

I have a CSV file that contains some data.

Here is an example of the data:

+------------+---------------------------------+-------------------------+
|  SOC Code  |              Title              |  Occupational Category  |
+------------+---------------------------------+-------------------------+
| 11-1011.03 | Chief Sustainability Officers   | New & Emerging          |
| 11-1021.00 | General and Operations Managers | Enhanced Skills         |
+------------+---------------------------------+-------------------------+

I need to find the most frequent words in the file. Any ideas on how this can be done? Code snippets would be appreciated as examples.

You can use NLTK's FreqDist to count the words and then return the most frequent ones.
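As a minimal sketch of that suggestion, the snippet below reads the text columns with the standard csv module, splits them into lowercase words, and counts them. Several things here are assumptions and not part of the question: the file is taken to be comma-separated with the column names shown in the sample table, the function name most_frequent_words and the small STOPWORDS set are made up for illustration, and the tokenizer is a simple regular expression. FreqDist is a subclass of collections.Counter, so the sketch falls back to Counter when NLTK is not installed.

```python
import csv
import io
import re

try:
    # FreqDist subclasses collections.Counter, so most_common() is available.
    from nltk import FreqDist
except ImportError:
    # Fallback so the sketch still runs without NLTK installed.
    from collections import Counter as FreqDist

# Assumed comma-separated version of the sample rows from the question.
SAMPLE_CSV = """SOC Code,Title,Occupational Category
11-1011.03,Chief Sustainability Officers,New & Emerging
11-1021.00,General and Operations Managers,Enhanced Skills
"""

# Hypothetical, deliberately tiny stop-word list; extend as needed.
STOPWORDS = {"and", "the", "of", "in", "for", "to", "a"}


def most_frequent_words(csv_file, columns, top_n=10):
    """Return the top_n (word, count) pairs found in the given columns."""
    reader = csv.DictReader(csv_file)
    words = []
    for row in reader:
        for column in columns:
            # Keep runs of letters only, lowercased; drop stop words.
            tokens = re.findall(r"[a-z]+", row[column].lower())
            words.extend(t for t in tokens if t not in STOPWORDS)
    return FreqDist(words).most_common(top_n)


if __name__ == "__main__":
    counts = most_frequent_words(
        io.StringIO(SAMPLE_CSV), ["Title", "Occupational Category"]
    )
    print(counts)
```

For a real file, pass an open file object in place of the StringIO, for example `open("occupations.csv", newline="", encoding="utf-8")`, where the filename is hypothetical. Note that this counts raw frequency only; it does not compute the TF-IDF weights mentioned in the title.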
