简体   繁体   中英

What is “Letter Distribution” and what is “Word Distribution” in NLP dataset while preforming Exploratory data analysis(EDA)

Guys im new to data analyst, Im trying to improve my skills so I toke a dataset from kaggle. these are task of the dataset I'm stuck on task 3 and 4 of EDA. anyone help me regarding this and how I can perform it. [Note: This is not any project. I just want to improve my skills for a job]

They want you to count the # (instances) of each word or letter in the dataset.

This is part of the EDA, however, so I believe you don't strictly need to do it, it is just potentially helpful for identifying further avenues for analysis.

The technical post webpages of this site follow the CC BY-SA 4.0 protocol. If you need to reprint, please indicate the site URL or the original address.Any question please contact:yoyou2525@163.com.

 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM