How to Store Tweets in HDFS?

How can I store tweets about a particular website in HDFS?

Suppose there is a website, www.abcd.com, and I want to collect all users' tweets about this website and store them in HDFS or Hive.

Flume and Sqoop are also supposed to be helpful for storing data.

Can anyone please explain how Flume and Sqoop can be used to store tweets in HDFS?

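Sqoop was not made for this purpose; it is designed for bulk transfers between Hadoop and structured data stores such as relational databases. Flume is the tool for this kind of need. You can write a custom Flume source that pulls the tweets and hands them, through a channel, to an HDFS sink that writes them to HDFS. See this example, which shows how to use Flume to collect data from the Twitter Streaming API and forward it to HDFS.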
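As a rough illustration of how the pieces fit together, a Flume agent configuration for this setup might look like the sketch below. The agent name `TwitterAgent`, the component names, the class `com.example.flume.TweetSource`, its `keywords` property, and the HDFS path are all assumptions made up for this example: the source class stands in for whatever custom source you write, and `keywords` would only work if that class reads it. The memory channel and HDFS sink properties are standard Flume settings.

```properties
# Hypothetical agent "TwitterAgent" with one source, one channel, one sink.
TwitterAgent.sources = Twitter
TwitterAgent.channels = MemChannel
TwitterAgent.sinks = HDFS

# ASSUMPTION: com.example.flume.TweetSource is a custom source you write yourself.
# The "keywords" property is defined by that custom class, not by Flume itself.
TwitterAgent.sources.Twitter.type = com.example.flume.TweetSource
TwitterAgent.sources.Twitter.channels = MemChannel
TwitterAgent.sources.Twitter.keywords = www.abcd.com

# In-memory channel that buffers events between the source and the sink.
TwitterAgent.channels.MemChannel.type = memory
TwitterAgent.channels.MemChannel.capacity = 10000
TwitterAgent.channels.MemChannel.transactionCapacity = 100

# HDFS sink that writes events as plain text files; the path is a placeholder.
TwitterAgent.sinks.HDFS.type = hdfs
TwitterAgent.sinks.HDFS.channel = MemChannel
TwitterAgent.sinks.HDFS.hdfs.path = hdfs://namenode:8020/user/flume/tweets
TwitterAgent.sinks.HDFS.hdfs.fileType = DataStream
TwitterAgent.sinks.HDFS.hdfs.writeFormat = Text
# Roll to a new file every 10 minutes rather than by size or event count.
TwitterAgent.sinks.HDFS.hdfs.rollInterval = 600
TwitterAgent.sinks.HDFS.hdfs.rollSize = 0
TwitterAgent.sinks.HDFS.hdfs.rollCount = 0
```

Assuming the file is saved as `twitter.conf` and the custom source's jar is on Flume's classpath, the agent could be started with something like `flume-ng agent --conf conf --conf-file twitter.conf --name TwitterAgent`.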

You can find more in the official documentation.
