
Hadoop - streaming data from HTTP upload (PUT) into HDFS directly

I have the following application deployment:

  • web front-end taking data from client through HTTP/FTP
  • hadoop cluster

I need to store clients' data on HDFS. What is the best way to do that? Is it possible to stream data to HDFS directly, rather than first saving the client's entire upload to a local drive and then putting it into HDFS?

The feasible options I can think of right now are:

Choosing the "best" one is entirely up to you, based on what is most convenient for your setup.

Personally, if you want low-latency access to data stored on HDFS, I think your best bet is HBase. Putting and getting values is very easy, since it is just a key-value store. We use it in our own application(s) and it works very well.
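To illustrate the streaming idea from the question, here is a minimal sketch in Python of copying an upload to a destination stream chunk by chunk, so the whole payload is never written to a local drive first. The function name `stream_copy` and the two `io.BytesIO` objects are hypothetical stand-ins: in a real deployment the source would be the HTTP request body and the sink would be whatever writable stream your chosen HDFS option gives you (for example, the output stream returned by Hadoop's Java `FileSystem.create`). Treat it as an outline of the pattern, not a finished implementation.

```python
import io


def stream_copy(source, sink, chunk_size=64 * 1024):
    """Copy bytes from source to sink in fixed-size chunks.

    Only one chunk is held in memory at a time, so the upload is
    never spooled to a local file. Returns the number of bytes copied.
    """
    total = 0
    while True:
        chunk = source.read(chunk_size)
        if not chunk:  # an empty read means end of stream
            break
        sink.write(chunk)
        total += len(chunk)
    return total


# Stand-ins for illustration only (assumptions, not real HDFS objects):
# `upload` plays the role of the HTTP request body,
# `hdfs_out` plays the role of an HDFS output stream.
upload = io.BytesIO(b"client data " * 1000)
hdfs_out = io.BytesIO()

copied = stream_copy(upload, hdfs_out, chunk_size=4096)
print(copied)
```

The chunk size is a tuning knob: larger chunks mean fewer writes, smaller chunks mean less memory held per concurrent upload on the web front-end.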
