简体   繁体   English

如何将文件的内容分配到几个节点?

[英]How to distribute a content of File into several nodes?

I need to distribute the content of a file (upto 5 GB) over a Network such that each distributed node(XML or Simple File) of a file won't have replicate data and get synced with each other. 我需要通过网络分发文件内容(最多5 GB),以使文件的每个分布式节点(XML或简单文件)都不会具有复制数据并彼此同步。

On a bigger note there could be more than 1000 of Big Files which needs to be take place in Distribution process. 总的来说,在分发过程中可能需要处理1000多个大文件。

Can i use Hadoop HDFS for Storing Big Files and use cluster Nodes for Distributed data? 我可以使用Hadoop HDFS来存储大文件,也可以将群集节点用于分布式数据吗?

Well yes, that's exactly what HDFS is used for. 是的,这正是HDFS的用途。 You can control the replication factor easily if you don't want any duplication, but a replication factor of 2+ is usually recommended to allow for some high availability. 如果您不想进行任何复制,则可以轻松控制复制因子,但是通常建议将复制因子设置为2+,以实现较高的可用性。

声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.

 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM