简体   繁体   English

设置(Linux)Hadoop集群

[英]Setting up a (Linux) Hadoop cluster

您是否需要先设置Linux集群才能设置Hadoop集群?

No. Hadoop has its own software to manage a "cluster". 不,Hadoop有自己的软件来管理“集群”。 Just install linux and make sure the machines can talk to each other. 只需安装linux并确保机器可以相互通信。

Deploying the Hadoop software, along with the appropriate config files, and starting it on each node (which Hadoop can do automatically) creates the cluster from the Linux machines you have. 部署Hadoop软件以及相应的配置文件,并在每个节点(Hadoop可以自动执行)上启动它,从您拥有的Linux机器创建集群。 So, no, by that definition you don't need to have a separate linux cluster. 所以,不,根据这个定义,你不需要一个单独的Linux集群。 If your question is whether or not you need to have a multiple-machine cluster to use Hadoop: no, you can run Hadoop on a single machine for either testing or small-sized jobs, via either local mode (where everything is confined to a single process) or pseudodistributed mode (where you trick Hadoop into thinking it's running on multiple computers). 如果您的问题是您是否需要使用多机群集来使用Hadoop:不,您可以在一台计算机上运行Hadoop以进行测试或小型作业,通过本地模式(其中所有内容都局限于单进程)或伪分布模式(你欺骗Hadoop认为它在多台计算机上运行)。

声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.

 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM