简体   繁体   English

Hadoop发行版

[英]Hadoop distribution

I was using IBM big insights via VNC software (remote access) provided by the university I study but I can't access Internet through that desktop. 我正在通过我所研究的大学提供的VNC软件(远程访问)使用IBM的重要见解,但是我无法通过该桌面访问Internet。 To use some data samples available in internet, I decided to install Hadoop in my laptop (single cluster), but I found that there are many distributions, So What's the best free Hadoop distribution for training as a beginner ? 为了使用Internet上可用的一些数据样本,我决定在笔记本电脑(单个集群)中安装Hadoop ,但是我发现有很多发行版,那么对于初学者来说,最好的免费Hadoop发行版是什么?

1) Amazon Elastic MapReduce
2) Cloudera CDH Hadoop Distribution
3) Hortonworks Data Platform (HDP)
4) MapR Hadoop Distribution
5) IBM Open Platform
6) Microsoft Azure's HDInsight -Cloud based Hadoop Distrbution
7) Pivotal Big Data Suite
8) Datameer Professional
9) Datastax Enterprise Analytics
10) Dell- Cloudera Apache Hadoop Solution.

CDH and Hortonworks are the easiest to get a single node cluster up and running, and are also very widely used so you can find a lot of troubleshooting resources. CDH和Hortonworks是最容易启动和运行单节点群集的工具,并且使用非常广泛,因此您可以找到许多故障排除资源。

If you just want to write application code/run arbitrary MapReduce jobs rather than learn the Hadoop systems architecture, then Amazon EMR is more suitable. 如果您只想编写应用程序代码/运行任意MapReduce作业而不是学习Hadoop系统架构,那么Amazon EMR更适合。

声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.

 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM