简体   繁体   中英

How to install cloudera impala on EMR?

Is there anyway i can install the only impala without cloudera manager and without cdh. I will be using the apache version of hadoop?

Yes, it is absolutely possible. Add the repository into your sources.list file and update the repository after that.

deb [arch=amd64] http://archive.cloudera.com/impala/ubuntu/precise/amd64/impala precise-impala1 contrib deb-src http://archive.cloudera.com/impala/ubuntu/precise/amd64/impala precise-impala1 contrib

After that, it's merely :

sudo apt-get install impala                (Binaries for daemons)
sudo apt-get install impala-server         (Service start/stop script)
sudo apt-get install impala-state-store    (Service start/stop script)

But do not forget to meet all the prerequisites. For a detailed info you can go here

您可以在此处查看有关如何将Impala与Amazon EMR一起安装和使用的详细说明: http : //docs.aws.amazon.com/ElasticMapReduce/latest/DeveloperGuide/emr-impala.html

EMR is based on a Amazon Hadoop distribution that runs on top of Debian squeeze. So, yes it's possible using Cloudera's DEB repo.

You will need to SSH to your EMR master node, find the address on EMR console.

You will also need to enable security rules on the security group you have assigned to your EMR cluster, if you intend to connect to Impala using a JDBC/ODBC client form the outside world.

在EMR上检查一下Impala 2.2

https://github.com/aws-samples/emr-bootstrap-actions/tree/master/impala

The technical post webpages of this site follow the CC BY-SA 4.0 protocol. If you need to reprint, please indicate the site URL or the original address.Any question please contact:yoyou2525@163.com.

 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM