简体   繁体   English

连接到CDH4.5群集

[英]Connect to a CDH4.5 Cluster

I have a CDH4.5 cluster, and I want to upload files into it from another server (eg database server). 我有一个CDH4.5集群,我想从另一个服务器(例如数据库服务器)将文件上传到其中。

With vanilla Hadoop and Hive, I can change the configuration files, pointing the namenode and metastore to remote services, and simply run: 使用香草Hadoop和Hive,我可以更改配置文件,将namenode和metastore指向远程服务,只需运行:

dba@db-001$ hadoop fs -copyFromLocal /path/to/export.tsv
dba@db-001$ hive -e "load data local inpath '/path/to/export.tsv' into table test.my_table"

But how about CDH? 但是CDH呢? What components should I install on other servers? 我应该在其他服务器上安装哪些组件?

  1. Download hadoop and hive tarballs corresponding to the CDH version: http://www.cloudera.com/content/cloudera-content/cloudera-docs/CDH4/latest/CDH-Version-and-Packaging-Information/cdhvd_topic_3.html#topic_3_1_unique_7 下载与CDH版本相对应的hadoop和蜂巢tarball: http : //www.cloudera.com/content/cloudera-content/cloudera-docs/CDH4/latest/CDH-Version-and-Packaging-Information/cdhvd_topic_3.html#topic_3_1_unique_7

  2. Download configuration files from cloudera manager, unzip them, and point HADOOP_CONF_DIR/HIVE_CONF_DIR to these folders. 从cloudera管理器下载配置文件,解压缩它们,然后将HADOOP_CONF_DIR / HIVE_CONF_DIR指向这些文件夹。

声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.

 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM