简体   繁体   English

Hortonworks沙箱中的流Hadoop jar文件流,无contrib目录

[英]Streaming Hadoop jar file in Hortonworks sandbox, no contrib directory

I'm demoing hadoop in the hortonworks sandbox virtual machine . 我在hortonworks沙箱虚拟机中演示hadoop。

I've previously set up simple elastic map reduce streaming jobs on EC2, following patterns like this , or this . 我以前已经按照thisthis的 模式在EC2上设置了简单的弹性映射减少流作业。

However, it seems I have no streaming jar installed---in fact, I seem to be lacking many of the basic directories required in: 但是,似乎我没有安装流传输罐-实际上,我似乎缺少许多必需的基本目录:

$HADOOP_HOME/mapred/contrib/

My ls -lah actually looks like this: 我的ls -lah实际上看起来像这样:

[root@sandbox ~]# ls -lah
total 60K
dr-xr-x---.  5 root root 4.0K Apr 10 18:52 .
dr-xr-xr-x. 24 root root 4.0K Apr 10 18:31 ..
-rw-------   1 root root  126 Oct 28 08:35 .bash_history
-rw-r--r--.  1 root root   18 May 20  2009 .bash_logout
-rw-r--r--.  1 root root  176 May 20  2009 .bash_profile
-rw-r--r--   1 root root  262 Oct 28 08:29 .bashrc
-rw-r--r--.  1 root root  100 Sep 22  2004 .cshrc
-rw-r--r--   1 root root    0 Oct 28 08:34 .hdfs_prepared
drwxr-xr-x   2 root root 4.0K Apr 10 18:22 .pip
drwxr-----   3 root root 4.0K Oct 20 16:21 .pki
-rw-------   1 root root 1.0K Oct 20 14:04 .rnd
drwx------   2 root root 4.0K Oct 20 09:21 .ssh
lrwxrwxrwx   1 root root   48 Oct 28 08:28 start_ambari.sh -> /usr/lib/hue/tools/start_scripts/start_ambari.sh
lrwxrwxrwx   1 root root   47 Oct 28 08:28 start_hbase.sh -> /usr/lib/hue/tools/start_scripts/start_hbase.sh
-rw-r--r--.  1 root root  129 Dec  3  2004 .tcshrc
-rw-------   1 root root 4.8K Oct 28 08:30 .viminfo
-rw-r--r--   1 root root  218 Oct 20 08:55 zero_machine.sh

Although using the hadoop commands, I see that the mapred exists, but contains no contrib. 尽管使用hadoop命令,但我看到mapred存在,但不包含任何贡献。

[root@sandbox ~]# hadoop fs -ls /
Found 6 items
drwxrwxrwt   - yarn   hadoop          0 2014-04-10 19:14 /app-logs
drwxr-xr-x   - hdfs   hdfs            0 2013-10-20 15:08 /apps
drwxr-xr-x   - mapred hdfs            0 2013-10-20 15:10 /mapred
drwxr-xr-x   - hdfs   hdfs            0 2013-10-20 15:10 /mr-history
drwxrwxrwx   - hdfs   hdfs            0 2013-10-28 08:34 /tmp
drwxr-xr-x   - hdfs   hdfs            0 2013-10-28 08:34 /user
[root@sandbox ~]# hadoop fs -ls /mapred/
Found 1 items
drwxr-xr-x   - mapred hdfs          0 2013-10-20 15:10 /mapred/system

Is there a special download page for the streaming .jar file? 流.jar文件有一个特殊的下载页面吗? When I access the link at the bottom of this page , the link to streaming is dead . 当我访问此页面底部的链接时, 指向流链接已失效

The default location of hadoop streaming jar is /usr/lib/hadoop/contrib/streaming/hadoop-streaming-*.jar . hadoop流jar的默认位置是/usr/lib/hadoop/contrib/streaming/hadoop-streaming-*.jar

If you cannot find the jar in the above mentioned location. 如果在上述位置找不到罐子。 you could download hadoop-streaming-*.jar from the following Hortonworks repository: 您可以从以下Hortonworks存储库中下载hadoop-streaming-*。jar:

http://repo.hortonworks.com/content/repositories/releases/org/apache/hadoop/hadoop-streaming/

There is a convenience link /usr/lib/hadoop-mapreduce/hadoop-streaming.jar to that jar. 有一个到该jar的便捷链接/usr/lib/hadoop-mapreduce/hadoop-streaming.jar I tested it in Hortonworks Sandbox 2.1 我在Hortonworks Sandbox 2.1中进行了测试

#ls -AlF /usr/lib/hadoop-mapreduce/hadoop-streaming.jar
lrwxrwxrwx 1 root root 38 Apr 21 06:57 /usr/lib/hadoop-mapreduce/hadoop-streaming.jar -> hadoop-streaming-2.4.0.2.1.1.0-385.jar

声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.

 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM