简体   繁体   English

如何设置HDInsight的输入/输出路径以指向本地存储模拟器中的Azure Storage Vault(ASV)?

[英]How to set HDInsight's input/output paths to point to Azure Storage Vault (ASV) in local storage emulator?

I am trying to create a simple map/reducer job for hdinsight but I am having trouble with connecting job in local hdinsight hadoop cluster to local azure storage emulator via ASV:// 我正在尝试为hdinsight创建一个简单的map / reducer作业,但是我无法通过ASV://将本地hdinsight hadoop集群中的作业连接到本地azure存储模拟器

Sample code like this: 样例代码如下:

var hadoop = Hadoop.Connect();
HadoopJobConfiguration config = new HadoopJobConfiguration();
config.InputPath = @"asv://127.0.0.1:10000/devstoreaccount1/testcontainer/input";
config.OutputFolder = "asv://127.0.0.1:10000/devstoreaccount1/testcontainer/output";
hadoop.MapReduceJob.Execute<Mapper1>(config);

or 要么

var hadoop = Hadoop.Connect();
HadoopJobConfiguration config = new HadoopJobConfiguration();
config.InputPath = @"asv://testcontainer/input";
config.OutputFolder = "asv://testcontainer/output";
hadoop.MapReduceJob.Execute<Mapper1>(config);

where testcontainer is an existing container in local emulator's blob storage. 其中testcontainer是本地仿真器的Blob存储中的现有容器。 When running this code I get a StreamingException "Process failed ('Streaming Job Failed!')" 运行此代码时,我得到StreamingException “进程失败(“流作业失败!”)

And see the following error in console output: 并在控制台输出中看到以下错误:

packageJobJar: [] [/C:/Hadoop/hadoop-1.1.0-SNAPSHOT/lib/hadoop-streaming.jar] C: \Users\user1\AppData\Local\Temp\1\streamjob6525516867576872971.jar tmpDir=null 
13/08/26 12:30:51 INFO util.NativeCodeLoader: Loaded the native-hadoop library 
13/08/26 12:30:51 WARN snappy.LoadSnappy: Snappy native library not loaded 
13/08/26 12:30:51 INFO mapred.JobClient: Cleaning up the staging area hdfs://loc alhost:8020/hadoop/hdfs/tmp/mapred/staging/user1/.staging/job_201308221229_0010 
13/08/26 12:30:51 ERROR security.UserGroupInformation: PriviledgedActionExceptio n as:user1 cause:org.apache.hadoop.mapred.InvalidInputException: Input path does  not exist: hdfs:/devstoreaccount1/testcontainer/input 
13/08/26 12:30:51 ERROR streaming.StreamJob: Error Launching job : Input path do es not exist: hdfs:/devstoreaccount1/testcontainer/input 

Streaming Command Failed!

Any hints? 有什么提示吗?

You can use one of the following paths to access local storage emulator. 您可以使用以下路径之一访问本地存储模拟器。

wasb://testcontainer@storageemulator/input wasb:// testcontainer @ storageemulator / input

wasb:///input (when storage emulator is set as the default File System) wasb:/// input(将存储模拟器设置为默认文件系统时)

asv:///input (when storage emulator is set as the default File System, asv is obsolete, not recommended) asv:/// input(将存储模拟器设置为默认文件系统时,asv已过时,不建议使用)

声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.

 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM