
Tez - DAGAppMaster - java.lang.IllegalArgumentException: Invalid ContainerId

I am trying to launch a MapReduce job, but I get an error when running the job from the shell or from Hive:

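>     hive> select count(*) from employee;
>     Query ID = mapr_20171107135114_a574713d-7d69-45e1-aa73-d4de07a3059b
>     Total jobs = 1
>     Launching Job 1 out of 1
>     Number of reduce tasks determined at compile time: 1
>     In order to change the average load for a reducer (in bytes):
>       set hive.exec.reducers.bytes.per.reducer=
>     In order to limit the maximum number of reducers:
>       set hive.exec.reducers.max=
>     In order to set a constant number of reducers:
>       set mapreduce.job.reduces=
>     Starting Job = job_1510052734193_0005, Tracking URL = http://hdpsrvpre2.intranet.darty.fr:8088/proxy/application_1510052734193_0005/
>     Kill Command = /opt/mapr/hadoop/hadoop-2.7.0/bin/hadoop job -kill job_1510052734193_0005
>     Hadoop job information for Stage-1: number of mappers: 0; number of reducers: 0
>     2017-11-07 13:51:25,951 Stage-1 map = 0%, reduce = 0%
>     Ended Job = job_1510052734193_0005 with errors
>     Error during job, obtaining debugging information...
>     FAILED: Execution Error, return code 2 from org.apache.hadoop.hive.ql.exec.mr.MapRedTask
>     MapReduce Jobs Launched:
>     Stage-Stage-1: MAPRFS Read: 0 MAPRFS Write: 0 FAIL
>     Total MapReduce CPU Time Spent: 0 msec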

In the ResourceManager log, this is what I found:

>     2017-11-07 13:51:25,269 INFO org.apache.hadoop.yarn.server.resourcemanager.rmapp.attempt.RMAppAttemptImpl: appattempt_1510052734193_0005_000002 State change from LAUNCHED to FINAL_SAVING
>     2017-11-07 13:51:25,269 INFO org.apache.hadoop.yarn.server.resourcemanager.recovery.FileSystemRMStateStore: Updating info for attempt: appattempt_1510052734193_0005_000002 at: /var/mapr/cluster/yarn/rm/system/FSRMStateRoot/RMAppRoot/application_1510052734193_0005/appattempt_1510052734193_0005_000002
>     2017-11-07 13:51:25,283 INFO org.apache.hadoop.yarn.server.resourcemanager.ApplicationMasterService: Unregistering app attempt : appattempt_1510052734193_0005_000002
>     2017-11-07 13:51:25,283 INFO org.apache.hadoop.yarn.server.resourcemanager.security.AMRMTokenSecretManager: Application finished, removing password for appattempt_1510052734193_0005_000002
>     2017-11-07 13:51:25,283 **INFO org.apache.hadoop.yarn.server.resourcemanager.rmapp.attempt.RMAppAttemptImpl: appattempt_1510052734193_0005_000002 State change from FINAL_SAVING to FAILED**
>     2017-11-07 13:51:25,284 INFO org.apache.hadoop.yarn.server.resourcemanager.rmapp.RMAppImpl: The number of failed attempts is 2. The max attempts is 2
>     2017-11-07 13:51:25,284 INFO org.apache.hadoop.yarn.server.resourcemanager.rmapp.RMAppImpl: Updating application application_1510052734193_0005 with final state: FAILED
>     2017-11-07 13:51:25,284 INFO org.apache.hadoop.yarn.server.resourcemanager.rmapp.RMAppImpl: application_1510052734193_0005 State change from ACCEPTED to FINAL_SAVING
>     2017-11-07 13:51:25,284 INFO org.apache.hadoop.yarn.server.resourcemanager.recovery.RMStateStore: Updating info for app: application_1510052734193_0005
>     2017-11-07 13:51:25,284 INFO org.apache.hadoop.yarn.server.resourcemanager.scheduler.fair.FairScheduler: Application appattempt_1510052734193_0005_000002 is done. finalState=FAILED
>     2017-11-07 13:51:25,284 INFO org.apache.hadoop.yarn.server.resourcemanager.recovery.FileSystemRMStateStore: Updating info for app: application_1510052734193_0005 at: /var/mapr/cluster/yarn/rm/system/FSRMStateRoot/RMAppRoot/application_1510052734193_0005/application_1510052734193_0005
>     2017-11-07 13:51:25,284 INFO org.apache.hadoop.yarn.server.resourcemanager.scheduler.AppSchedulingInfo: Application application_1510052734193_0005 requests cleared
>     2017-11-07 13:51:25,296 INFO org.apache.hadoop.yarn.server.resourcemanager.rmapp.RMAppImpl: Application application_1510052734193_0005 failed 2 times due to AM Container for appattempt_1510052734193_0005_000002 exited with exitCode: 1
>     For more detailed output, check application tracking page:http://hdpsrvpre2.intranet.darty.fr:8088/cluster/app/application_1510052734193_0005Then, click on links to logs of each attempt.
>     Diagnostics: Exception from container-launch.
>     Container id: container_e10_1510052734193_0005_02_000001
>     Exit code: 1
>     Stack trace: ExitCodeException exitCode=1:
>         at org.apache.hadoop.util.Shell.runCommand(Shell.java:545)
>         at org.apache.hadoop.util.Shell.run(Shell.java:456)
>         at org.apache.hadoop.util.Shell$ShellCommandExecutor.execute(Shell.java:722)
>         at org.apache.hadoop.yarn.server.nodemanager.LinuxContainerExecutor.launchContainer(LinuxContainerExecutor.java:304)
>         at org.apache.hadoop.yarn.server.nodemanager.containermanager.launcher.ContainerLaunch.call(ContainerLaunch.java:354)
>         at org.apache.hadoop.yarn.server.nodemanager.containermanager.launcher.ContainerLaunch.call(ContainerLaunch.java:87)
>         at java.util.concurrent.FutureTask.run(FutureTask.java:262)
>         at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1152)
>         at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:622)
>         at java.lang.Thread.run(Thread.java:748)
>     Shell output: main : command provided 1
>     main : user is mapr
>     main : requested yarn user is mapr
>     
>     Container exited with a non-zero exit code 1 Failing this attempt. Failing the application.

Also, in the job's syslog, I found:

>     2017-11-07 12:09:46,419 FATAL [main] app.DAGAppMaster: Error starting DAGAppMaster
>     java.lang.IllegalArgumentException: Invalid ContainerId:
>         at org.apache.hadoop.yarn.util.ConverterUtils.toContainerId(ConverterUtils.java:182)
>         at org.apache.tez.dag.app.DAGAppMaster.main(DAGAppMaster.java:1794)
>     Caused by: java.lang.NumberFormatException: For input string: "e10"
>         at java.lang.NumberFormatException.forInputString(NumberFormatException.java:65)
>         at java.lang.Long.parseLong(Long.java:441)
>         at java.lang.Long.parseLong(Long.java:441)
>         at org.apache.hadoop.yarn.util.ConverterUtils.toApplicationAttemptId(ConverterUtils.java:137)
>         at org.apache.hadoop.yarn.util.ConverterUtils.toContainerId(ConverterUtils.java:177)
>         ... 1 more
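
One plausible reading of this trace: the container ID reported by the ResourceManager, `container_e10_1510052734193_0005_02_000001`, carries an epoch segment (`e10`), while the `ConverterUtils` class loaded by the Tez application master tries to read that segment as a number. The Python sketch below is only a simplified model of that behavior, not the Hadoop source; the function names `parse_legacy` and `parse_epoch_aware` are hypothetical and exist purely for illustration.

```python
def parse_legacy(container_id):
    """Simplified model of a parser that predates the epoch segment.

    Expects container_<clusterTimestamp>_<appId>_<attemptId>_<containerNum>.
    This is an illustration, not the actual Hadoop implementation.
    """
    parts = container_id.split("_")
    try:
        return {
            "cluster_timestamp": int(parts[1]),  # fails here on "e10"
            "app_id": int(parts[2]),
            "attempt_id": int(parts[3]),
            "container_num": int(parts[4]),
        }
    except (ValueError, IndexError) as exc:
        raise ValueError(f"Invalid ContainerId: {container_id}") from exc


def parse_epoch_aware(container_id):
    """Same model, but it accepts an optional e<epoch> segment after the prefix."""
    parts = container_id.split("_")
    epoch = 0
    if len(parts) > 1 and parts[1].startswith("e"):
        epoch = int(parts[1][1:])
        parts = [parts[0]] + parts[2:]  # drop the epoch segment
    fields = parse_legacy("_".join(parts))
    fields["epoch"] = epoch
    return fields


if __name__ == "__main__":
    cid = "container_e10_1510052734193_0005_02_000001"  # taken from the RM log
    try:
        parse_legacy(cid)
    except ValueError as err:
        print("legacy parser:", err)
    print("epoch-aware parser:", parse_epoch_aware(cid))
```

If this model matches what is happening, the parser on the application master's classpath is older than the ResourceManager that generated the ID, which points at mixed Hadoop JAR versions rather than at the query itself.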

It seems that Tez is causing the problem. Is there a solution? Thanks!

I think the execution environment contains different versions of Hadoop and their respective JAR files.

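Please verify the environment and make sure that only the required version is used, and remove references to any other version from all environment variables.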
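
One way to carry out this check is to list the JARs on the classpath and flag any artifact that appears in more than one version. The sketch below is a minimal example under stated assumptions: JAR files follow the `<artifact>-<version>.jar` naming pattern, and the classpath is read from the `HADOOP_CLASSPATH` environment variable (the output of `hadoop classpath` could be passed in instead). The helper names `expand_classpath` and `find_version_conflicts` are hypothetical.

```python
import glob
import os
import re
from collections import defaultdict

# Assumed naming pattern: "<artifact>-<version>.jar", version starting with a digit.
JAR_PATTERN = re.compile(r"^(?P<name>.+?)-(?P<version>\d[\w.\-]*)\.jar$")


def expand_classpath(classpath):
    """Split a classpath string and expand 'dir/*' entries into JAR paths."""
    jars = []
    for entry in classpath.split(os.pathsep):
        if entry.endswith("*"):
            jars.extend(p for p in glob.glob(entry) if p.endswith(".jar"))
        elif entry.endswith(".jar"):
            jars.append(entry)
    return jars


def find_version_conflicts(jar_paths):
    """Return {artifact: [versions]} for artifacts present in several versions."""
    versions = defaultdict(set)
    for path in jar_paths:
        match = JAR_PATTERN.match(os.path.basename(path))
        if match:
            versions[match["name"]].add(match["version"])
    return {name: sorted(found) for name, found in versions.items() if len(found) > 1}


if __name__ == "__main__":
    # Assumption: the classpath of interest is exported in HADOOP_CLASSPATH.
    classpath = os.environ.get("HADOOP_CLASSPATH", "")
    conflicts = find_version_conflicts(expand_classpath(classpath))
    for name, found in sorted(conflicts.items()):
        print(f"{name}: {', '.join(found)}")
```

Any artifact the script prints, in particular a `hadoop-yarn-*` JAR, would be a candidate for removal from the environment variables or library directories that the Tez application master picks up.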
