
Hadoop: Why might a furiously writing reduce task be timed out?

I have a Hadoop reduce task that reads its input records in batches, does a lot of processing on each batch, and writes a lot of output for each input batch. I have read that Hadoop considers writing output to be "progress" for the purpose of killing hung tasks. However, despite constantly writing lots of output, my task is still being timed out and killed. So: how can I find out when Hadoop thinks a task last reported progress? Why would I have to call context.progress() with every context.write()? Are there any situations where writing is not counted as progress? (For instance, my keys are NullWritables.) I'm using Cloudera CDH3u1 on CentOS 5.7, if that makes any difference.
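One common workaround, regardless of whether writes are being counted, is to call context.progress() explicitly from inside the reduce loop, since the reducer's Context implements Progressable. The sketch below is hypothetical (the key/value types, batch size, and processing logic are assumptions, not the asker's actual job); it only illustrates where such a call could go.

```java
import java.io.IOException;
import org.apache.hadoop.io.NullWritable;
import org.apache.hadoop.io.Text;
import org.apache.hadoop.mapreduce.Reducer;

// Hypothetical reducer: does expensive per-record work, writes output with
// NullWritable keys, and explicitly reports progress every so often.
public class BatchReducer extends Reducer<Text, Text, NullWritable, Text> {

    @Override
    protected void reduce(Text key, Iterable<Text> values, Context context)
            throws IOException, InterruptedException {
        int count = 0;
        for (Text value : values) {
            // ... expensive per-record processing would go here ...
            context.write(NullWritable.get(), value);

            // Explicitly tell the framework the task is still alive.
            // Writes should already count as progress, but an explicit call
            // removes any doubt during long-running batches.
            if (++count % 1000 == 0) {
                context.progress();
            }
        }
    }
}
```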

Not sure why the tasks are getting killed, but you could increase the value of mapreduce.task.timeout; it defaults to 600000 ms. This might not be good practice, though, because with a larger timeout value, rogue tasks will run for longer before being killed.
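If you only want to raise the limit for this one job rather than cluster-wide, the timeout can be set on the job's Configuration. A minimal sketch, assuming the older MRv1 property name used by CDH3-era releases ("mapred.task.timeout"; newer MRv2/YARN releases use "mapreduce.task.timeout"):

```java
import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.mapreduce.Job;

// Illustrative only: raise the task timeout for a single job.
// Both property names default to 600000 ms (10 minutes).
public class TimeoutConfigExample {
    public static void main(String[] args) throws Exception {
        Configuration conf = new Configuration();
        conf.setLong("mapred.task.timeout", 1800000L); // 30 minutes
        Job job = new Job(conf, "long-running-reduce");
        // ... set mapper, reducer, input and output paths as usual ...
    }
}
```

Setting the timeout to 0 disables it entirely, but that makes genuinely hung tasks run forever, so a finite increase is usually the safer choice.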

