简体繁体 English

有关调试Hadoop MapReduce作业中的辅助排序问题的任何技巧？

[英]Any tips on debugging problems with the secondary sort in Hadoop MapReduce job?

原文 2012-02-10 15:05:07 1 1 java/ sorting/ hadoop/ mapreduce

I believe (believed?) I understand how secondary sort works in Hadoop. 我相信（相信吗？）我了解二级排序在Hadoop中的工作方式。 I created an intermediate key consisting of 4 fields. 我创建了一个由4个字段组成的中间键。 I partition by the first field, group by the first and second, and sort by all 4. 我按第一个字段分区，按第一个和第二个字段分组，然后按全部4排序。

It looks like I nailed grouping and partitioning down, but the values come into reducer out of order. 看起来我钉住了分组和分区，但是这些值进入了reducer的混乱状态。

Any ideas on how to approach debugging of this? 关于如何进行此调试的任何想法？

1 个解决方案

At the moment, it appears that static code review either manually or using tools works good. 目前看来，手动或使用工具进行静态代码审查都很好。 I believe I broke the rule: when overriding compareTo() , don't forget to override equals() and hashCode() . 我相信我违反了规则：覆盖compareTo() ，不要忘记覆盖equals()和hashCode() 。 I'll keep everyone posted if fixing this solved the problem. 如果解决此问题，我会通知所有人。

Hadoop中的二级排序 - Secondary Sort in Hadoop

MapReduce Hadoop作业的总体进度 - Overall Progress of MapReduce Hadoop Job

Java Hadoop MapReduce 链接作业 - Java Hadoop MapReduce Chaining Job

hadoop：无法运行mapreduce作业 - hadoop:Not able to run a mapreduce job

hadoop mapReduce工作从未完成 - hadoop mapReduce job never finished

Hadoop MapReduce - 按值排序和排序 - Hadoop MapReduce - Sum and Sort by value

Hadoop：二级排序不起作用 - Hadoop: Secondary sort does not work

mapreduce二级排序不起作用 - mapreduce secondary sort doesn't work

Hadoop-MapReduce中的调试。映射器没有被调用？ - Debugging in Hadoop - MapReduce . Mapper not being called?

即使没有任何意义，如何将 Hadoop mapreduce 作业实现为非 map/reduce？ - How to implement Hadoop mapreduce job as non map/reduce even if does not make any sense?

暂无

暂无

声明:本站的技术帖子网页，遵循CC BY-SA 4.0协议，如果您需要转载，请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.

相关问题 Hadoop中的二级排序 - Secondary Sort in Hadoop MapReduce Hadoop作业的总体进度 - Overall Progress of MapReduce Hadoop Job Java Hadoop MapReduce 链接作业 - Java Hadoop MapReduce Chaining Job hadoop：无法运行mapreduce作业 - hadoop:Not able to run a mapreduce job hadoop mapReduce工作从未完成 - hadoop mapReduce job never finished Hadoop MapReduce - 按值排序和排序 - Hadoop MapReduce - Sum and Sort by value Hadoop：二级排序不起作用 - Hadoop: Secondary sort does not work mapreduce二级排序不起作用 - mapreduce secondary sort doesn't work Hadoop-MapReduce中的调试。映射器没有被调用？ - Debugging in Hadoop - MapReduce . Mapper not being called? 即使没有任何意义，如何将 Hadoop mapreduce 作业实现为非 map/reduce？ - How to implement Hadoop mapreduce job as non map/reduce even if does not make any sense?

相关标签

粤ICP备18138465号 © 2020-2024 STACKOOM.COM