[英]Hadoop reduce jobs completed successfully but processes remains and taking memory
There is a 4 node hadoop cluster.有一个 4 节点的 hadoop 集群。 One of the main node have a strange bug that is the reduce processes finished the job, but it did not terminate.
其中一个主节点有一个奇怪的错误,即reduce进程完成了作业,但没有终止。 It makes the memory exceed quickly.
它使内存超速。
I don't know what cause this problem but it only occurs on one node, others are fine.我不知道是什么导致了这个问题,但它只发生在一个节点上,其他节点都很好。 There is also no any error report of the job, just a processes doing nothing.
作业也没有任何错误报告,只是一个什么都不做的进程。
I found the problem was.我发现问题是。 I used a library that would open multi-thread when heavy load.
我使用了一个可以在重载时打开多线程的库。 If the threads never stop, the processes remains running either.
如果线程永不停止,进程也将继续运行。
声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.