简体   繁体   English

如何在Hadoop mapreduce中处理负载平衡?

[英]How can load balancing be handled in Hadoop mapreduce?

How can load balancing be handled in Hadoop mapreduce? 如何在Hadoop mapreduce中处理负载平衡? I am writing a distributed application in which the server distributes jobs to worker nodes based on a benchmark test, memory available, number of CPU cores, CPU usage, number of GPUs available / usage? 我正在编写一个分布式应用程序,其中服务器根据基准测试,可用内存,CPU内核数量,CPU使用率,可用GPU数量/使用量将作业分配给工作节点? I am not very experienced with mapreduce and have read some documentation on apache's website but am still not sure how to go about and solve this problem. 我对mapreduce不是很有经验,并且已经在apache的网站上阅读了一些文档,但我仍然不确定如何去解决这个问题。 Can I do the benchmark calculation and get this all of this information and then by an algorithm to dynamically split up the input? 我可以进行基准计算并获得所有这些信息,然后通过算法动态分割输入吗?

Thank you! 谢谢!

"MapReduce is a programming model and an associated implementation for processing and generating large data sets" extract of the abstract of MapReduce paper. “MapReduce是一种编程模型,是处理和生成大型数据集的相关实现”,是MapReduce论文摘要的摘录。

As you said it in comments, it seems your project is not data intensive but computing intensive, thus I think MapReduce is not the tool you need to use. 正如您在评论中所说的那样,您的项目似乎不是数据密集型而是计算密集型,因此我认为MapReduce不是您需要使用的工具。

Performance of MapReduce systems strongly depends on an even data distribution. MapReduce系统的性能很大程度上取决于均匀的数据分布。 Apache MapReduce frameworks use a simplistic approach to distribute the work load and assign the same number of clusters to each reducer. Apache MapReduce框架使用简单的方法来分配工作负载,并为每个reducer分配相同数量的集群。

The load imbalance, which raises the processing time, is even amplified by the high runtime complexities of the reducer tasks. 负载不平衡可以提高处理时间,甚至可以通过减速器任务的高运行时复杂性进行放大。 An adaptive load balancing strategy is required to address the problem of estimating the cost of the tasks that are distributed to the reducers based on a given cost model. 需要自适应负载平衡策略来解决基于给定成本模型估计分配给减速器的任务的成本的问题。

声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.

 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM