Parallelizing a CPU-bound Python function
I have a CPU-bound Python function that takes around 15 seconds to run on a standard core. I need to run this function tens of thousands of times. The function input is a dataset around 10 kB in size, so data transfer time should be negligible compared to the runtime. The functions do not need to communicate with each other. The return value is a small array.
I do not need to synchronize these functions at all. All I care about is that when one core finishes, it gets delegated a new job.
What is a good framework to start parallelizing this problem with? I would like to be able to run this on my own computers and also Amazon units.
Would Python's multiprocessing module do the trick? Would I be better off with something other than that?
If no communication is needed, the simplest approach is Pool.map. It works like the built-in map function, but each item is processed in one of the child processes.
import multiprocessing

def fu(chunk):
    # your code here
    return result

def produce_data(data):
    while data:
        # you need to split data into chunks here
        yield chunk

if __name__ == '__main__':
    # create the pool after the worker function is defined, inside the
    # __main__ guard, so it also works with the spawn start method (Windows)
    pool = multiprocessing.Pool(processes=4)
    result = pool.map(fu, produce_data(data))
    # result will be an ordered list of results for each chunk
There are several other ways to process data with multiprocessing, for example Pool.imap_unordered or Pool.apply_async.