Python如何提高numpy數組的性能？

Question

我有一個全局numpy.array 數據，它是一個200 * 200 * 3 3d數組，在3d空間中包含40000個點。

我的目標是計算每個點到單位立方體四個角的距離（（0，0，0），（1，0，0），（0，1，0），（0，0，1）），因此我可以確定哪個角離它最近。

def dist(*point):
    return np.linalg.norm(data - np.array(rgb), axis=2)

buffer = np.stack([dist(0, 0, 0), dist(1, 0, 0), dist(0, 1, 0), dist(0, 0, 1)]).argmin(axis=0)

我編寫了這段代碼並對其進行了測試，每次運行大約花費10毫秒。 我的問題是如何改善這段代碼的性能，最好在不到1ms的時間內運行。

Answer 1

您可以使用Scipy cdist

# unit cube coordinates as array
uc = np.array([[0, 0, 0],[1, 0, 0], [0, 1, 0], [0, 0, 1]])

# buffer output
buf = cdist(data.reshape(-1,3), uc).argmin(1).reshape(data.shape[0],-1)

運行時測試

# Original approach
def org_app():
    return np.stack([dist(0, 0, 0), dist(1, 0, 0), \
       dist(0, 1, 0), dist(0, 0, 1)]).argmin(axis=0)

時間-

In [170]: data = np.random.rand(200,200,3)

In [171]: %timeit org_app()
100 loops, best of 3: 4.24 ms per loop

In [172]: %timeit cdist(data.reshape(-1,3), uc).argmin(1).reshape(data.shape[0],-1)
1000 loops, best of 3: 1.25 ms per loop

Python如何提高numpy數組的性能？

問題描述

1 個解決方案

解決方案1
3 已采納 2017-08-01 13:41:08

Python如何提高numpy數組的性能？

問題描述

1 個解決方案

解決方案1 3 已采納 2017-08-01 13:41:08

解決方案1
3 已采納 2017-08-01 13:41:08