[英]How to gather arrays of unequal length with mpi4py
我正在嘗試在不同節點上獲取許多不同長度的列表,將它們收集在一個節點中,然后讓主節點將它們放在一組中。 該列表在每個節點中均名為rout_array
。 請注意, rout_array
中的元素只是整數,並且在節點之間不是唯一的。
Traceback (most recent call last):
File "prout.py", line 160, in <module>
main()
File "prout.py", line 153, in main
num = DetermineRoutingNumber(steps, goal, vertexSetSize)
File "prout.py", line 129, in DetermineRoutingNumber
comm.Gather(send_buffer, recv_buffer, root = 0)
File "MPI\Comm.pyx", line 589, in mpi4py.MPI.Comm.Gather (c:\projects\mpi4py\src\mpi4py.MPI.c:97806)
File "MPI\msgbuffer.pxi", line 525, in mpi4py.MPI._p_msg_cco.for_gather (c:\projects\mpi4py\src\mpi4py.MPI.c:34678)
File "MPI\msgbuffer.pxi", line 446, in mpi4py.MPI._p_msg_cco.for_cco_send (c:\projects\mpi4py\src\mpi4py.MPI.c:33938)
File "MPI\msgbuffer.pxi", line 148, in mpi4py.MPI.message_simple (c:\projects\mpi4py\src\mpi4py.MPI.c:30349)
File "MPI\msgbuffer.pxi", line 93, in mpi4py.MPI.message_basic (c:\projects\mpi4py\src\mpi4py.MPI.c:29448)
KeyError: 'O'
當我的代碼中沒有字符串時,我不知道如何獲取'O'
的KeyError。 所有列表均包含整數,numpy數組包含整數,並且此處唯一活動的字典僅包含鍵的整數。 應當注意,每個節點都輸出此錯誤。
import numpy, math
from mpi4py import MPI
from sympy.combinatorics import Permutation as Perm
def GetEdges(size,file):
"""This function takes in a file of edges in a graph in the form 'u,v'
without quotes, where u and v are vertices of the graph. It then
generates a permutation that swaps those vertices, and returns these
transpositions."""
edgeFile = open(file, "r")
edges = []
for line in edgeFile:
line = line.strip()
line = line.split(",")
for vertex in line:
line[line.index(vertex)] = int(vertex)
edges.append(Perm([line], size = size))
edgeFile.close()
edges.append(Perm([[size - 1]], size = size))
return edges
def AreDisjoint(p1,p2):
"""This function determines whether or not two permutations move any
common elements, and returns the appropriate boolean."""
v1 = set(p1.support())
v2 = set(p2.support())
return len(v1 & v2) == 0
def GetMatchings(edges, maxMatching, size):
"""This function takes in a set of edges given by GetEdges(), and
generates all possible matchings in the given graph. It then converts
each matching into its rank given by lexicographical order, and appends
that rank to a set, which is then returned."""
stepDict = {1:set(edges)}
steps = set(edges)
for i in range(1,maxMatching):
temp = set()
for p1 in stepDict[1]:
for p2 in stepDict[i]:
newPerm = p1 * p2
if AreDisjoint(p1,p2) and newPerm not in steps:
temp.add(newPerm)
steps.add(newPerm)
stepDict[i+1] = temp
newSteps = set()
for step in steps:
newSteps.add(step.rank())
return newSteps
def FromRank(rank,level):
"""This function takes in a rank and size of a permutation, then returns
the permutation that lies at the rank according to lexicographical
ordering. """
lst = list(range(level + 1))
perm = []
while lst:
fact = math.factorial(len(lst) - 1)
index, rank = divmod(rank, fact)
perm.append(lst.pop(index))
assert rank == 0
return perm
def SplitArrayBetweenNodes(rank, rem, length):
"""This function takes in the rank of a node and any remainder after
dividing up an array between all the nodes. It then returns a starting
and ending partition index unique to each node."""
if rem != 0:
if rank in list(range(rem)):
if rank == 0:
part_start = 0
part_end = length
else:
part_start = rank * (length + 1)
part_end = part_start + length
else:
part_start = rank * length + rem
part_end = part_start + length - 1
else:
part_start = rank * length
part_end = part_start + length - 1
return part_start, part_end
def DetermineRoutingNumber(steps, goal, vertexSetSize):
"""This function takes in the matchings created by GetMatchings(),
and calculates all possible products between its own elements. It then
takes all unique products, and calculates all possible prducts between
the matching set and the previous output. This repeats until all
permutations of a given type are found. The level at which this occurs
is then returned."""
comm = MPI.COMM_WORLD
rank = comm.Get_rank()
size = comm.Get_size()
length = len(steps)
rem = length % size
part_len = length // size
part_start, part_end = SplitArrayBetweenNodes(rank,rem, part_len)
permDict = {1: steps}
i = 1
while True:
rout_array = set()
work_array = set(list(permDict[i])[part_start:part_end + 1])
#Calculate all possible products
for p1 in permDict[1]:
for p2 in work_array:
p2_perm = Perm(FromRank(p2,vertexSetSize - 1))
p1_perm = Perm(FromRank(p1,vertexSetSize - 1))
new = p2_perm * p1_perm
if new(0) == 0 or new(0) == 1:
order = new.rank()
rout_array.add(order)
#All nodes send their work to master node
comm.Barrier()
send_buffer = numpy.array(rout_array)
sendcounts = numpy.array(comm.gather(len(rout_array), root = 0))
if rank == 0:
recv_buffer = numpy.empty(sum(sendcounts), dtype = int)
else:
recv_buffer = None
comm.Gatherv(sendbuf = send_buffer, recvbuf = (recv_buffer, sendcounts), root = 0)
#Generate input for next level of the loop, and weed out repeats.
permDict[i+1] = rout_array
for j in range(1,i+1):
permDict[i+1] = permDict[i+1] - permDict[j]
def main():
file = "EdgesQ2.txt"
maxMatching = 2
vertexSetSize = 4
edges = GetEdges(vertexSetSize, file)
steps = GetMatchings(edges, maxMatching, vertexSetSize)
goal = 2 * math.factorial(vertexSetSize-1)
num = DetermineRoutingNumber(steps, goal, vertexSetSize)
print(num)
main()
請注意,在此示例中, maxMatching = 2
且vertexSetSize = 4
。 輸出應為3
。
0,1
1,2
2,3
0,3
請注意,在此示例中, maxMatching = 4
, vertexSetSize = 8
。 輸出應為4
。
0,1
0,3
0,4
1,2
1,5
2,3
2,6
3,7
4,5
4,7
5,6
6,7
如果跨進程的長度不同,則需要使用向量變量Gatherv
。 使用該函數,您可以提供一個包含各種長度(recvcounts)的數組。
不幸的是,mpi4py文檔當前未描述如何使用Gatherv
或任何其他向Gatherv
體。 這是一個簡單的示例:
#!/usr/bin/env python3
import numpy as np
from mpi4py import MPI
import random
comm = MPI.COMM_WORLD
rank = comm.Get_rank()
root = 0
local_array = [rank] * random.randint(2, 5)
print("rank: {}, local_array: {}".format(rank, local_array))
sendbuf = np.array(local_array)
# Collect local array sizes using the high-level mpi4py gather
sendcounts = np.array(comm.gather(len(sendbuf), root))
if rank == root:
print("sendcounts: {}, total: {}".format(sendcounts, sum(sendcounts)))
recvbuf = np.empty(sum(sendcounts), dtype=int)
else:
recvbuf = None
comm.Gatherv(sendbuf=sendbuf, recvbuf=(recvbuf, sendcounts), root=root)
if rank == root:
print("Gathered array: {}".format(recvbuf))
如您所見,mpi4py不會將sendcount或recvcounts作為額外的參數,而是作為recvbuf
參數中的元組/列表。 如果傳遞(recvbuf, sendcounts)
,它將從recvbuf派生類型。 將進行位移/偏移,以便將所有等級的數據連續存儲並按等級排序。
基本上,mpi4py會瘋狂猜測各種形式的recvbuf
參數可能意味着什么。 完整明確的形式是(buffer, counts, displacements, type)
。
KeyError
: 一個相當混亂的命名為rout_array
是一個set
,它不是numpy.array
的有效輸入。 set
既不是序列,也沒有數組接口。 不幸的是, numpy.array
沒有失敗, numpy.array
創建了一個非常奇怪的沒有尺寸的ndarray
對象。 您可以將數組創建包裝在列表中:
send_buffer = numpy.array(list(rout_array))
集體工作,但是循環不會終止,考慮到在DetermineRoutingNumber
中的while true
循環中沒有return
或break
,這並不奇怪。
聲明:本站的技術帖子網頁,遵循CC BY-SA 4.0協議,如果您需要轉載,請注明本站網址或者原文地址。任何問題請咨詢:yoyou2525@163.com.