Is there a way to broadcast boolean masks?

Question

I'm trying to reduce the number of calculations I do based on search distance. I have N nodes and an [NxN] boolean mask that tells me what nodes are within X distance of the other nodes with T true values.

I also have [Nx(d)] data for each node, where (d) can be (1) , (3) , or (3x3) . I want the "sparse" format which is a [Tx(d)] array so I can do vectorized calculations along the 0 axis. Right now I do this:

sparseData=data.repeat(data.shape[0],axis=0).reshape(np.concatenate(([data.shape[0],data.shape])))[mask]

Which works, but causes memory errors if N is too big, due to the [NxNx(d)] array I'm creating with .repeat Is there a way to broadcast this? If I do this:

data[None,...][mask]

It doesn't work, but it seems like there has to be a more efficient way to do this.

Answer 1

Instead of repeating the data you can make a view with numpy.broadcast_to :

sparseData = np.broadcast_to(data, (data.shape[0],) + data.shape)[mask]

However, even easier would be to select the rows of data based on index:

I, J = np.nonzero(mask)
sparseData = data[I]  # could also use J

Is there a way to broadcast boolean masks?

Question

1 answers

solution1
3 ACCPTED 2017-01-19 10:39:12

Is there a way to broadcast boolean masks?

Question

1 answers

solution1 3 ACCPTED 2017-01-19 10:39:12

solution1
3 ACCPTED 2017-01-19 10:39:12