简体   繁体   English

如何通过随机选择2x2子阵列中的元素对2D阵列进行下采样?

[英]How to downsample a 2D array by randomly selecting elements in 2x2 sub-arrays?

I have a 2n x 2m numpy array. 我有一个2n x 2m的 numpy数组。 I would like to form a nxm array by selecting randomly one element in 2 x 2 non-overlapping sub-arrays that partition my initial array. 我想通过在2 x 2非重叠子阵列中随机选择一个元素来形成一个nxm数组,该子阵列分割我的初始数组。 What would be the best way to do so? 最好的方法是什么? Is there a way to avoid two for loops (one along each dimension)? 有没有办法避免两个for循环(每个维度一个)?

For example, if my array is 例如,如果我的数组是

1 2 3 4
5 6 7 8
9 0 1 2
8 5 7 0

then, there are four 2 x 2 sub-arrays that partition it: 那么,有四个2 x 2子阵列对它进行分区:

1 2    3 4
5 6    7 8

9 0    1 2
8 5    7 0

and I would like to pick up randomly one element in each of them to form new arrays, such as 我想在每个元素中随机拾取一个元素以形成新的数组,例如

5 3  ,  6 8  ,  2 3
9 2     9 1     0 0  .

Thank you for your time. 感谢您的时间。

This can be done by sampling. 这可以通过抽样来完成。 Instead of sampling each 2x2 square, we sample the entire ndarray into 4 separate ndarray, where the same index within those sub-arrays will point within the same 2x2 square. 我们不是对每个2x2平方进行采样,而是将整个ndarray采样为4个单独的ndarray,其中这些子阵列中的相同索引将指向相同的2x2平方。 And then we randomly sample from those 4 separate ndarray: 然后我们从那4个单独的ndarray中随机抽样:

# create test dataset
test = np.arange(36).reshape(6,6)

array([[ 0,  1,  2,  3,  4,  5],
       [ 6,  7,  8,  9, 10, 11],
       [12, 13, 14, 15, 16, 17],
       [18, 19, 20, 21, 22, 23],
       [24, 25, 26, 27, 28, 29],
       [30, 31, 32, 33, 34, 35]])

# Create subsamples from ndarray
samples = np.array([test[::2, ::2], test[1::2, 1::2], test[::2, 1::2], test[1::2, ::2]])
>>> samples
array([[[ 0,  2,  4],
        [12, 14, 16],
        [24, 26, 28]],

       [[ 7,  9, 11],
        [19, 21, 23],
        [31, 33, 35]],

       [[ 1,  3,  5],
        [13, 15, 17],
        [25, 27, 29]],

       [[ 6,  8, 10],
        [18, 20, 22],
        [30, 32, 34]]])

Now the same index of each of these 4 subsamples point to the same 2x2 square on the original ndarray. 现在,这4个子样本中每个子样本的相同索引指向原始ndarray上的相同2x2平方。 We just need to select from the same index randomly: 我们只需要随机选择相同的索引:

# Random choice sampling between these 4 subsamples.
select = np.random.randint(4,size=(3,3))
>>> select
array([[2, 2, 1],
       [3, 1, 1],
       [3, 0, 0]])

result = select.choose(samples)
>>> result

array([[ 1,  3, 11],
       [18, 21, 23],
       [30, 26, 28]])

I got blockshaped function from another answer . 我从另一个答案得到了blockhaped函数。 This answer assumes that size of your original array is appropriate for the operation. 此答案假定原始数组的大小适合于该操作。

import numpy as np

def blockshaped(arr, nrows, ncols):
    """
    Return an array of shape (n, nrows, ncols) where
    n * nrows * ncols = arr.size

    If arr is a 2D array, the returned array should look like n subblocks with
    each subblock preserving the "physical" layout of arr.
    """
    h, w = arr.shape
    return (arr.reshape(h//nrows, nrows, -1, ncols)
               .swapaxes(1,2)
               .reshape(-1, nrows, ncols))


arr = np.array([[1,2,3,4],[5,6,7,8],[9,0,1,2],[8,5,7,0]])

#  arr is an 2d array with dimension mxn
m = arr.shape[0]
n = arr.shape[1]

#  define blocksize
block_size = 2

#  divide into sub 2x2 arrays
#  blocks is a (Nx2x2) array
blocks = blockshaped(arr, block_size, block_size)

#  select random elements from each block to form new array
num_blocks = block_size**2
new_arr = blocks[np.arange(num_blocks), np.random.randint(low=0, high=2, size=num_blocks), np.random.randint(low=0, high=2,size=num_blocks)]

print("original array:")
print(arr)

print("random pooled array:")
print(new_arr)

声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.

 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM