创建一个numpy.ndarray，其元素由子类numpy.ndarray组成

Question

I am trying to create a numpy array of subclassed numpy arrays. 我正在尝试创建一个numpy数组的子类numpy数组。 Unfortunately, when I create my new array of subclasses, numpy automatically upcasts the elements of my array to numpy.ndarray . 不幸的是，当我创建我的子类的新阵，numpy的自动upcasts我的数组中的元素numpy.ndarray 。

The code below shows what I am trying to do. 下面的代码显示了我想要做的事情。 dummy_class inherits from numpy.ndarray and contains some extra functionality(which is not important for the problem at hand). dummy_class继承自numpy.ndarray并包含一些额外的功能（这对于手头的问题并不重要）。 I create two new arrays using the dummy_class constructor and want to put each of these subclassed arrays in a new numpy_ndarray . 我使用dummy_class构造函数创建了两个新数组，并希望将每个子类数组放在一个新的numpy_ndarray 。 When the problematic array gets initialized, the type of the subclassed arrays gets automatically upcast from dummy_class to numpy.ndarray . 当有问题的数组被初始化时，子类化数组的类型会自动从dummy_class 向上 dummy_class为numpy.ndarray 。 Some code to reproduce the problem can be found below 可以在下面找到重现问题的一些代码

import numpy

class dummy_class(numpy.ndarray):
    def __new__(cls, data, some_attribute):
        obj = numpy.asarray(data).view(cls)
        obj.attribute = some_attribute
        return obj

array_1 = dummy_class([1,2,3,4], "first dummy")
print type(array_1)
# <class '__main__.dummy_class'>

array_2 = dummy_class([1,2,3,4], "second dummy")
print type(array_2)
# <class '__main__.dummy_class'>

the_problem = numpy.array([array_1, array_2])
print type(the_problem)
# <type 'numpy.ndarray'>
print type(the_problem[0])
# <type 'numpy.ndarray'>
print type(the_problem[1])
# <type 'numpy.ndarray'>

Answer 1

This is how you can fill a NumPy array with arbitrary Python objects: 这是使用任意Python对象填充NumPy数组的方法：

the_problem = np.empty(2, dtype='O')
the_problem[:] = [array_1, array_2]

I agree with iluengo that making a NumPy array of arrays is not taking advantage of NumPy's strengths because doing so requires the outer NumPy array to be of dtype object . 我同意iluengo的说法，制作NumPy数组阵列并没有利用NumPy的优势，因为这样做需要外部NumPy数组是dtype object 。 Object arrays require about the same amount of memory as a regular Python list, require more time to build than an equivalent Python list, are no faster at computation than an equivalent Python list. 对象数组需要与常规Python列表大约相同的内存量，需要比等效Python列表更多的时间来构建，计算速度不比等效的Python列表快。 Perhaps their only advantage is that they offer the ability to use NumPy array indexing syntax. 也许他们唯一的优势是他们提供了使用NumPy数组索引语法的能力。

Answer 2

Please refer to the official example of the numpy documentation, here . 请参阅这里的numpy文档的官方示例。

I think the main ingredient missing above is an implementation of __array_finalize__() . 我认为上面缺少的主要成分是__array_finalize__() 。

The example InfoArray() provided in the link correctly works as expected, without the hack of having to specify the dtype of the newly created array as argument: 链接中提供的示例InfoArray()正常工作正常，没有必须指定新创建的数组的dtype作为参数：

shape1 = (2,3)
array_1 = InfoArray(shape1)
print type(array_1)
#<class '__main__.InfoArray'>

shape2 = (1,2)
array_2 = dummy_class(shape2)
the_problem = numpy.array([array_1, array_2])
print type(the_problem)
#<type 'numpy.ndarray'>

print type(the_problem[0])
#<class '__main__.InfoArray'>

Moreover, it is useful to subclass a numpy array, and to aggregate many of them into a larger array like the_problem as reported above if the the resulting aggregate is a numpy array that is not of type object . 此外，如果生成的聚合是一个非类型为object的numpy数组，那么将numpy数组子类化并将它们中的许多聚合成一个更大的数组（如the_problem所报告的the_problem是很有用的。

As an example, say that array_1 and array_2 have the same shape: 例如，假设array_1和array_2具有相同的形状：

shape = (2,3)
array_1 = InfoArray(shape)
array_2 = InfoArray(shape)
the_problem = numpy.array([array_1, array_2])

Now the dtype of the_problem is not an object, and you can efficiently calculate for example the min as the_problem.min() . 现在dtype的the_problem不是一个对象，并可以有效地计算例如作为分the_problem.min() You can't do this if you use lists of your subclassed numpy arrays. 如果使用子类numpy数组的列表，则无法执行此操作。

创建一个numpy.ndarray，其元素由子类numpy.ndarray组成

问题描述

2 个解决方案

解决方案1
3 2015-01-09 12:47:15

解决方案2
0 2015-01-09 12:31:04

创建一个numpy.ndarray，其元素由子类numpy.ndarray组成

问题描述

2 个解决方案

解决方案1 3 2015-01-09 12:47:15

解决方案2 0 2015-01-09 12:31:04

解决方案1
3 2015-01-09 12:47:15

解决方案2
0 2015-01-09 12:31:04