在numpy数组的仅一列中计数非零

Question

I have a Numpy array that is created as follows 我有一个创建如下的Numpy数组

data=np.zeros(500,dtype='float32, (50000,2)float32')

This array is filled with values that I acquire from some measurements, and is supposed to reflect that during each time point (room for 500 time points) we can acquire 50.000 x- and y- coords. 这个数组充满了我从一些测量中获得的值，并且应该反映出在每个时间点（500个时间点的空间）内，我们可以获取50.000 x和y坐标。

Later in my code is use a bisect -like search for which I need to know howmany X-coords (measurement points) are actually in my array which I originally did with np.count_nonzero(data) , this yielded the following problem: 稍后在我的代码中使用类似bisect的搜索，为此我需要知道实际上我最初使用np.count_nonzero(data)数组中实际上有多少X坐标（测量点np.count_nonzero(data) ，这产生了以下问题：

the non zero count returns 18 values here, the code then goes into the bisect -like search using data[time][1][0][0] as min X-coord and data[time][1][(np.count_nonzero(data)][0] as max x-coord which results in the array stopping at 9 instead of 10. 非零计数这里返回18倍的值，代码然后进入bisect样使用搜索data[time][1][0][0]为min X-坐标和data[time][1][(np.count_nonzero(data)][0]作为最大x坐标，导致数组从9而不是10停止。

I could use a while loop to manually count non-zero values (in the X-coord column) in the array but that would be silly, I assume that there is some builtin numpy functionality for this. 我可以使用while循环来手动计算数组中的非零值（在X坐标列中），但这很愚蠢，我假设有一些内置的numpy功能。 My question is then what builtin functionality or modification of my np.count_nonzero(data) I need since the documentation doesn't offer much information in that regards ( link to numpy doc). 然后我的问题是我需要什么内置功能或对我的np.count_nonzero(data)进行修改，因为文档在这方面没有提供太多信息（链接到numpy doc）。

-- Simplified question -- -简化的问题-

Can I use Numpy functionality to count the non-zero values for a singular column only? 我可以使用Numpy功能为单个列计算非零值吗？ (ie between data[time][1][0][0] and data[time][1][max][0] ) （即在data[time][1][0][0]和data[time][1][max][0] ）

Answer 1

Maybe a better approach would be to filter the array using nonzero and iterate over the result: 也许更好的方法是使用nonzero过滤数组并遍历结果：

nonZeroData = data[np.nonzero(data[time][1])]

To count zeros only from the second column: 要仅从第二列计算零：

nonZeroYCount = np.count_nonzero(data[time][1][:, 1])

Answer 2

If I understand you correctly, to select elements from data[time][1][0][0] to data[time][1][max][0] : 如果我理解正确，请从data[time][1][0][0]到data[time][1][max][0]选择元素：

data[time][1][:max+1,0]

EDIT : 编辑：

To count all non-zero for every time: 要每次都计算所有非零值：

(data["f1"][:,:,0] != 0).sum(1)

Answer 3

Why not consider using data != 0 to get the bool matrix? 为什么不考虑使用data != 0来获取bool矩阵？

You can use: 您可以使用：

stat = sum(data != 0) to count the non-zero entries. stat = sum(data != 0)计算非零条目。

I am not sure what shape your data array has but hope you can see what I mean. 我不确定data数组的形状，但希望您能明白我的意思。 :) :)

在numpy数组的仅一列中计数非零

问题描述

3 个解决方案

解决方案1
2 已采纳 2013-12-06 12:17:17

解决方案2
1 2013-12-06 12:27:07

解决方案3
0 2013-12-06 12:38:45

在numpy数组的仅一列中计数非零

问题描述

3 个解决方案

解决方案1 2 已采纳 2013-12-06 12:17:17

解决方案2 1 2013-12-06 12:27:07

解决方案3 0 2013-12-06 12:38:45

解决方案1
2 已采纳 2013-12-06 12:17:17

解决方案2
1 2013-12-06 12:27:07

解决方案3
0 2013-12-06 12:38:45