在没有 for 循环的情况下，将包含 if 的函数应用于 pandas 中数据帧的每一行

Question

Given a dataframe, I want to get the nonzero values of each row and then find the minimum of absolute values.给定一个数据框，我想获取每行的非零值，然后找到绝对值的最小值。 I want to have a user defined function that does this for me.我想要一个用户定义的函数来为我做这件事。 Also, I do not want to use any for loop since the data is big.另外，我不想使用任何 for 循环，因为数据很大。

My try我的尝试

np.random.seed(5)
data = np.random.randn(16)
mask = np.random.permutation(16)[:6]
data[mask] = 0
df = pd.DataFrame(data.reshape(4,4))

          0         1         2         3
0  0.441227 -0.330870  2.430771  0.000000
1  0.000000  1.582481 -0.909232 -0.591637
2  0.000000 -0.329870 -1.192765  0.000000
3  0.000000  0.603472  0.000000 -0.700179


def udf(x):
  if x != 0:
    x_min = x.abs().min()
  return x_min
df.apply(udf, axis=1)

I get ValueError: The truth value of a Series is ambiguous. Use a.empty, a.bool(), a.item(), a.any() or a.all().我得到ValueError: The truth value of a Series is ambiguous. Use a.empty, a.bool(), a.item(), a.any() or a.all(). ValueError: The truth value of a Series is ambiguous. Use a.empty, a.bool(), a.item(), a.any() or a.all().

Question How can I solve the above?问题我该如何解决上述问题？

The desired answer is the following:期望的答案如下：

Answer 1

You can use x.ne(0) as boolean indexing to filter row您可以使用x.ne(0)作为布尔索引来过滤行

res = df.apply(lambda x: x[x.ne(0)].abs().min(), axis=1)

print(res)

0    0.330870
1    0.591637
2    0.329870
3    0.603472
dtype: float64

Or use min(axis=1)或使用min(axis=1)

res = df[df.ne(0)].abs().min(axis=1)

在没有 for 循环的情况下，将包含 if 的函数应用于 pandas 中数据帧的每一行

问题描述

1 个解决方案

解决方案1
5 已采纳 2022-06-05 08:41:39

在没有 for 循环的情况下，将包含 if 的函数应用于 pandas 中数据帧的每一行

问题描述

1 个解决方案

解决方案1 5 已采纳 2022-06-05 08:41:39

解决方案1
5 已采纳 2022-06-05 08:41:39