Calculating the distance between rows in a pandas DataFrame

Question

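I have a pandas DataFrame filled with zeros, except for a few 1.0 values. For each row, I want to compute the distance (in rows) to the next occurrence of 1.0. Any idea how to do this?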

Input DataFrame:

Expected output DataFrame:

Answer 1

Use:

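import pandas as pd

# Sample data reconstructed from the col1 values in the output shown below
df = pd.DataFrame({'col1': [0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 0.0]})

df['new'] = df.groupby(df['col1'].eq(1).iloc[::-1].cumsum()).cumcount(ascending=False)
print(df)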
   col1  new
0   0.0    4
1   0.0    3
2   0.0    2
3   0.0    1
4   1.0    0
5   0.0    2
6   0.0    1
7   1.0    0
8   0.0    0
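
One thing to note in the output above: the last row gets 0 even though no 1.0 follows it, so it is indistinguishable from a row that actually contains 1.0. If that matters, a minimal sketch along the following lines could mark such rows as NaN instead. The helper name `distance_to_next_one` is hypothetical and not part of pandas, and the sample data is assumed to match the `col1` values shown above.

```python
import pandas as pd

def distance_to_next_one(col: pd.Series) -> pd.Series:
    """Rows until the next 1.0; NaN where no 1.0 follows (hypothetical helper)."""
    is_one = col.eq(1)
    # Reverse, take the cumulative sum, then restore the original row order.
    # Rows sharing a label share the same "next 1.0".
    groups = is_one.iloc[::-1].cumsum().iloc[::-1]
    dist = col.groupby(groups).cumcount(ascending=False)
    # Label 0 means there is no 1.0 at or after this row: distance undefined.
    return dist.where(groups.ne(0))

df = pd.DataFrame({'col1': [0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 0.0]})
result = distance_to_next_one(df['col1'])
print(result)
```

Because NaN is introduced, the result is a float column rather than an integer one.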

Explanation:

First, compare the column with 1 using Series.eq:

print(df['col1'].eq(1))
0    False
1    False
2    False
3    False
4     True
5    False
6    False
7     True
8    False
Name: col1, dtype: bool

Then reverse the row order with Series.iloc:

print(df['col1'].eq(1).iloc[::-1])
8    False
7     True
6    False
5    False
4     True
3    False
2    False
1    False
0    False
Name: col1, dtype: bool

Create the group labels with Series.cumsum, so that every row up to and including a 1.0 gets the same label:

print(df['col1'].eq(1).iloc[::-1].cumsum())
8    0
7    1
6    1
5    1
4    2
3    2
2    2
1    2
0    2
Name: col1, dtype: int32
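
The labels above are still in reversed row order, which may look like a problem for the next step. It is not, because groupby aligns a Series key to the DataFrame by index label rather than by position. The sketch below is an illustrative check of that, using sample data assumed to match the `col1` values in this answer; the variable names are made up for the example.

```python
import pandas as pd

df = pd.DataFrame({'col1': [0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 0.0]})

# Labels in reversed row order, exactly as in the step above.
reversed_labels = df['col1'].eq(1).iloc[::-1].cumsum()
# The same labels sorted back into the original row order.
ordered_labels = reversed_labels.sort_index()

a = df.groupby(reversed_labels).cumcount(ascending=False)
b = df.groupby(ordered_labels).cumcount(ascending=False)

# Both keys yield the same result because alignment is done on the index.
same = a.equals(b)
print(ordered_labels.tolist())
print(same)
```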

Pass the group labels to DataFrame.groupby and call GroupBy.cumcount with ascending=False, which counts down to 0 at the last row of each group:

print(df.groupby(df['col1'].eq(1).iloc[::-1].cumsum()).cumcount(ascending=False))
0    4
1    3
2    2
3    1
4    0
5    2
6    1
7    0
8    0
dtype: int64
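
As a cross-check, the same distances can be computed with a different technique: a NumPy sketch based on `np.searchsorted` over the positions of the 1.0 values. This is not the method used in the answer, only an independent way to verify it. The column name `new_np` and the sample data are assumptions for illustration, and rows with no later 1.0 are set to NaN here rather than 0.

```python
import numpy as np
import pandas as pd

df = pd.DataFrame({'col1': [0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 0.0]})

values = df['col1'].to_numpy()
positions = np.arange(len(values))
one_positions = np.flatnonzero(values == 1)  # row positions holding 1.0

# For each row, find the slot in one_positions of the first 1.0 at or after it.
nxt = np.searchsorted(one_positions, positions, side='left')
has_next = nxt < len(one_positions)

# Distance is undefined (NaN) where no 1.0 follows.
dist = np.full(len(values), np.nan)
dist[has_next] = one_positions[nxt[has_next]] - positions[has_next]
df['new_np'] = dist
print(df)
```

This version works on row positions and ignores the index labels, so it assumes the rows are already in the order in which the distance should be measured.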

Solution 1
2 · Accepted · 2019-04-12 12:20:15
