Pandas groupby cumulative sum ignoring the current row

Question

I know there are some questions on this topic (like "Pandas: Cumulative sum of one column based on value of another"); however, none of them fulfills my requirements.

Let's say I have a dataframe with Month and Cost columns (generated by the code further down).

I want to compute the cumulative sum of Cost grouped by Month, excluding the current row's value, in order to get the Desired column. By using groupby and cumsum I obtain the column CumSum, which includes the current row.

The code to generate the dataframe is:

import pandas as pd

df = pd.DataFrame({'Month': [1,1,1,2,2,1,3],
                   'Cost': [5,8,10,1,3,4,1]})
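
For reference, here is a minimal sketch of the groupby plus cumsum attempt described above, assuming the dataframe from the question. The column name CumSum follows the question's wording. The running total is inclusive, so each row's own Cost is counted.

```python
import pandas as pd

df = pd.DataFrame({'Month': [1, 1, 1, 2, 2, 1, 3],
                   'Cost': [5, 8, 10, 1, 3, 4, 1]})

# Inclusive running total per month: the current row's Cost is part of the sum.
df['CumSum'] = df.groupby('Month')['Cost'].cumsum()

# CumSum -> [5, 13, 23, 1, 4, 27, 1]
print(df)
```

The last Month 1 row (index 5) gets 27 because the running total follows row order within each group, not adjacency in the frame.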

Answer 1

IIUC, you can use groupby.cumsum and then just subtract Cost:

df['cumsum_'] = df.groupby('Month').Cost.cumsum().sub(df.Cost)

print(df)

    Month  Cost  cumsum_
0      1     5        0
1      1     8        5
2      1    10       13
3      2     1        0
4      2     3        1
5      1     4       23
6      3     1        0
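
The subtraction works because the inclusive running total equals the sum of the earlier rows plus the current row. As a sanity check, here is a small sketch that compares the one-liner with a brute-force loop. The helper name exclusive_sum_slow is hypothetical and only there for checking.

```python
import pandas as pd

df = pd.DataFrame({'Month': [1, 1, 1, 2, 2, 1, 3],
                   'Cost': [5, 8, 10, 1, 3, 4, 1]})

# Vectorised form: inclusive cumulative sum per month minus the row's own Cost.
fast = df.groupby('Month')['Cost'].cumsum().sub(df['Cost'])

def exclusive_sum_slow(frame):
    """Hypothetical reference: sum Cost over earlier rows with the same Month."""
    totals = []
    for i in range(len(frame)):
        earlier = frame.iloc[:i]
        same_month = earlier[earlier['Month'] == frame['Month'].iloc[i]]
        totals.append(int(same_month['Cost'].sum()))
    return totals

slow = exclusive_sum_slow(df)

# Both approaches should agree row by row.
print(fast.tolist() == slow)
```

The loop is quadratic in the number of rows, so it is only useful for checking small examples.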

Answer 2

You can do the following:

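# previous Cost within the same Month (0 for the first row of each Month)
df['agg'] = df.groupby('Month')['Cost'].shift().fillna(0)
# accumulate the shifted values per Month so the current row is excluded
df['Cumsum'] = df.groupby('Month')['agg'].cumsum()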
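
A related variant swaps the order: take the cumulative sum within each month first, then shift it down one row inside the same group. This is only a sketch, assuming the question's dataframe and a pandas version where Series.shift accepts fill_value (0.24 or later, as far as I know). The column name Desired is borrowed from the question.

```python
import pandas as pd

df = pd.DataFrame({'Month': [1, 1, 1, 2, 2, 1, 3],
                   'Cost': [5, 8, 10, 1, 3, 4, 1]})

# Within each month: the running total, moved down one row, with 0 filling the
# first row. Using fill_value avoids NaN, so the column stays integer.
df['Desired'] = (
    df.groupby('Month')['Cost']
      .transform(lambda s: s.cumsum().shift(fill_value=0))
)

# Desired -> [0, 5, 13, 0, 1, 23, 0]
print(df)
```

The lambda runs once per group, so with many small groups the fully vectorised subtraction is likely faster.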

Answer 1: 3, 2020-03-16 15:51:18

Answer 2: 1, accepted, 2020-03-16 15:53:39
