熊猫如何根据其他列中的值汇总一列的总和

Question

I am trying to sum values in a column by groupby on values in a second column, but meanwhile also considering values on a 3rd column, the df is like, 我试图对第二列中的值进行groupby ，但同时也考虑了第三列中的值， df就像，

id    memo    amount   
 1    pos     1.0 
 1    pos     2.0
 1    neg     3.0
 2    pos     4.0
 2    pos     5.0
 2    neg     6.0
 2    neg     7.0

I want to group by id and sum amount , but each group, if memo is pos it is positive and neg for negative, eg when groupby 1 , the total amount is 0, since -1.0 - 2.0 + 3.0 = 0 . 我想组由id和合计amount ，但每一组中，如果memo是pos它是正和neg负，例如，当groupby 1 ，总量为0时，由于-1.0 - 2.0 + 3.0 = 0 。

If I do df.groupby('id')['amount'].sum() , it only considers id and amount column, I am wondering how to also take memo into account here. 如果我执行df.groupby('id')['amount'].sum() ，则仅考虑id和amount列，我想知道如何在此处也考虑memo 。

so the result will look like, 所以结果看起来像

id    memo    amount    total_amount   
 1    pos     1.0       0.0
 1    pos     2.0       0.0
 1    neg     3.0       0.0
 2    pos     4.0       -4.0
 2    pos     5.0       -4.0
 2    neg     6.0       -4.0
 2    neg     7.0       -4.0

Answer 1

Splitting the operation in two steps, you can achieve what you want through 分两步进行操作，即可实现所需的目标

df['temp'] = np.where(df.memo == 'pos', df.amount, -df.amount)
df['total_amount'] = df.groupby('id').temp.transform(sum)

Answer 2

Another fun way with mapping and multiplying ie 映射和乘法的另一种有趣方式，即

df['new'] = (df.set_index('id')['memo'].map({'pos':1,'neg':-1})*df['amount'].values)\
            .groupby(level=0).transform(sum).values

Output : 输出：

   id memo  amount  new
0   1  pos     1.0  0.0
1   1  pos     2.0  0.0
2   1  neg     3.0  0.0
3   2  pos     4.0 -4.0
4   2  pos     5.0 -4.0
5   2  neg     6.0 -4.0
6   2  neg     7.0 -4.0

熊猫如何根据其他列中的值汇总一列的总和

问题描述

2 个解决方案

解决方案1
1 已采纳 2017-11-24 17:32:53

解决方案2
1 2017-11-24 18:18:47

熊猫如何根据其他列中的值汇总一列的总和

问题描述

2 个解决方案

解决方案1 1 已采纳 2017-11-24 17:32:53

解决方案2 1 2017-11-24 18:18:47

解决方案1
1 已采纳 2017-11-24 17:32:53

解决方案2
1 2017-11-24 18:18:47