Pandas groupby -> 聚合 - 两列的 function

Question

I'm using pandas aggregate as folows:我正在使用 pandas aggregate如下：

In [6]: gb = df.groupby(['col1', 'col2'])
   ...: counts = gb.size().to_frame(name='counts')
   ...: (counts
   ...:  .join(gb.agg({'col3': 'mean'}).rename(columns={'col3': 'col3_mean'}))
   ...:  .join(gb.agg({'col4': 'median'}).rename(columns={'col4': 'col4_median'}))
   ...:  .join(gb.agg({'col4': 'min'}).rename(columns={'col4': 'col4_min'}))
   ...:  .reset_index()
   ...: )

How can I add one more column which will contain sum of values col3 * col4 ?如何再添加一列，其中包含值col3 * col4的总和？

Answer 1

First create column new before groupby and then aggregate sum , your solution rewritten in named aggregation is:首先在groupby之前创建 column new然后聚合sum ，您在命名聚合中重写的解决方案是：

counts = (df.assign(new = df['col3'] * df['col4'])
            .groupby(['col1', 'col2'], as_index=False)
            .agg(counts=('col1','size'), 
                 col3_mean=('col3','mean'), 
                 col4_median=('col4','median'), 
                 col4_min=('col4','min'), 
                 both_sum=('new','sum')))

Pandas groupby -> 聚合 - 两列的 function

问题描述

1 个解决方案

解决方案1
0 已采纳 2022-03-11 13:29:31

Pandas groupby -> 聚合 - 两列的 function

问题描述

1 个解决方案

解决方案1 0 已采纳 2022-03-11 13:29:31

解决方案1
0 已采纳 2022-03-11 13:29:31