How to use group-by in Pandas?

Question

My DataFrame looks like this (two columns, col1 and col2):

I want to group by col1:

pd.DataFrame(combined.groupby('col1').aggregate(np.mean)['col2'])

This returns a DataFrame with only one column, col2. What I actually want is a DataFrame with two columns, like this:

col1,mean(col2),

Could somebody point out what I have to do to achieve this?

Answer 1

You can use groupby, aggregate with mean, and then call reset_index:

print(df.groupby('col1')['col2'].mean().reset_index())
   col1  col2
0     1   150
1     2   150
2     3   170
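
To see why reset_index is needed, here is a minimal self-contained sketch. The sample values in df are made up (the question does not show its data) and were chosen only so that the group means match the output above.

```python
import pandas as pd

# Hypothetical sample data; the original question's data is not shown
df = pd.DataFrame({
    "col1": [1, 1, 2, 2, 3, 3],
    "col2": [100, 200, 100, 200, 140, 200],
})

# Selecting 'col2' and taking the mean yields a Series whose index is col1,
# which is why col1 does not appear as a regular column
means = df.groupby("col1")["col2"].mean()

# reset_index() moves col1 out of the index and back into a column
result = means.reset_index()
print(result)
```

Note that mean() returns floating-point values, so a recent pandas may display 150.0 rather than 150.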

Alternatively, pass the parameter as_index=False to groupby, as John Galt mentioned:

print(df.groupby('col1', as_index=False)['col2'].mean())
   col1  col2
0     1   150
1     2   150
2     3   170
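
As a quick check, the sketch below compares the as_index=False form with the reset_index form on made-up data (the values in df are an assumption, not the asker's data). Both should give the same two columns.

```python
import pandas as pd

# Hypothetical sample data; the original question's data is not shown
df = pd.DataFrame({
    "col1": [1, 1, 2, 2, 3, 3],
    "col2": [100, 200, 100, 200, 140, 200],
})

# Group keys become the index first, then are restored as a column
via_reset = df.groupby("col1")["col2"].mean().reset_index()

# Group keys are kept as a regular column from the start
via_flag = df.groupby("col1", as_index=False)["col2"].mean()

print(list(via_reset.columns), list(via_flag.columns))
```

Which one to use is mostly a matter of taste; as_index=False saves one method call.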

Solution with aggregate:

print(df.groupby('col1', as_index=False).aggregate({'col2': 'mean'}))
   col1  col2
0     1   150
1     2   150
2     3   170
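
If the result column should carry a name that reflects the aggregation, as the question's "col1,mean(col2)" header suggests, named aggregation is one possible option. This is only a sketch: it assumes pandas 0.25 or later, the data in df is made up, and the output name mean_col2 is a hypothetical choice.

```python
import pandas as pd

# Hypothetical sample data; the original question's data is not shown
df = pd.DataFrame({
    "col1": [1, 1, 2, 2, 3, 3],
    "col2": [100, 200, 100, 200, 140, 200],
})

# Named aggregation: output_name=(input_column, aggregation_function)
# 'mean_col2' is an illustrative name, not something from the question
result = (
    df.groupby("col1")
      .agg(mean_col2=("col2", "mean"))
      .reset_index()
)
print(result)
```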

See also: Aggregation in the pandas docs.

Solution 1
0 2016-04-08 04:17:33
