python pandas按组排序

Question

Each row in my DataFrame is a user vote entry for a restaurant. 我的DataFrame中的每一行都是餐厅的用户投票项。 The data look like 数据看起来像

id   cuisine    
91   american   
3    american   
91   american   
233  cuban      
233  cuban      
2    cuban

where id refers to the restaurant. 其中id是指餐厅。

I want to get something like the following 我想得到类似以下内容

american  91   100
          3    30
          12   10
cuban     233  80
          2    33
mexican   22   99
          8    98
          21   82

where the 2nd column is the id , and the 3rd column is the number of rows in the DataFrame for that id . 其中第二列是id ，第三列是该id在DataFrame中的行数。 In other words, sort by the number of rows, but group by cuisine. 换句话说，按行数排序，但按美食分组。 I tried 我试过了

g = df.groupby(['cuisine', 'id'])
c = g.size().sort_values(ascending=False)

But the order of the cuisines is mixed. 但是美食的顺序是混杂的。

Answer 1

is that what you want? 那是你要的吗？

In [2]: df
Out[2]:
    id   cuisine
0   91  american
1    3  american
2   91  american
3  233     cuban
4  233     cuban
5    2     cuban

In [3]: df.groupby(['cuisine', 'id']).size()
Out[3]:
cuisine   id
american  3      1
          91     2
cuban     2      1
          233    2
dtype: int64

or as a data frame: 或作为数据框：

In [10]: df.groupby(['cuisine', 'id']).size().reset_index(name='count').sort_values(['cuisine', 'count'], ascending=[1,0])
Out[10]:
    cuisine   id  count
1  american   91      2
0  american    3      1
3     cuban  233      2
2     cuban    2      1

Answer 2

use value_counts after group_by followed by sort_index 在group_by之后使用value_counts ，后跟sort_index

# ascending=[1, 0] says True for level[0], False for level[1]
df.groupby('cuisine').id.value_counts().sort_index(ascending=[1, 0])

cuisine   id 
american  91     2
          3      1
cuban     233    2
          2      1
Name: id, dtype: int64

python pandas按组排序

问题描述

2 个解决方案

解决方案1
2 2016-07-19 16:56:16

解决方案2
2 已采纳 2016-07-19 17:27:51

python pandas按组排序

问题描述

2 个解决方案

解决方案1 2 2016-07-19 16:56:16

解决方案2 2 已采纳 2016-07-19 17:27:51

解决方案1
2 2016-07-19 16:56:16

解决方案2
2 已采纳 2016-07-19 17:27:51