Python Pandas，如何对字典列表进行分组和排序

Question

我有一个 dict 列表，如：

data = [
    {'ID': '000681', 'type': 'B:G+',  'testA': '11'},
    {'ID': '000682', 'type': 'B:G+',  'testA': '-'},
    {'ID': '000683', 'type': 'B:G+',  'testA': '13'},
    {'ID': '000684', 'type': 'B:G+',  'testA': '14'},
    {'ID': '000681', 'type': 'B:G+',  'testB': '15'},
    {'ID': '000682', 'type': 'B:G+',  'testB': '16'},
    {'ID': '000683', 'type': 'B:G+',  'testB': '17'},
    {'ID': '000684', 'type': 'B:G+',  'testB': '-'}
]

如何使用 Pandas 获取如下数据：

data = [
    {'ID': '000683', 'type': 'B:G+',  'testA': '13',  'testB': '17'},
    {'ID': '000681', 'type': 'B:G+',  'testA': '11',  'testB': '15'},
    {'ID': '000684', 'type': 'B:G+',  'testA': '14',  'testB': '-'},
    {'ID': '000682', 'type': 'B:G+',  'testA': '-',  'testB': '16'}

]

与一个 col 相同的ID和相同的type testA和testB值排序

sorted ： testA和testB都在顶部有testA+testB值和较大的值。

Answer 1

首先将列转换为数字，将非数字替换为整数，然后聚合sum ：

df = pd.DataFrame(data)    
c = ['testA','testB']
df[c] = df[c].apply(lambda x: pd.to_numeric(x, errors='coerce'))

df1 = df.groupby(['ID','type'])[c].sum(min_count=1).sort_values(c).fillna('-').reset_index()
print (df1)
       ID  type testA testB
0  000681  B:G+    11    15
1  000683  B:G+    13    17
2  000684  B:G+    14     -
3  000682  B:G+     -    16

如果Series.argsort两列的总和排序，请使用Series.argsort ：

df = pd.DataFrame(data)
c = ['testA','testB']
df[c] = df[c].apply(lambda x: pd.to_numeric(x, errors='coerce'))

df2 = df.groupby(['ID','type'])[c].sum(min_count=1)
df2 = df2.iloc[(-df2).sum(axis=1).argsort()].fillna('-').reset_index()
print (df2)
       ID  type testA testB
0  000683  B:G+    13    17
1  000681  B:G+    11    15
2  000682  B:G+     -    16
3  000684  B:G+    14     -

Python Pandas，如何对字典列表进行分组和排序

问题描述

1 个解决方案

解决方案1
2 2020-01-16 10:01:54

Python Pandas，如何对字典列表进行分组和排序

问题描述

1 个解决方案

解决方案1 2 2020-01-16 10:01:54

解决方案1
2 2020-01-16 10:01:54