使用唯一列值作为键转换Pandas Dataframe to_dict（）

Question

How can I convert a pandas dataframe to a dict using unique column values as the keys for the dictionary? 如何使用唯一列值作为字典的键将pandas数据帧转换为dict？ In this case I want to use unique username's as the key. 在这种情况下，我想使用唯一的用户名作为密钥。

Here is my progress so far based on information found on here and online. 到目前为止，根据此处和在线信息，我的进展如下。

My test dataframe: 我的测试数据帧：

import pandas
import pprint

df = pandas.DataFrame({
    'username': ['Kevin', 'John', 'Kevin', 'John', 'Leslie', 'John'], 
    'sport': ['Soccer', 'Football', 'Racing', 'Tennis', 'Baseball', 'Bowling'],
    'age': ['51','32','20','19','34','27'],
    'team': ['Cowboyws', 'Packers', 'Sonics', 'Raiders', 'Wolves', 'Lakers']
})

I can create a dictionary by doing this: 我可以通过这样做来创建一个字典：

dct = df.to_dict(orient='records')
pprint.pprint(dct, indent=4)

>>>>[{'age': '51', 'sport': 'Soccer', 'team': 'Cowboyws', 'username': 'Kevin'},
    {'age': '32', 'sport': 'Football', 'team': 'Packers', 'username': 'John'},
    {'age': '20', 'sport': 'Racing', 'team': 'Sonics', 'username': 'Kevin'},
    {'age': '19', 'sport': 'Tennis', 'team': 'Raiders', 'username': 'John'},
    {'age': '34', 'sport': 'Baseball', 'team': 'Wolves', 'username': 'Leslie'},
    {'age': '27', 'sport': 'Bowling', 'team': 'Lakers', 'username': 'John'}]

I tried using the groupby and apply method which got me closer but it converts all the values to lists. 我尝试使用groupby和apply方法使我更接近，但它将所有值转换为列表。 I want them to remain as dictionaries so i can retain the each value's key: 我希望它们保留为字典，以便我可以保留每个值的键：

result = df.groupby('username').apply(lambda x: x.values.tolist()).to_dict()
pprint.pprint(result, indent=4)

{   'John': [   ['32', 'Football', 'Packers', 'John'],
                ['19', 'Tennis', 'Raiders', 'John'],
                ['27', 'Bowling', 'Lakers', 'John']],
    'Kevin': [   ['51', 'Soccer', 'Cowboyws', 'Kevin'],
                 ['20', 'Racing', 'Sonics', 'Kevin']],
    'Leslie': [['34', 'Baseball', 'Wolves', 'Leslie']]}

This is the desired result I want: 这是我想要的结果：

{   
    'John': [{'age': '32', 'sport': 'Football', 'team': 'Packers', 'username': 'John'},
             {'age': '19', 'sport': 'Tennis', 'team': 'Raiders', 'username': 'John'},
             {'age': '27', 'sport': 'Bowling', 'team': 'Lakers', 'username': 'John'}],
    'Kevin': [{'age': '51', 'sport': 'Soccer', 'team': 'Cowboyws', 'username': 'Kevin'},
              {'age': '20', 'sport': 'Racing', 'team': 'Sonics', 'username': 'Kevin'}],
    'Leslie': [{'age': '34', 'sport': 'Baseball', 'team': 'Wolves', 'username': 'Leslie'}]
}

Answer 1

Use groupby and apply . 使用groupby并apply 。 Inside the apply, call to_dict with the "records" orient (similar to what you've figured out already). 在apply中，使用“记录”方向调用to_dict （类似于你已经想到的）。

df.groupby('username').apply(lambda x: x.to_dict(orient='r')).to_dict()

Answer 2

I prefer using for loop here , also you may want to drop the username columns , since it is redundant 我更喜欢在这里使用for循环，你也可以drop username名列，因为它是多余的

d = {x: y.drop('username',1).to_dict('r') for x , y in df.groupby('username')}
d
Out[212]: 
{'John': [{'age': '32', 'sport': 'Football', 'team': 'Packers'},
  {'age': '19', 'sport': 'Tennis', 'team': 'Raiders'},
  {'age': '27', 'sport': 'Bowling', 'team': 'Lakers'}],
 'Kevin': [{'age': '51', 'sport': 'Soccer', 'team': 'Cowboyws'},
  {'age': '20', 'sport': 'Racing', 'team': 'Sonics'}],
 'Leslie': [{'age': '34', 'sport': 'Baseball', 'team': 'Wolves'}]}

使用唯一列值作为键转换Pandas Dataframe to_dict（）

问题描述

2 个解决方案

解决方案1
3 已采纳 2019-01-10 16:12:06

解决方案2
2 2019-01-10 16:15:25

使用唯一列值作为键转换Pandas Dataframe to_dict（）

问题描述

2 个解决方案

解决方案1 3 已采纳 2019-01-10 16:12:06

解决方案2 2 2019-01-10 16:15:25

解决方案1
3 已采纳 2019-01-10 16:12:06

解决方案2
2 2019-01-10 16:15:25