将Pandas数据框转换为嵌套JSON的更快方法

Question

I have data that looks like this: 我有看起来像这样的数据：

player, goals, matches
ronaldo, 10, 5
messi, 7, 9

I want to convert this dataframe into a nested json, such as this one: 我想将此数据帧转换为嵌套的json，例如：

{
    "content":[
        {
            "player": "ronaldo",
            "events": {
                "goals": 10,
                "matches": 5
            }
        },
        {
            "player": "messi",
            "events": {
                "goals": 7,
                "matches": 9
            }
        }
    ]
}

This is my code, using list comprehension: 这是我的代码，使用列表理解：

df = pd.DataFrame([['ronaldo', 10, 5], ['messi', 7, 9]], columns=['player', 'goals', 'matches'])
d = [{'events': df.loc[ix, ['goals', 'matches']].to_dict(), 'player': df.loc[ix, 'player']} for ix in range(df.shape[0])]
j = {}
j['content'] = d

This works, but the performance is really slow when I have a lot of data. 这行得通，但是当我有很多数据时，性能确实很慢。 Is there a faster way to do it? 有更快的方法吗？

Answer 1

Use pandas.to_json . 使用pandas.to_json 。 Fast and easy https://pandas.pydata.org/pandas-docs/stable/reference/api/pandas.DataFrame.to_json.html 快速简便的https://pandas.pydata.org/pandas-docs/stable/reference/api/pandas.DataFrame.to_json.html

df.T.to_json()

Answer 2

try : 尝试：

df.to_json(orient = "records")

The problem is it doesn't stack goals and matches on the event column , I'm not sure though if you can do it without looping 问题是它没有在事件列上堆积目标和匹配项，但我不确定是否可以不循环而完成

将Pandas数据框转换为嵌套JSON的更快方法

问题描述

2 个解决方案

解决方案1
0 2019-07-25 21:02:42

解决方案2
0 2019-07-25 21:17:54

将Pandas数据框转换为嵌套JSON的更快方法

问题描述

2 个解决方案

解决方案1 0 2019-07-25 21:02:42

解决方案2 0 2019-07-25 21:17:54

解决方案1
0 2019-07-25 21:02:42

解决方案2
0 2019-07-25 21:17:54