使用python pandas将json转换为csv

Question

I have referred this: Nested Json to pandas DataFrame with specific format 我已经提到了这一点：将Json嵌套到具有特定格式的pandas DataFrame

and this: json_normalize produces confusing KeyError 这： json_normalize产生令人困惑的KeyError

to try and normalize my json snippet using json_normalize in pandas. 尝试在熊猫中使用json_normalize对我的json代码段进行规范化。 However, the output isn't getting normalized fully. 但是，输出未完全标准化。 Here's a snippet of my code 这是我的代码片段

x =[{'fb_metrics': [{'period': 'lifetime', 'values': [{'value': {'share': 2, 'like': 10}}], 'title': 'Lifetime Post Stories by action type', 'name': 'post_stories_by_action_type', '_id': '222530618111374_403476513350116/insights/post_stories_by_action_type/lifetime', 'description': 'Lifetime: The number of stories created about your Page post, by action type. (Total Count)'}]}]

df = pd.io.json.json_normalize(x[0]['fb_metrics'])

The output for values column is 值列的输出是

values
[{'value': {'share': 2, 'like': 10}}]

I would've liked to have two column outputs instead like 我本来希望有两个列输出，而不是像

value.share   value.like
2                10

How should I achieve this? 我应该如何实现？

Answer 1

You might apply json_normalize to the values column one more time to flatten it: 您可能需要再一次将json_normalize应用于values列以使其扁平化：

pd.concat([
    df.drop('values', 1), 
    df['values'].apply(lambda x: pd.io.json.json_normalize(x).iloc[0])
], axis=1)

Answer 2

For your dataframe, 对于您的数据框，

You can create a new DataFrame from the nested dictionary within values using df.from_dcit() do: 您可以使用df.from_dcit()在值内的嵌套字典中创建新的DataFrame：

df2 = pd.DataFrame.from_dict(df['values'].values[0][0], orient = 'index').reset_index().drop(['index'], axis=1)

to get: 要得到：

df2:

   share  like
0      2    10

Then add this to your existing dataframe to get the format you need using pd.concat : 然后将其添加到您现有的数据pd.concat以使用pd.concat获得所需的格式：

result = pd.concat([df, df2], axis=1, join='inner')

result[['values', 'share', 'like']]
Out[74]: 
                                     values  share  like
0  [{u'value': {u'share': 2, u'like': 10}}]      2    10

If needed can rename: 如果需要可以重命名：

result.rename(columns={'share': 'values.share', 'like':'values.like'}, inplace=True)

result[['values', 'share', 'like']]
Out[74]: 
                                     values  values.share  values.like
0  [{u'value': {u'share': 2, u'like': 10}}]             2           10

Answer 3

import pandas as pd
df = pd.read_json('data.json')
df.to_csv('data.csv', index=False, columns=['title', 'subtitle', 'date', 
'description'])

import pandas as pd
df = pd.read_csv("data.csv")
df = df[df.columns[:4]]
df.dropna(how='all')
df.to_json('data.json', orient='records')

使用python pandas将json转换为csv

问题描述

3 个解决方案

解决方案1
1 已采纳 2017-04-16 13:05:58

解决方案2
1 2017-04-16 13:06:56

解决方案3
0 2017-09-16 03:35:12

使用python pandas将json转换为csv

问题描述

3 个解决方案

解决方案1 1 已采纳 2017-04-16 13:05:58

解决方案2 1 2017-04-16 13:06:56

解决方案3 0 2017-09-16 03:35:12

解决方案1
1 已采纳 2017-04-16 13:05:58

解决方案2
1 2017-04-16 13:06:56

解决方案3
0 2017-09-16 03:35:12