清洗和提取 JSON 来自 Pandas 列

Question

I have a DataFrame with the following structure:我有一个具有以下结构的 DataFrame：

id    year  name        homepage        
238   2022  Adventure  {'keywords': 'en', 'genres':[{"revenue": 1463, "name": "culture clash"}], 'runtime': 150, 'vote_average': 7}

But what I need is this structure但是我需要的是这个结构

   id    year   name        keywords    revenue     name               runtime vote_average
   238   2022  Adventure     en         1460        culture clash      150     7

How can I do this?我怎样才能做到这一点？

Answer 1

The idea is to json_normalize "homepage" column and join it back to df .这个想法是json_normalize “homepage” 列并将其join回df 。 You can pass meta and the record path directly into json_normalize as parameters:您可以将元数据和记录路径作为参数直接传递到json_normalize中：

out = (df.join(pd.json_normalize(df['homepage'], record_path='genres', 
                                 meta=['keywords', 'runtime', 'vote_average']), 
               lsuffix='', rsuffix='_genres')
       .drop(columns='homepage'))

Output: Output：

    id  year       name keywords  revenue    name_genres runtime vote_average
0  238  2022  Adventure       en     1463  culture clash     150            7

清洗和提取 JSON 来自 Pandas 列

问题描述

1 个解决方案

解决方案1
3 已采纳

清洗和提取 JSON 来自 Pandas 列

问题描述

1 个解决方案

解决方案1 3 已采纳

解决方案1
3 已采纳