将嵌套列表和字典转换为数据框？

Question

我有一本具有数千个键的字典，如下所示：

 my_dictionary:
                {'key1':[['ft1',[2,4,12,2]],['ft2',[0,3,3,1]],'key2':[['ft1',[5,0,2,9]],['ft2',[10,39,3,2]]}

现在，我想将此字典转换为数据框，其中键应为特定列，并且要素（ft1，ft2，..）及其值也将转换为不同的列。 所以我想要的数据框应该是这样的：

 my_new_dataframe:
             ID, ft1_sig,ft1_med,ft1_les,ft1_non,ft2_sig,ft2_med,ft2_les,ft2_non,... 
             key1    2       4      12      2       0       3       3.     1
             key2    5.      0.      2.     9.      10.     39     3.     2
             ...
             keyn.  ..       ..      ..     ..

Answer 1

我为此尝试了一种解决方案，但是它要求每个键（即key1，key2等）都包含您要在字典中使用的ft属性。 另外，您是否为原始列表缺少“]”？ 当我粘贴到翻译器中时，它不匹配。

import pandas as pd

#added method to change your original dictionary to one that I can manipulate with the method below.

#If you compare the values of new_dict and data using ==, it returns true.
my_dictionary = {'key1':[['ft1',[2,4,12,2]],['ft2',[0,3,3,1]]],'key2':[['ft1',[5,0,2,9]],['ft2',[10,39,3,2]]]}

new_dict ={}
for element in my_dictionary:
    print(element)
    print(my_dictionary[element])
    new_dict[element] = dict(my_dictionary[element])
    print(new_dict)

data = {
    'key1':{
        'ft1':[2,4,12,2],
        'ft2':[0,3,3,1]
        },
    'key2':{
        'ft1':[5,0,2,9],
        'ft2':[10,39,3,2]
        }
    }

keys = list(data.keys())

df = pd.DataFrame.from_dict(data).T

df2 = pd.DataFrame(df.ft1.values.tolist()).add_prefix('ft1_')
df3 = pd.DataFrame(df.ft2.values.tolist()).add_prefix('ft2_')
df4 = pd.merge(df2,df3,left_index=True,right_index=True)
df4.index=keys

print(df4)

这是输出：

Answer 2

我在您的示例中添加了更多数据，以表明如果添加了新功能或行（键），则脚本将具有灵活性。

收集列表中的所有功能（名称）
仅获取号码列表
分配新的列名
创建一个函数，该函数将返回数字列表中的每个项目
使用Apply函数创建新列
删除/删除临时列

这里是

mydict = {'key1':[['ft1',[2,4,12,2]],['ft2',[0,3,3,1]],['ft3',[0,3,3,1]]],
          'key2':[['ft1',[5,0,2,9]],['ft2',[10,39,3,2]],['ft3',[0,3,3,1]]]
         ,'key3':[['ft1',[5,0,2,9]],['ft2',[10,39,3,2]],['ft3',[0,3,3,1]]]} 
df = pd.DataFrame(mydict).T 
colname = [df[c][0][0] for c in df]
df = df.applymap(lambda c: c[1])
df.reset_index(level=0, inplace=True)
df.columns=['ID'] + colname
s=['_sig','_med','_les','_non']
def f(x):
    return pd.Series([x[0], x[1], x[2], x[3]])
for col in colname:
    df[[col+'_sig', col+'_med', col+'_les', col+'_non']]= df[col].apply(lambda x: f(x))
df.drop(colname, axis=1, inplace=True)
df

结果：

     ID ft1_sig ft1_med ft1_les ft1_non ft2_sig ft2_med ft2_les ft2_non ft3_sig ft3_med ft3_les ft3_non
0   key1    2    4  12  2    0   3  3   1   0   3   3   1
1   key2    5    0   2  9   10  39  3   2   0   3   3   1
2   key3    5    0   2  9   10  39  3   2   0   3   3   1

将嵌套列表和字典转换为数据框？

问题描述

2 个解决方案

解决方案1
1 已采纳 2019-05-07 18:08:49

解决方案2
1 2019-05-07 18:24:04

将嵌套列表和字典转换为数据框？

问题描述

2 个解决方案

解决方案1 1 已采纳 2019-05-07 18:08:49

解决方案2 1 2019-05-07 18:24:04

解决方案1
1 已采纳 2019-05-07 18:08:49

解决方案2
1 2019-05-07 18:24:04