[英]How to convert pandas data frame into list of tuples<string, list of list>
I have a data frame which has a following structure我有一个具有以下结构的数据框
title field1 field2 field3 field4 field5
title1 value11 value12 value13 value14 value15
title1 value21 value22 value23 value24 value25
title1 value31 value32 value33 value34 value35
title2 value1_1 value1_2 value1_3 value1_4 value1_5
title2 value2_1 value2_2 value2_3 value2_4 value2_5
title2 value3_1 value3_2 value3_3 value3_4 value3_5
I want to convert above data frame into a list of tuples<String, List of List>,我想将上面的数据框转换成一个元组列表<String, List of List>,
For example,例如,
title1, [
(value11, value12, value13, value14, value15),
(value21, value22, value23, value24, value25),
(value31, value32, value33, value34, value35)
]
title2, [
(value1_1, value1_2, value1_3, value1_4, value1_5),
(value2_1, value2_2, value2_3, value2_4, value2_5),
(value3_1, value3_2, value3_3, value3_4, value3_5)
]
You can use tuple()
inside lambda function in df.GroupBy.apply()
as follows:您可以在 df.GroupBy.apply() 中使用 lambda function 中的
tuple()
df.GroupBy.apply()
,如下所示:
Assuming the fields columns are from the 2nd column onwards.假设字段列从第二列开始。 If not, you can modify the column index range in
df.columns[1:]
below or explicitly use list the columns names below:如果没有,您可以在下面的
df.columns[1:]
中修改列索引范围,或者显式使用列出下面的列名称:
(df.groupby('title')[df.columns[1:]]
.apply(lambda x: [tuple(y) for y in x.to_numpy()])
.to_dict())
Result:结果:
{'title1': [('value11', 'value12', 'value13', 'value14', 'value15'),
('value21', 'value22', 'value23', 'value24', 'value25'),
('value31', 'value32', 'value33', 'value34', 'value35')],
'title2': [('value1_1', 'value1_2', 'value1_3', 'value1_4', 'value1_5'),
('value2_1', 'value2_2', 'value2_3', 'value2_4', 'value2_5'),
('value3_1', 'value3_2', 'value3_3', 'value3_4', 'value3_5')]}
声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.