Error when using Pandas Pivot Table on a dataframe with string-type columns

Question

I have the dataframe:

And I would like to obtain this result, using Pivot Table or an alternative function:

I am trying to turn the rows of the Custom Field column into columns with the Pandas pivot_table function, and I get an error:

import pandas as pd

data = {
"Custom Field": ["CF1", "CF2", "CF3"],
"id": ["RSA", "RSB", "RSC"],
"Name": ["Wilson", "Junior", "Otavio"]
}

### create the dataframe ###
df = pd.DataFrame(data)

print(df)

df2 = df.pivot_table(columns=['Custom Field'], index=['Name'])
print(df2)

I suspect it is because I am working with strings.

Any suggestions?

Thanks in advance.

Answer 1

You need pivot, not pivot_table. The latter aggregates values that may repeat (by default with the mean, which is not defined for strings), whereas the former only rearranges the values and fails when an index/column pair occurs more than once.

df.pivot(columns=['Custom Field'], index=['Name'])
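As a minimal sketch of what this call produces on the question's data: passing values="id" is an addition here, not part of the call above, and is assumed only to keep the resulting column index flat instead of nested under "id".

```python
import pandas as pd

# Same data as in the question
df = pd.DataFrame({
    "Custom Field": ["CF1", "CF2", "CF3"],
    "id": ["RSA", "RSB", "RSC"],
    "Name": ["Wilson", "Junior", "Otavio"],
})

# pivot only rearranges values; no aggregation takes place.
# values="id" (an assumption for this sketch) selects the single value
# column, so the result has plain CF1/CF2/CF3 columns.
wide = df.pivot(index="Name", columns="Custom Field", values="id")
print(wide)
```

Cells with no matching row are left as NaN, since each Name has only one Custom Field in this data.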

Update as per comment: if there are multiple values per cell, you need to use pivot_table and specify an appropriate aggregate function, e.g. one that concatenates the string values. You can also specify a fill value for empty cells (instead of NaN):

df = pd.DataFrame({"Custom Field": ["CF1", "CF2", "CF3", "CF1"],
                   "id": ["RSA", "RSB", "RSC", "RSD"],
                   "Name": ["Wilson", "Junior", "Otavio", "Wilson"]})

df.pivot_table(columns=['Custom Field'], index=['Name'], aggfunc=','.join, fill_value='-')

                    id          
Custom Field       CF1  CF2  CF3
Name                            
Junior               -  RSB    -
Otavio               -    -  RSC
Wilson         RSA,RSD    -    -
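
To see why pivot is not enough for this second dataframe, here is a small sketch that tries pivot first and falls back to pivot_table when pivot rejects the duplicate (Name, Custom Field) pair. The try/except structure and the pivot_failed flag are illustrative only, and values="id" is again assumed to keep the columns flat.

```python
import pandas as pd

# Data with a repeated (Name, Custom Field) pair: Wilson / CF1 appears twice
df = pd.DataFrame({
    "Custom Field": ["CF1", "CF2", "CF3", "CF1"],
    "id": ["RSA", "RSB", "RSC", "RSD"],
    "Name": ["Wilson", "Junior", "Otavio", "Wilson"],
})

try:
    # pivot cannot decide which of the two ids goes in the Wilson/CF1 cell
    wide = df.pivot(index="Name", columns="Custom Field", values="id")
    pivot_failed = False
except ValueError as exc:
    pivot_failed = True
    print(f"pivot failed: {exc}")
    # pivot_table resolves the duplicates by joining the strings
    wide = df.pivot_table(index="Name", columns="Custom Field", values="id",
                          aggfunc=",".join, fill_value="-")

print(wide)
```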
