如何使用字典格式的重复值从 pandas dataframe 创建 aa 列

Question

i'm very confused on how to do this, (i'm very newbie yet) and I need to convert this dataframe into a dictionary with a column for repeated values:我对如何执行此操作感到很困惑（我还是新手），我需要将此 dataframe 转换为包含重复值列的字典：

import pandas as pd
df = pd.DataFrame({'Name': [['John', 'hock'], ['John','pepe'],['Peter', 'wdw'],['Peter'],['John'], ['Stef'], ['John']],
                   'Age': [38, 47, 63, 28, 33, 45, 66]
                  })

and i need something like:我需要这样的东西：

Name Age Repeated:
John 38  4

thanks!谢谢！

Answer 1

Use DataFrame.explode with GroupBy.size :使用DataFrame.explode和GroupBy.size ：

df = df.explode('Name').groupby(['Name']).size().reset_index(name='Repeated')
print (df)
    Name  Repeated
0   John         4
1  Peter         2
2   Stef         1
3   hock         1
4   pepe         1
5    wdw         1

Answer 2

I can think of something like:我可以想到类似的东西：

resultDict = {}
for index, row in df.iterrows():
  for value in row["Name"]:
    if value not in resultDict:
      resultDict[value] = 0
    resultDict[value] += 1
resultDict

Output Output

{'John': 4, 'Peter': 2, 'Stef': 1, 'hock': 1, 'pepe': 1, 'wdw': 1}

If you want to have it as a dataframe and not a dictionary:如果您想将其作为 dataframe 而不是字典：

resultDict = {}
for index, row in df.iterrows():
  for value in row["Name"]:
    if value not in resultDict:
      resultDict[value] = 0
    resultDict[value] += 1
pd.DataFrame({"Name":resultDict.keys(), "Repeated":resultDict.values()})

Output Output

Name名称	Repeated重复
John约翰	4 4个
hock飞节	1 1个
pepe佩佩	1 1个
Peter彼得	2 2个
wdw wdw	1 1个
Stef斯特夫	1 1个

如何使用字典格式的重复值从 pandas dataframe 创建 aa 列

问题描述

2 个解决方案

解决方案1
1 2022-03-21 11:31:01

解决方案2
1 已采纳 2022-03-21 11:31:02

Output Output

Output Output

如何使用字典格式的重复值从 pandas dataframe 创建 aa 列

问题描述

2 个解决方案

解决方案1 1 2022-03-21 11:31:01

解决方案2 1 已采纳 2022-03-21 11:31:02

Output Output

Output Output

解决方案1
1 2022-03-21 11:31:01

解决方案2
1 已采纳 2022-03-21 11:31:02