[英]How to create a parent-child dataframe from an existing pandas dataframe of hierarchical structure and arbitrary shape?
如何將具有層次結構和任意形狀(例如,類似於下面的形狀)的給定 dataframe 轉換為具有父列和子列的新 dataframe?
編輯:請注意,約束是子項不能是其自己的父項。
data = {'level1': ['A', 'A', 'B', 'B', 'C'],
'level2': ['James', 'Robert', 'Patricia', 'Patricia', 'John'],
'level3': ['Stockholm', 'Denver', 'Moscow', 'Moscow', 'Palermo'],
'level4': ['red', 'Denver', 'yellow', 'purple', 'blue']
}
df = pd.DataFrame(data)
level1 level2 level3 level4
0 A James Stockholm red
1 A Robert Denver Denver
2 B Patricia Moscow yellow
3 B Patricia Moscow purple
4 C John Palermo blue
所需的 output 是這樣的:
parent child
0 A James
1 A Robert
2 B Patricia
3 C John
4 James Stockholm
5 Robert Denver
6 Patricia Moscow
7 John Palermo
8 Stockholm red
9 Moscow yellow
10 Moscow purple
11 Palermo blue
我能想到的是在 dataframe 的列上使用 for 循環:
columns = df.columns
length = len(columns)
parent = []
children = []
for i in range(length):
if i != length - 1 :
parent += df[columns[i]].to_list()
children += df[columns[i+1]].to_list()
newDf = pd.DataFrame({"parent":parent, "children":children}).drop_duplicates()
newDf[newDf["parent"] != newDf["children"]]
parent children
0 A James
1 A Robert
2 B Patricia
4 C John
5 James Stockholm
6 Robert Denver
7 Patricia Moscow
9 John Palermo
10 Stockholm red
12 Moscow yellow
13 Moscow purple
14 Palermo blue
聲明:本站的技術帖子網頁,遵循CC BY-SA 4.0協議,如果您需要轉載,請注明本站網址或者原文地址。任何問題請咨詢:yoyou2525@163.com.