Convert Nested Dictionary to CSV Table
I am working through a data mining tutorial and am using the following dictionary.
users = {
"Angelica": {
"Blues Traveler": 3.5,
"Broken Bells": 2.0,
"Norah Jones": 4.5,
"Phoenix": 5.0,
"Slightly Stoopid": 1.5,
"The Strokes": 2.5,
"Vampire Weekend": 2.0
},
"Bill":{
"Blues Traveler": 2.0,
"Broken Bells": 3.5,
"Deadmau5": 4.0,
"Phoenix": 2.0,
"Slightly Stoopid": 3.5,
"Vampire Weekend": 3.0
},
"Chan": {
"Blues Traveler": 5.0,
"Broken Bells": 1.0,
"Deadmau5": 1.0,
"Norah Jones": 3.0,
"Phoenix": 5,
"Slightly Stoopid": 1.0
},
"Dan": {
"Blues Traveler": 3.0,
"Broken Bells": 4.0,
"Deadmau5": 4.5,
"Phoenix": 3.0,
"Slightly Stoopid": 4.5,
"The Strokes": 4.0,
"Vampire Weekend": 2.0
},
"Hailey": {
"Broken Bells": 4.0,
"Deadmau5": 1.0,
"Norah Jones": 4.0,
"The Strokes": 4.0,
"Vampire Weekend": 1.0
},
"Jordyn": {
"Broken Bells": 4.5,
"Deadmau5": 4.0,
"Norah Jones": 5.0,
"Phoenix": 5.0,
"Slightly Stoopid": 4.5,
"The Strokes": 4.0,
"Vampire Weekend": 4.0
},
"Sam": {
"Blues Traveler": 5.0,
"Broken Bells": 2.0,
"Norah Jones": 3.0,
"Phoenix": 5.0,
"Slightly Stoopid": 4.0,
"The Strokes": 5.0
},
"Veronica": {
"Blues Traveler": 3.0,
"Norah Jones": 5.0,
"Phoenix": 4.0,
"Slightly Stoopid": 2.5,
"The Strokes": 3.0
}
}
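I would like to convert it to a .csv file so that when I open the file in Excel, I get a table with the artists as rows and the user names as columns.
Are there any built-in Python methods that can help me achieve this?
You have to transpose the data: the outer keys (users) currently hold the inner keys (artists), and you need one row per artist with one column per user. Using a collections.defaultdict() object is the easiest way to do this: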
from collections import defaultdict

rows = defaultdict(dict)
# Python 2; on Python 3 use .items() instead of .iteritems()
for user, artists in users.iteritems():
    for artist, count in artists.iteritems():
        rows[artist][user] = count
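To make the shape change concrete, here is a minimal Python 3 sketch of the same transposition on a two-user sample. The `transpose` helper name and the `sample` subset are illustrative assumptions, not part of the answer itself, and `.items()` stands in for Python 2's `.iteritems()`.

```python
from collections import defaultdict

def transpose(nested):
    """Turn {outer: {inner: value}} into {inner: {outer: value}}."""
    rows = defaultdict(dict)
    for outer, inner_map in nested.items():
        for inner, value in inner_map.items():
            rows[inner][outer] = value
    # Return a plain dict so missing keys raise KeyError again
    return dict(rows)

# Hypothetical two-user subset of the question's data
sample = {
    "Angelica": {"Phoenix": 5.0, "Norah Jones": 4.5},
    "Bill": {"Phoenix": 2.0, "Deadmau5": 4.0},
}
transposed = transpose(sample)
print(transposed["Phoenix"])
```

Each artist now maps to a dictionary keyed by user, and a user who did not rate that artist is simply absent from it.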
Now you have dictionaries that can be written directly to a csv.DictWriter():
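import csv

# Python 2; on Python 3 open with ('w', newline=''), wrap users.keys() in list(), and use .items()
with open(csvfilename, 'wb') as outf:
    writer = csv.DictWriter(outf, [''] + users.keys())
    writer.writeheader()
    writer.writerows(dict(row, **{'': key}) for key, row in rows.iteritems())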
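The generator expression is needed to add the first-column key-value pair (the empty-string field name mapped to the artist name) to each value in the rows dictionary.
Demo: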
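As a small illustration of what that expression builds for a single row, the sketch below uses one artist's ratings taken from the question's data. The variable names `labelled` and `labelled_alt` are made up for this example, and the second spelling is an alternative that should behave the same on Python 3.5 and later.

```python
row = {"Angelica": 5.0, "Bill": 2.0}  # ratings for one artist
key = "Phoenix"

# The form used in the answer: copy `row` and add the '' -> key pair
labelled = dict(row, **{"": key})

# An equivalent spelling using dict unpacking (Python 3.5+)
labelled_alt = {**row, "": key}

print(labelled)
```

Both forms build a new dictionary, so the entries in `rows` are left unmodified.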
>>> from collections import defaultdict
>>> import csv
>>> users = { ... } # elided for brevity
>>> rows = defaultdict(dict)
>>> for user, artists in users.iteritems():
...     for artist, count in artists.iteritems():
...         rows[artist][user] = count
...
>>> import sys
>>> writer = csv.DictWriter(sys.stdout, [''] + users.keys())
>>> writer.writeheader()
,Angelica,Veronica,Sam,Jordyn,Dan,Bill,Chan,Hailey
>>> writer.writerows(dict(row, **{'': key}) for key, row in rows.iteritems())
The Strokes,2.5,3.0,5.0,4.0,4.0,,,4.0
Blues Traveler,3.5,3.0,5.0,,3.0,2.0,5.0,
Phoenix,5.0,4.0,5.0,5.0,3.0,2.0,5,
Broken Bells,2.0,,2.0,4.5,4.0,3.5,1.0,4.0
Deadmau5,,,,4.0,4.5,4.0,1.0,1.0
Norah Jones,4.5,5.0,3.0,5.0,,,3.0,4.0
Slightly Stoopid,1.5,2.5,4.0,4.5,4.5,3.5,1.0,
Vampire Weekend,2.0,,,4.0,2.0,3.0,,1.0
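The session above is Python 2. For Python 3, a rough end-to-end sketch of the same approach might look like the following. The function name `write_ratings_csv` and the two-user `sample` are assumptions made for illustration. It writes to an in-memory buffer with `lineterminator="\n"` so the result is easy to inspect. For a real file you would typically pass `open(csvfilename, "w", newline="")` and leave the default line terminator alone.

```python
import csv
import io
from collections import defaultdict

def write_ratings_csv(users, outf):
    """Write {user: {artist: rating}} as a table with one row per artist."""
    rows = defaultdict(dict)
    for user, artists in users.items():
        for artist, rating in artists.items():
            rows[artist][user] = rating

    # On Python 3 a dict view cannot be concatenated to a list directly
    fieldnames = [""] + list(users)
    writer = csv.DictWriter(outf, fieldnames, lineterminator="\n")
    writer.writeheader()
    # Missing users fall back to DictWriter's default restval of ''
    writer.writerows({"": artist, **row} for artist, row in rows.items())

# Hypothetical two-user subset of the question's data
sample = {
    "Angelica": {"Phoenix": 5.0, "The Strokes": 2.5},
    "Bill": {"Phoenix": 2.0, "Deadmau5": 4.0},
}
buffer = io.StringIO()
write_ratings_csv(sample, buffer)
csv_text = buffer.getvalue()
print(csv_text)
```

On Python 3.7 and later, dictionaries keep insertion order, so the columns follow the order of `users` and the rows follow the order in which each artist is first seen.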
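Try this:
import csv

# Create header line (list() keeps this working on Python 3 as well)
a = ["Album/Track"] + list(users.keys())
# Collect the unique artist names across all users
x = list(set([y for z in users.values() for y in z.keys()]))
# Create rows, using '-' where a user has no rating for an artist
rows = [a] + [[q] + [users[p].get(q, '-') for p in a[1:]] for q in x]
# Python 2; on Python 3 use open('my.csv', 'w', newline='')
with open('my.csv', 'wb') as csvfile:
    writer = csv.writer(csvfile)
    for row in rows:
        writer.writerow(row)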
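If you prefer csv.DictWriter but still want the '-' placeholder for missing ratings, its `restval` parameter may be a convenient way to get it. The sketch below is Python 3 and assumes the rows have already been transposed. The sample data and the in-memory buffer are only there for illustration.

```python
import csv
import io

# Already-transposed rows keyed by artist (hypothetical sample data)
rows = {
    "Phoenix": {"Angelica": 5.0, "Bill": 2.0},
    "Deadmau5": {"Bill": 4.0},
}
fieldnames = ["Album/Track", "Angelica", "Bill"]

buffer = io.StringIO()
# restval is written for any field name missing from a row dictionary
writer = csv.DictWriter(buffer, fieldnames, restval="-", lineterminator="\n")
writer.writeheader()
for artist, ratings in rows.items():
    writer.writerow({"Album/Track": artist, **ratings})

csv_text = buffer.getvalue()
print(csv_text)
```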
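Alternatively, using pandas:
import pandas as pd

# Outer keys (users) become columns; inner keys (artists) become the row index
data = pd.DataFrame(users)
# Replace missing ratings with a placeholder before writing
data = data.fillna("-")
data.to_csv("./users.csv")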