[英]Converting CSV data to list in dictionary
我有一個CSV文件,格式如下:
Name_1,2,K,14
Name_1,3,T,14
Name_1,4,T,18
Name_2,2,G,12
Name_2,4,T,14
Name_2,6,K,15
Name_3,2,K,12
Name_3,3,T,15
Name_3,4,G,18
我想將其轉換為字典,其中Name_x
是鍵,相應的數據是列表形式的值。 像這樣的東西:
{'Name_1': [[2, 'K', 14], [3, 'T', 14], [4, 'T', 18]],
'Name_2': [[4, 'T', 14], [4, 'T', 14], [6, 'K' ,15]],
...}
到目前為止,我認為我必須使用use defaultdict
:
from collections import defaultdict
d = defaultdict(list)
但是如何append
數據append
到d
? 我知道defaultdict
沒有append
方法。
您需要使用名稱作為鍵並將行的切片附加為值,使用normal或defaultdict將沒有順序:
import csv
from collections import defaultdict
with open('in.csv') as f:
r = csv.reader(f)
d = defaultdict(list)
for row in r:
d[row[0]].append(row[1:])
print(d)
如果你想維持秩序,你需要一個OrderedDict
:
from collections import OrderedDict
with open('in.csv') as f:
r = csv.reader(f)
od = OrderedDict()
for row in r:
# get key/ first element in row
key = row[0]
# create key/list paring if it does not exist, else just append the value
od.setdefault(key, []).append(row[1:])
print(od)
輸出:
OrderedDict([('Name_1', [['2', 'K', '14'], ['3', 'T', '14'], ['4', 'T', '18']]), ('Name_2', [['2', 'G', '12'], ['4', 'T', '14'], ['6', 'K', '15']]), ('Name_3', [['2', 'K', '12'], ['3', 'T', '15'], ['4', 'G', '18']])])
如果名稱被分組,您還可以使用groupby,它將根據每行中的第一個項目/名稱對元素進行分組:
import csv
from collections import OrderedDict
from itertools import groupby
from operator import itemgetter
with open('in.csv') as f:
r = csv.reader(f)
od = OrderedDict()
for k, v in groupby(r, key=itemgetter(0)):
od[k] = [sub[1:] for sub in v]
如果您使用的是python3,可以使用*
解壓縮:
with open("in.csv") as f:
r = csv.reader(f)
od = OrderedDict()
for row in r:
key, *rest = row
od.setdefault(key, []).append(rest)
import csv
from collections import OrderedDict
from itertools import groupby
from operator import itemgetter
with open('in.csv') as f:
r = csv.reader(f)
od = OrderedDict()
for k, v in groupby(r, key=itemgetter(0)):
od[k] = [sub for _, *sub in v]
print(od)
txtcsv="""Name_1,2,K,14
Name_1,3,T,14
Name_1,4,T,18
Name_2,2,G,12
Name_2,4,T,14
Name_2,6,K,15
Name_3,2,K,12
Name_3,3,T,15
Name_3,4,G,18"""
def save():
with open("test.csv","w") as f:
f.write(txtcsv)
if __name__ == "__main__":
save()
with open("test.csv") as f:
d = {}
for l in f.readlines():
name, val = l.rstrip().split(",", 1)
d.setdefault(name, []).append(val.split(","))
print (d)
在我的頭頂(因為我不太熟悉defaultdict),這應該大致按照你想要的。
data是CSV字符串
obj = {}
data = data.split('\n')
for row in data:
row = row.split(',')
if row[0] in obj:
obj[row[0]].append(row[1:])
else:
obj[row[0]] = [row[1:]]
print obj
聲明:本站的技術帖子網頁,遵循CC BY-SA 4.0協議,如果您需要轉載,請注明本站網址或者原文地址。任何問題請咨詢:yoyou2525@163.com.