Summing values in a Python dictionary

Question

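I would like to know how to sum values using a Python dictionary. I am reading a huge file line by line and incrementing the value for each specific key. Suppose I have the following toy file: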

word1 5
word2 3
word3 1
word1 2
word2 1

The result I expect is:

my_dict = {'word1':7, 'word2':4, 'word3':1}

My current attempt is pasted below.

my_dict = {}          
with open('test.txt') as f:
    for line in f:
        line = line.rstrip()
        line = line.split()
        word = line[0]
        frequency = line[1]
        my_dict[word] += int(frequency)

Answer 1

Use a collections.Counter() object:

from collections import Counter

my_dict = Counter()

with open('test.txt') as f:
    for line in f:
        word, freq = line.split()
        my_dict[word] += int(freq)

Note that str.rstrip() is not needed: str.split() called with no arguments also strips leading and trailing whitespace from the string.
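To illustrate why the rstrip() call can be dropped, here is a minimal sketch. The sample line is made up to resemble one line read from the file, trailing newline included.

```python
# A line as it would come from iterating over the file, newline included.
line = "word1 5\n"

# split() with no arguments splits on runs of whitespace and ignores
# leading and trailing whitespace, so the newline never reaches the result.
with_rstrip = line.rstrip().split()
without_rstrip = line.split()

print(with_rstrip)     # ['word1', '5']
print(without_rstrip)  # ['word1', '5']
```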

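Besides defaulting missing keys to 0, Counter() objects have other advantages, such as listing words sorted by frequency (including the top N), and supporting addition and subtraction of counters.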

The Counter loop above produces:

>>> my_dict
Counter({'word1': 7, 'word2': 4, 'word3': 1})
>>> for word, freq in my_dict.most_common():
...     print(word, freq)
... 
word1 7
word2 4
word3 1
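The other Counter features mentioned in this answer (top N, addition, subtraction) can be sketched as follows. The first counter holds the totals from the question's sample file; the second one is hypothetical data invented for this example.

```python
from collections import Counter

# Totals from the sample file in the question.
totals = Counter({'word1': 7, 'word2': 4, 'word3': 1})

# Hypothetical counts from a second file, made up for this example.
other = Counter({'word1': 1, 'word3': 4})

top_two = totals.most_common(2)   # the two highest totals, largest first
combined = totals + other         # adds counts key by key
difference = totals - other       # subtracts, dropping results <= 0

print(top_two)      # [('word1', 7), ('word2', 4)]
print(combined)     # Counter({'word1': 8, 'word3': 5, 'word2': 4})
print(difference)   # Counter({'word1': 6, 'word2': 4})
```

Note that subtraction discards keys whose result is zero or negative, which is why word3 does not appear in the last line.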

Answer 2

You can use a defaultdict:

import collections
d = collections.defaultdict(int)
with open('test.txt') as f:
    for row in f:
        temp = row.split()
        d[temp[0]] += int(temp[1])

d is now:

defaultdict(<class 'int'>, {'word1': 7, 'word2': 4, 'word3': 1})
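For readers wondering why this works where a plain dict does not, the sketch below contrasts the two. defaultdict(int) calls int() for a missing key, which returns 0, so += has a starting value to add to. The variable names are chosen for this example only.

```python
from collections import defaultdict

plain = {}
error = None
try:
    plain['word1'] += 5        # reads plain['word1'] first, which is missing
except KeyError as exc:
    error = exc                # a plain dict raises KeyError here

d = defaultdict(int)           # missing keys are created with int(), i.e. 0
d['word1'] += 5
d['word1'] += 2

print(type(error).__name__)    # KeyError
print(d['word1'])              # 7

# A plain dict also works if the default is supplied by hand:
plain['word1'] = plain.get('word1', 0) + 5
print(plain)                   # {'word1': 5}
```

The dict.get() form avoids the import, at the cost of spelling out the default on every update.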

Answer 3

In case someone is working with multiple columns (in my case I had the same problem, but with four columns):

This should do the trick:

from collections import defaultdict

my_dict = defaultdict(int)

with open("input") as f:
    for line in f:
        if line.strip():  # skip blank lines
            items = line.split()
            freq = items[-1]           # last column is the count
            lemma = tuple(items[:-1])  # remaining columns form the key

            my_dict[lemma] += int(freq)

for items, freq in my_dict.items():
    print(items, freq)
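As a quick check of the tuple-key idea, here is a self-contained sketch that runs the same logic on in-memory data. The four-column sample lines are hypothetical, and io.StringIO stands in for the real file.

```python
import io
from collections import defaultdict

# Hypothetical four-column input: three key columns, then a count.
sample = io.StringIO(
    "run fast adv 2\n"
    "run fast adv 3\n"
    "\n"
    "run slow adv 1\n"
)

totals = defaultdict(int)
for line in sample:
    if line.strip():                      # skip blank lines
        *key, freq = line.split()         # last field is the count
        totals[tuple(key)] += int(freq)   # lists are unhashable, tuples are not

print(dict(totals))
# {('run', 'fast', 'adv'): 5, ('run', 'slow', 'adv'): 1}
```

The key has to be a tuple because a list cannot be used as a dictionary key.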

3 answers

Answer 1
4 Accepted 2013-09-07 08:42:29

Answer 2
2 2013-09-07 08:43:17

Answer 3
0 2013-11-07 15:38:01
