用Python计算文件中的单词

Question

我是Python的新手，但令我惊讶的是，我编写了下面的代码：

if __name__ == "__main__":
with open("wordlist.txt") as infile:
    for line in infile:
        print(line)    



with open ("cv000_29416.txt", "r") as myfile:
   data=myfile.read().replace('\n', '')
print (data.count("bad"))

关键是，我想计算cv000_29416.txt中wordlist.txt中的单词。

（因此，wordlist.txt包含20个单词，例如“坏”，“好”等，而cv000_29416.txt是一个长文本，我想计算cv000_29416.txt中出现“坏”，“好”等次数的时间）

我可以在几秒钟的代码中插入它吗？

谢谢！ 对不起，英语不好

Answer 1

# create a collection of the words that want to count
with open('wordlist.txt') as infile:
    counts = {}
    for line in infile:
        for word in line.split():
            counts[word] = 0

# increment the count of the words that you really care about
with open("cv000_29416.txt") as infile:
    for line in infile:
        for word in line.split():
            if word in counts:
                counts[word] += 1

for word,count in counts.items():
    print(word, "appeared", count, "times")

Answer 2

使用collections.Counter字典来计算所有单词：

from collections import Counter
with open ("cv000_29416.txt", "r") as myfile:
   data = Counter(myfile.read().split())
print (data["bad"])

综上所述，假定每个单词都在wordlist.txt中的单独一行上：

from collections import Counter
with open ("cv000_29416.txt", "r") as myfile,open("wordlist.txt") as infile:
    data = Counter(myfile.read().split())
    for line in infile:
        print(data.get(line.rstrip(),0))

用Python计算文件中的单词

问题描述

2 个解决方案

解决方案1
3 2014-11-22 23:38:51

解决方案2
2 2014-11-22 23:37:10

用Python计算文件中的单词

问题描述

2 个解决方案

解决方案1 3 2014-11-22 23:38:51

解决方案2 2 2014-11-22 23:37:10

解决方案1
3 2014-11-22 23:38:51

解决方案2
2 2014-11-22 23:37:10