Counting a specific word in a text file with the count method

Question

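I am trying to count how many times the word "the" appears in two books saved as text files. The code I am running returns zero for each book.

Here is my code: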

def word_count(filename):
    """Count specified words in a text"""
    try:
        with open(filename) as f_obj:
            contents = f_obj.readlines()
            for line in contents:
                word_count = line.lower().count('the')
            print (word_count)

    except FileNotFoundError:
        msg = "Sorry, the file you entered, " + filename + ", could not be found."
        print (msg)

dracula = 'C:\\Users\\HP\\Desktop\\Programming\\Python\\Python Crash Course\\TEXT files\\dracula.txt'
siddhartha = 'C:\\Users\\HP\\Desktop\\Programming\\Python\\Python Crash Course\\TEXT files\\siddhartha.txt'

word_count(dracula)
word_count(siddhartha)

What am I doing wrong here?

Answer 1

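You are reassigning word_count on every iteration. That means that at the end it holds only the number of occurrences of the in the last line of the file. You should be accumulating a sum instead. Another thing: should there match? Probably not. You probably want to use line.split(). Also, you can iterate over the file object directly; there is no need for .readlines(). Finally, use a generator expression to simplify. My first example does not use a generator expression; the second one does: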

def word_count(filename):
    with open(filename) as f_obj:
        total = 0
        for line in f_obj:
            total += line.lower().split().count('the')
        print(total)

def word_count(filename):
    with open(filename) as f_obj:
        total = sum(line.lower().split().count('the') for line in f_obj)
        print(total)
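
One caveat with split(): it only splits on whitespace, so tokens such as "the," keep their punctuation and are not counted. A minimal sketch of one possible alternative, using a word-boundary regular expression, is shown here. The helper name count_word and the sample sentence are made up for illustration and are not part of the original answer:

```python
import re

def count_word(text, word):
    """Count whole-word, case-insensitive occurrences of word in text."""
    # \b matches a word boundary, so "there" and "other" are not counted.
    pattern = r"\b" + re.escape(word) + r"\b"
    return len(re.findall(pattern, text, flags=re.IGNORECASE))

# Hypothetical sample text, chosen to contain "there", "then", "the," and "other".
sample = "The cat sat there; then the dog, the bird and the, um, other."

substring_hits = sample.lower().count("the")      # counts "the" inside other words too
split_hits = sample.lower().split().count("the")  # misses "the," because of the comma
regex_hits = count_word(sample, "the")            # whole words only, punctuation ignored

print(substring_hits, split_hits, regex_hits)  # 7 3 4
```

Which of the three counts is the right one depends on what should qualify as an occurrence of the word, so treat the regex as an option rather than a required change.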

Answer 2

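Unless the word "the" appears on the last line of each file, you will see zero.

You probably want to initialize the word_count variable to zero and then use augmented addition (+=):

For example: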
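
To make the effect concrete, here is a small sketch that runs the same loop over an in-memory list instead of a file. The three sample lines are invented for illustration; the last one deliberately contains no "the":

```python
# Hypothetical stand-in for the lines of a book.
lines = ["The end of the road\n", "the start\n", "Nothing here.\n"]

# Reassignment: each iteration overwrites the value from the previous line.
last_only = 0
for line in lines:
    last_only = line.lower().count('the')

# Accumulation: each iteration adds to a running total.
total = 0
for line in lines:
    total += line.lower().count('the')

print(last_only, total)  # 0 3
```

The first loop reports only the count for the final line, which is why a book whose last line lacks "the" prints zero.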

def word_count(filename):
    """Count specified words in a text"""
    try:
        word_count = 0                                       # <- change #1 here
        with open(filename) as f_obj:
            contents = f_obj.readlines()
            for line in contents:
                word_count += line.lower().count('the')      # <- change #2 here
            print(word_count)

    except FileNotFoundError:
        msg = "Sorry, the file you entered, " + filename + ", could not be found."
        print(msg)

dracula = 'C:\\Users\\HP\\Desktop\\Programming\\Python\\Python Crash Course\\TEXT files\\dracula.txt'
siddhartha = 'C:\\Users\\HP\\Desktop\\Programming\\Python\\Python Crash Course\\TEXT files\\siddhartha.txt'

word_count(dracula)
word_count(siddhartha)

Augmented addition is not required, just helpful. This line:

word_count += line.lower().count('the')

could be written as

word_count = word_count + line.lower().count('the')

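However, you also don't need to read all of the lines into memory at once. You can iterate over the lines directly from the file object. For example: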

def word_count(filename):
    """Count specified words in a text"""
    try:
        word_count = 0
        with open(filename) as f_obj:
            for line in f_obj:                     # <- change here
                word_count += line.lower().count('the')
        print(word_count)

    except FileNotFoundError:
        msg = "Sorry, the file you entered, " + filename + ", could not be found."
        print(msg)

dracula = 'C:\\Users\\HP\\Desktop\\Programming\\Python\\Python Crash Course\\TEXT files\\dracula.txt'
siddhartha = 'C:\\Users\\HP\\Desktop\\Programming\\Python\\Python Crash Course\\TEXT files\\siddhartha.txt'

word_count(dracula)
word_count(siddhartha)
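
If the count is needed elsewhere in a program, the function can return it rather than print it. The following is only a sketch under that assumption: the name count_in_file, the word parameter, and the temporary file are illustrative and do not come from the original code. Like the code above, it counts substrings, so "there" still contributes a match.

```python
import os
import tempfile

def count_in_file(filename, word='the'):
    """Return how often word occurs as a substring, or None if the file is missing."""
    try:
        with open(filename) as f_obj:
            # Iterate over the file object directly and sum the per-line counts.
            return sum(line.lower().count(word) for line in f_obj)
    except FileNotFoundError:
        print("Sorry, the file you entered, " + filename + ", could not be found.")
        return None

# Throwaway file so the sketch runs without the books on disk.
with tempfile.NamedTemporaryFile('w', suffix='.txt', delete=False) as tmp:
    tmp.write("The first line\nthe second line\nno match on this one\n")
    tmp_path = tmp.name

found = count_in_file(tmp_path)    # file exists: returns the total
os.remove(tmp_path)
missing = count_in_file(tmp_path)  # file is gone: prints the message, returns None

print(found, missing)  # 2 None
```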

Answer 3

Another way:

with open(filename) as f_obj:
    contents = f_obj.read()
    print("The word 'the' appears " + str(contents.lower().count('the')) + " times")

Answer 4

import os
def word_count(filename):
    """Count specified words in a text"""
    if os.path.exists(filename):
        if not os.path.isdir(filename):
            with open(filename) as f_obj:
                print(f_obj.read().lower().count('the'))
        else:
            print("is path to folder, not to file '%s'" % filename)
    else:
        print("path not found '%s'" % filename)

Answer 1: score 3, 2016-07-31 02:09:22
Answer 2: score 1, accepted, 2016-07-31 02:06:20
Answer 3: score 1, 2018-11-05 17:26:40
Answer 4: score 0, 2016-07-31 02:10:04
