查找列表中以某些字母开头的单词

Question

I am trying to output the total of how many words start with a letter 'a' in a list from a separate text file. 我试图从单独的文本文件中输出列表中以字母'a'开头'a'单词总数。 I'm looking for an output such as this. 我正在寻找这样的输出。

35 words start with a letter 'a'.

However, i'm outputting all the words that start with an 'a' instead of the total with my current code. 但是，我正在输出以'a'开头的所有单词，而不是当前代码中的全部单词。 Should I be using something other than a for loop? 我是否应该使用for循环以外的其他方式？

So far, this is what I have attempted: 到目前为止，这是我尝试过的：

wordsFile = open("words.txt", 'r')
words = wordsFile.read()
wordsFile.close()
wordList = words.split()

print("Words:",len(wordList)) # prints number of words in the file.

a_words = 0

for a_words in wordList:
    if a_words[0]=='a':
        print(a_words, "start with the letter 'a'.")

The output I'm getting thus far: 到目前为止，我得到的输出是：

Words: 334
abate start with the letter 'a'.
aberrant start with the letter 'a'.
abeyance start with the letter 'a'.

and so on. 等等。

Answer 1

You could replace this with a sum call in which you feed 1 for every word in wordList that starts with a : 你可以用替换此sum通话中你喂1中的每一个字wordList是开头a ：

print(sum(1 for w in wordList if w.startswith('a')), 'start with the letter "a"')

This can be further trimmed down if you use the boolean values returned by startswith instead, since True is treated as 1 in these contexts the effect is the same: 如果您使用startswith返回的布尔值来代替，则可以进一步缩小，因为在这些情况下， True被视为1 ，因此效果是相同的：

print(sum(w.startswith('a') for w in a), 'start with the letter "a"')

With your current approach, you're not summing anything, you're simply printing any word that matches. 使用当前的方法，您无需求和，仅打印任何匹配的单词。 In addition, you're re-naming a_word from an int to the contents of the list as you iterate through it. 另外，您在迭代时将a_word从一个int重命名为列表的内容。

Also, instead of using a_word[0] to check for the first character, you could use startswith(character) which has the same effect and is a bit more readable. 另外，您可以使用startswith(character)来代替第一个字符，而不用使用a_word[0]来检查第一个字符，该命令具有相同的效果并且可读性更高。

Answer 2

You are using the a_words as the value of the word in each iteration and missing a counter. 您在每次迭代中都使用a_words作为单词的值，并且缺少计数器。 If we change the for loop to have words as the value and reserved a_words for the counter, we can increment the counter each time the criteria is passed. 如果我们更改for循环以将words作为值并为计数器保留a_words ，则每次通过标准时，我们都可以递增计数器。 You could change a_words to wordCount or something generic to make it more portable and friendly for other letters. 您可以将a_words更改为wordCount或其他通用名称，以使其对其他字母更易于携带和友好。

a_words = 0

for words in wordList:
    if words[0]=='a':
        a_words += 1

print(a_words, "start with the letter 'a'.")

Answer 3

sum(generator) is a way to go, but for completeness sake, you may want to do it with list comprehension (maybe if it's slightly more readable or you want to do something with words starting with a etc.). sum(generator)是一种可行的方法，但是出于完整性考虑，您可能希望通过列表理解来实现（也许可读性更高，或者您想要对以等开头的单词进行处理）。

words_starting_with_a = [word for word in word_list if word.startswith('a')]

After that you may use len built-in to retrieve length of your new list. 之后，您可以使用内置的len来检索新列表的长度。

print(len(words_starting_with_a), "words start with a letter 'a'")

Answer 4

Simple alternative solution using re.findall function(without splitting text and for loop): 使用re.findall函数的简单替代解决方案（不拆分文本和for循环）：

import re
...
words = wordsFile.read()
...
total = len(re.findall(r'\ba\w+?\b', words))
print('Total number of words that start with a letter "a" : ', total)

查找列表中以某些字母开头的单词

问题描述

4 个解决方案

解决方案1
3 2016-09-14 19:16:46

解决方案2
2 已采纳 2016-09-14 19:22:34

解决方案3
1 2016-09-14 19:34:21

解决方案4
0 2016-09-14 19:39:31

查找列表中以某些字母开头的单词

问题描述

4 个解决方案

解决方案1 3 2016-09-14 19:16:46

解决方案2 2 已采纳 2016-09-14 19:22:34

解决方案3 1 2016-09-14 19:34:21

解决方案4 0 2016-09-14 19:39:31

解决方案1
3 2016-09-14 19:16:46

解决方案2
2 已采纳 2016-09-14 19:22:34

解决方案3
1 2016-09-14 19:34:21

解决方案4
0 2016-09-14 19:39:31