仅提取首字母大写的整个单词

Question

I have a text file need to be analyzed here, what I am interested is only the whole word with the first letter capitalized,我这里有一个文本文件需要分析，我感兴趣的只是第一个字母大写的整个单词，

For example: test string: Everyday HOLDS the poSSibility Of A Miracle例如：测试字符串： Everyday HOLDS the poSSibility Of A Miracle

I want to capture: Everyday Of A Miracle我想捕捉： Everyday Of A Miracle

I am currently trying to build my regular expression in Python, strangely, my regex only can capture the first whole word that is captalized.我目前正在尝试在 Python 中构建我的正则表达式，奇怪的是，我的正则表达式只能捕获第一个大写的整个单词。

Test String: Everyday HOLDS the poSSibility Of A Miracle测试字符串： Everyday HOLDS the poSSibility Of A Miracle

My regex: ^([AZ])?([az])+我的正则表达式： ^([AZ])?([az])+

Capture: Everyday捕获： Everyday

What am I missing here ?我在这里错过了什么？

Answer 1

Instead of anchoring the regex at the beginning of the string, utilize boundary checking:不是将正则表达式锚定在字符串的开头，而是利用边界检查：

import re
s = 'Everyday HOLDS the poSSibility Of A Miracle'
new_s = ' '.join(re.findall(r'\b[A-Z][a-z]+|\b[A-Z]\b', s))

Output:输出：

'Everyday Of A Miracle'

Answer 2

Without regex (only if words are delimited by whitespaces):没有正则表达式（仅当单词由空格分隔时）：

>>> s='Everyday HOLDS the poSSibility Of A Miracle'
>>> [x for x in s.split() if x.title()==x]
['Everyday', 'Of', 'A', 'Miracle']

Note that you can also use re.split to split on any non-letter characters.请注意，您还可以使用 re.split 拆分任何非字母字符。

仅提取首字母大写的整个单词

问题描述

2 个解决方案

解决方案1
4 2018-05-06 19:49:59

解决方案2
0 2018-05-06 20:02:59

仅提取首字母大写的整个单词

问题描述

2 个解决方案

解决方案1 4 2018-05-06 19:49:59

解决方案2 0 2018-05-06 20:02:59

解决方案1
4 2018-05-06 19:49:59

解决方案2
0 2018-05-06 20:02:59