繁体   English   中英

如何使用正则表达式在Python中获取段落中的最后一个单词

[英]How to get the last word(s) in a paragraph in Python using regex

我正在寻找一种方法来提取一行中的最后一个单词。 我只想提取名字:Mike我的代码是

import re

text_to_search = '''
I like Apples and bananas 
I like fruits and yogurt
thisUser: Your name : Mike Lewis
Email: mike@mail.com
type: Fullresopnse
'''
pattern = re.compile(r'thisUser: Your name :\s[A-Z]\w+')

matches = pattern.search(text_to_search)

print(matches)

运行这段代码让我:

re.Match object; span=(54, 80), match='thisUser: Your name : Mike'

如何只打印"Mike""Mike lewis"

您可以在这里尝试使用re.findall

matches = re.findall(r'\bYour name\s*:\s*(\S+)\s+(\S+)', text_to_search)
print("first name: " + matches[0][0])
print("last name: " + matches[0][1])

打印:

first name: Mike
last name: Lewis

re.findall在这里的潜在优势可能是,如果您希望文本中包含多个名称条目。

此表达式具有一个捕获组,该组将返回Mike:

thisUser:\s*Your name\s*:\s*(\S+)

演示

测试

import re

regex = r"thisUser:\s*Your name\s*:\s*(\S+)"

test_str = ("I like Apples and bananas \n"
    "I like fruits and yogurt\n"
    "thisUser: Your name : Mike Lewis\n"
    "Email: mike@mail.com\n"
    "type: Fullresopnse")

matches = re.finditer(regex, test_str, re.MULTILINE)

for matchNum, match in enumerate(matches, start=1):

    print ("Match {matchNum} was found at {start}-{end}: {match}".format(matchNum = matchNum, start = match.start(), end = match.end(), match = match.group()))

    for groupNum in range(0, len(match.groups())):
        groupNum = groupNum + 1

        print ("Group {groupNum} found at {start}-{end}: {group}".format(groupNum = groupNum, start = match.start(groupNum), end = match.end(groupNum), group = match.group(groupNum)))

暂无
暂无

声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.

 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM