python regex匹配行，在字符串后包含數字，字符串末尾有數字

Question

我使用正則表達式捕獲文件中的文本，但是字符串包含錯誤的數字。 我沒有捕獲它，但是在嘗試捕獲下一行時，它僅返回字符串，而不返回下一行。 當沒有錯誤的尾隨數字時，我能夠捕獲它。

我已經嘗試過許多正則表達式的組合，但尚未成功。

文本：

sentences
company_name: company, ltd6

numbers 99 and letters 99 (I want to match anything here and nothing after)
numbers 99 and letters 99 (I don't want to match anything here or after)

成功捕獲正則表達式但帶有數字的代碼：

company_name = re.findall(r"company_name:\s(.*)\D.+", text)

成功捕獲不帶數字的正則表達式的代碼：

company_name = re.findall(r"company_name:\s(.*)(?=.\D.+)", text)

嘗試捕獲以下行：

next_line = re.findall(r"company_name:\s(.*)(?=.\D.+).*", text)

我希望捕獲下一行，但不要。

Answer 1

這將僅獲得下一行，而忽略后續行：

next_line = re.sub(r".*company_name:[^\n]+\n*([^\n]+).*", r'\1', text, flags=re.S)

即： numbers 99 and letters 99 (I want to match anything here and nothing after)

Answer 2

根據您的原始表達方式，我猜測可能是這種表達方式，

.*company_name:\s*(.*\D)\s*(\w.*)

可能有用。 我們有兩組(.*\\D)和(\\w.*) ，其中捕獲了我們想要的輸出。

演示1

也許這個：

.*company_name:\s*(.*)\s*(\w.*)

演示2

測試

import re

regex = r".*company_name:\s*(.*\D)\s*(\w.*)"

test_str = ("sentences\n"
    "company_name: company, ltd6\n\n"
    "numbers 99 and letters 99 (I want to match anything here)")

matches = re.finditer(regex, test_str, re.MULTILINE)

for matchNum, match in enumerate(matches, start=1):

    print ("Match {matchNum} was found at {start}-{end}: {match}".format(matchNum = matchNum, start = match.start(), end = match.end(), match = match.group()))

    for groupNum in range(0, len(match.groups())):
        groupNum = groupNum + 1

        print ("Group {groupNum} found at {start}-{end}: {group}".format(groupNum = groupNum, start = match.start(groupNum), end = match.end(groupNum), group = match.group(groupNum)))

python regex匹配行，在字符串后包含數字，字符串末尾有數字

問題描述

2 個解決方案

解決方案1
0 2019-06-28 02:29:06

解決方案2
0 2019-06-28 02:44:59

演示1

演示2

測試

python regex匹配行，在字符串后包含數字，字符串末尾有數字

問題描述

2 個解決方案

解決方案1 0 2019-06-28 02:29:06

解決方案2 0 2019-06-28 02:44:59

演示1

演示2

測試

解決方案1
0 2019-06-28 02:29:06

解決方案2
0 2019-06-28 02:44:59