Extracting complex substrings from a string using regular expressions in Python

Question

I have a string, say:

text = "i have on 31-Dec-08 USD 5234765 which I gave it in the donation"

I tried:

import re

pattern = r"^[\d]{2}.*,[\d]{3}$"
data = re.findall(pattern, text)

for s in data:
    print(s)

Desired output:

[31-Dec-08, USD, 5234765]

Answer 1

You can do it like this:

import re

regex = r"(\w+-\w+-\w+)|([A-Z]{3})|(\d+)"

test_str = "i have on 31-Dec-08 USD 5234765 which I gave it in the donation"


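# With several groups, findall returns one tuple per match, with '' for groups that did not match
matches = re.findall(regex, test_str)
# Flatten the tuples and drop the empty strings
temp = [group for match in matches for group in match if group]

print(temp)  # ['31-Dec-08', 'USD', '5234765']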

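\w matches any word character (equivalent to [a-zA-Z0-9_] for ASCII text)
+ matches the previous token one or more times, as many times as possible, giving back as needed (greedy)
- matches the character - literally
[A-Z]{3} matches exactly 3 uppercase letters
\d matches a digit (equivalent to [0-9] for ASCII text)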
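
The alternation above picks up any run of digits or any three uppercase letters anywhere in the string, so it can over-match on other inputs. A stricter variant is sketched below. It assumes the three fields always appear together in the order date, currency, amount, separated by whitespace, and that the date looks like DD-Mon-YY. The names PATTERN and extract_fields are illustrative and do not come from the original answer.

```python
import re

# Assumed layout: DD-Mon-YY date, three-letter currency code, integer amount
PATTERN = re.compile(
    r"(?P<date>\d{1,2}-[A-Za-z]{3}-\d{2})\s+"
    r"(?P<currency>[A-Z]{3})\s+"
    r"(?P<amount>\d+)"
)


def extract_fields(text):
    """Return [date, currency, amount] from the first match, or None if absent."""
    match = PATTERN.search(text)
    if match is None:
        return None
    return [match.group("date"), match.group("currency"), match.group("amount")]


text = "i have on 31-Dec-08 USD 5234765 which I gave it in the donation"
print(extract_fields(text))  # ['31-Dec-08', 'USD', '5234765']
```

Named groups keep each field addressable by name, and re.search avoids the ^ and $ anchors, which would require the whole string to match the pattern.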
