REGEX-（使用Python 3.5）-在文件中查找字符串

Question

我正在打開一個.msg前景文件，需要從中提取一些特定數據。 我對regex還是有點陌生，找不到我需要的東西。

以下是文件中的數據，其中包含一些選項卡，看起來像是fyi：

NEWS ID:    918273/1
TITLE:  News Platform Solution Overview (CNN) (US English Session)
ACCOUNT:    supernewsplatformacct (55712)

Your request has been completed.

Output Format   MP4

Please click on the "Download File" link below to access the download page.

Download File <http://news.downloadwebsitefake.com/newsid/file1294757493292848575.mp4>

我需要：

918273 -from- NEWS ID: 918273/1

News Platform Solution Overview (CNN) (US English Session) -from- TITLE: News Platform Solution Overview (CNN) (US English Session)

supernewsplatformacct -from- ACCOUNT: supernewsplatformacct (55712)

http://news.downloadwebsitefake.com/newsid/file1294757493292848575.mp4 -from- Download File <http://news.downloadwebsitefake.com/newsid/file1294757493292848575.mp4>

我正在努力

[\\n\\r][ \\t]*NEWS ID:[ \\t]*([^\\n\\r]*)

但是沒有運氣。 任何幫助將不勝感激！

Answer 1

(?:^|(?<=\n))[^:<\n]*[:<](.*)

您可以將其與re.findall一起使用。請re.findall演示。

https://regex101.com/r/d7RPNB/2

Answer 2

msg = """NEWS ID:    918273/1
TITLE:  News Platform Solution Overview (CNN) (US English Session)
ACCOUNT:    supernewsplatformacct (55712)

Your request has been completed.

Output Format   MP4

Please click on the "Download File" link below to access the download page.

Download File <http://news.downloadwebsitefake.com/newsid/file1294757493292848575.mp4>"""
import re
regex = r'[^:]+:\s+(.*)$|[^<]+<([^>]+)>'
matches = [re.match(regex, i).group(1) or re.match(regex, i).group(2) for i in msg.split('\n') if i and re.match(regex, i)]
print(matches)

REGEX-（使用Python 3.5）-在文件中查找字符串

問題描述

2 個解決方案

解決方案1
2 已采納 2016-12-09 20:19:11

解決方案2
0 2016-12-09 21:02:51

REGEX-（使用Python 3.5）-在文件中查找字符串

問題描述

2 個解決方案

解決方案1 2 已采納 2016-12-09 20:19:11

解決方案2 0 2016-12-09 21:02:51

解決方案1
2 已采納 2016-12-09 20:19:11

解決方案2
0 2016-12-09 21:02:51