Python 從具有特定 substring 的字符串中獲取 N 個字符

Question

我從圖像文件中提取了一個很長的字符串。 字符串看起來像這樣

...\n\nDate: 01.01.2022\n\nArticle-no: 123456789\n\nArticle description: asdfqwer 1234...\n...

如何僅提取 substring "Article-no:"之后的 10 個字符？

我嘗試使用像這樣的 rfind 使用不同的方法來解決它，但是如果開始和結束字符串不准確，它往往會時不時地失敗。

    s = "... string shown above ..."
    start = "Article-no: "
    end = "Article description: "
    print(s[s.find(start)+len(start):s.rfind(end)])

Answer 1

您可以使用split ：

string.split("Article-no: ", 1)[1][0:10]

Answer 2

為此，正則表達式可能會派上用場。

import re

# Create a pattern which matches "Article-no: " literally,
# and then grabs the digits that follow.
pattern = re.compile(r"Article-no: (\d+)")
s = "...\n\nDate: 01.01.2022\n\nArticle-no: 123456789\n\nArticle description: asdfqwer 1234...\n..."

match = pattern.search(s)
if match:
    print(match.group(1))

這輸出：

123456789

使用的正則表達式是Article-no: (\d+) ，它有以下部分：

Article-no:      # Match this text literally
(                # Open a new group (i.e. group 1)
\d+              # Match 1 or more occurrences of a digit
)                # Close group 1

re模塊將在字符串中搜索匹配的位置，然后您可以從匹配中提取數字。

Python 從具有特定 substring 的字符串中獲取 N 個字符

問題描述

2 個解決方案

解決方案1
4 已采納 2022-01-26 14:13:31

解決方案2
0 2022-01-26 14:11:50

Python 從具有特定 substring 的字符串中獲取 N 個字符

問題描述

2 個解決方案

解決方案1 4 已采納 2022-01-26 14:13:31

解決方案2 0 2022-01-26 14:11:50

解決方案1
4 已采納 2022-01-26 14:13:31

解決方案2
0 2022-01-26 14:11:50