簡體 English 中英

根據字符串python的start關鍵字和end關鍵字切割一個字符串

[英]Cutting a string based on the start keyword and end key word of the string python

原文 2020-03-15 01:41:23 3 1 python/ apache-tika

我有一個 pdf，我通過 python 中的 Tika 包閱讀了它。 似乎 tika 只能閱讀整個 pdf 而我只需要閱讀第一頁。

我的代碼看起來像：

from tika import parser
raw = parser.from_file(pdfname)
rawtext = raw['content']

我想通過開始關鍵字和結束關鍵字拆分原始文本。 我怎么做？

1 個解決方案

您可以使用regex來選擇您感興趣的文本，例如：

import re


raw_text = 'this is a sample of text'
start = 'is'
end = 'of'

start_index = re.search(r'\b' + start + r'\b', raw_text).start()
end_index = re.search(r'\b' + end + r'\b', raw_text).end()
section_of_text = raw_text[start_index:end_index]
print(section_of_text)

>>> "is a sample of"

用開始和結束詞分割字符串

[英]Split String with Start and End word

用於檢查字符串中單詞的開頭和結尾的python正則表達式

[英]python regular expression to check start and end of a word in a string

根據標簽列表中該單詞的索引位置，查找字符串中單詞的開始和結束位置

[英]Find the start and end position of a word in a string based on the index position of that word from a label list

根據開始和結束位置python從字符串中獲取子字符串

[英]get substring from string based on start and end position python

Python 根據流體起點/終點移除管柱部分

[英]Python remove sections of string based on fluid start/end point

匹配字符串開頭，中間和結尾的完整單詞

[英]Match complete word at start, middle and end of string

根據開始索引和結束索引刪除字符串

[英]removing string based on start index and end index

Python在特定點切割字符串

[英]Python Cutting a string on a certain point

Python中的字符串切割行為很奇怪

[英]String cutting in Python with a weird behaviour

Python切割字符串導致錯誤

[英]Python cutting string results in error

暫無

暫無

聲明:本站的技術帖子網頁，遵循CC BY-SA 4.0協議，如果您需要轉載，請注明本站網址或者原文地址。任何問題請咨詢:yoyou2525@163.com.

相關問題 用開始和結束詞分割字符串用於檢查字符串中單詞的開頭和結尾的python正則表達式根據標簽列表中該單詞的索引位置，查找字符串中單詞的開始和結束位置根據開始和結束位置python從字符串中獲取子字符串 Python 根據流體起點/終點移除管柱部分匹配字符串開頭，中間和結尾的完整單詞根據開始索引和結束索引刪除字符串 Python在特定點切割字符串 Python中的字符串切割行為很奇怪 Python切割字符串導致錯誤

相關標簽

粵ICP備18138465號 © 2020-2024 STACKOOM.COM