簡體 English 中英

Python正則表達式和標記化

[英]Python regular expressions and tokenization

原文 2014-02-05 02:39:34 4 2 python/ regex/ tokenize

我有一個字符串“ABC一二三”。

我有一個任務是將這個字符串標記為[“ABC”，一，二，三]，忽略句子末尾的句號。 我無法在句子結尾處刪除句號而不會干擾ABC的縮寫詞。

有沒有辦法讓我在句子結尾處刪除句點而不影響使用python正則表達式的首字母縮略詞？

2 個解決方案

word = re.compile(r'[A-Za-z.]*[A-Za-z]')
word.findall("A.B.C one two three.")    # => ['A.B.C', 'one', 'two', 'three']

line= "A.B.C one two three."
print line[:-1].split(' ')

可能也是這樣

使用 python 正則表達式的詞標記化

[英]Word tokenization using python regular expressions

將Perl正則表達式轉換為Python正則表達式

[英]Converting Perl Regular Expressions to Python Regular Expressions

逃避[在Python正則表達式中

[英]Escaping [ in Python Regular Expressions

Python正則表達式

[英]Python regular expressions

正則表達式無法使用Python

[英]Regular Expressions not working Python

Python正則表達式中的空格

[英]Spaces in Python Regular Expressions

具有OR函數的Python正則表達式

[英]Python regular expressions with OR function

正則表達式python

[英]Regular expressions python

正則表達式Python 3.4

[英]Regular Expressions Python 3.4

Python正則表達式替換

[英]Python regular expressions replacing

暫無

暫無

聲明:本站的技術帖子網頁，遵循CC BY-SA 4.0協議，如果您需要轉載，請注明本站網址或者原文地址。任何問題請咨詢:yoyou2525@163.com.

相關問題 使用 python 正則表達式的詞標記化將Perl正則表達式轉換為Python正則表達式逃避[在Python正則表達式中 Python正則表達式正則表達式無法使用Python Python正則表達式中的空格具有OR函數的Python正則表達式正則表達式python 正則表達式Python 3.4 Python正則表達式替換

相關標簽

粵ICP備18138465號 © 2020-2024 STACKOOM.COM