How to remove text before and after specific words in a string using Python regex

Question

I have the string "copy table a (no = 1, name = xyz, city = c0nl ) from 'a.dat';". I want to remove the words between 'copy' and 'from' but keep the file name. My desired output is "copy a from a.dat;".

Any help would be great. I want to use a regular expression for this.

Answer 1

You can use the regex module re and its function sub (substitute) together with a lookahead (?=from) and a lookbehind (?<=copy ), collectively known as lookarounds, to remove only the part (.*) that lies in between:

import re
# print() with parentheses so the snippet also runs on Python 3
print(re.sub(r'(?<=copy )(.*)(?=from)', '', "copy table values from 'a.dat';"))

Output:

copy from 'a.dat';
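
To see what the lookarounds actually consume, here is a minimal sketch that applies the same pattern to the string from the question. The variable names (text, pattern, captured, cleaned) are only illustrative and are not part of the original answer.

```python
import re

# String taken from the question
text = "copy table a (no = 1, name = xyz, city = c0nl ) from 'a.dat';"

# Same pattern as in the answer: the lookbehind and lookahead are
# zero-width, so only the text between "copy " and "from" is matched.
pattern = r'(?<=copy )(.*)(?=from)'

captured = re.search(pattern, text).group(1)
cleaned = re.sub(pattern, '', text)

print(repr(captured))  # 'table a (no = 1, name = xyz, city = c0nl ) '
print(cleaned)         # copy from 'a.dat';
```

Because "copy " and "from" are only asserted and never consumed, they stay in the result without having to be repeated in the replacement string.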

Answer 2

You can do:

import re
mystr = "copy table values from 'a.dat';"
print(re.sub('copy.*from', 'copy from', mystr))

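This way you don't have to worry about spaces or lookarounds. Note that .* is greedy, so if 'from' appears more than once in the string, the match extends to the last occurrence.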
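
Greediness only matters when 'from' occurs more than once. The following sketch compares the greedy pattern with a non-greedy variant (.*?) on a made-up two-statement string; the sample string and variable names are assumptions for illustration only.

```python
import re

# Hypothetical input containing two copy statements on one line
text = "copy t1 from 'a.dat'; copy t2 from 'b.dat';"

# Greedy: .* runs to the LAST "from", swallowing the first statement
greedy = re.sub('copy.*from', 'copy from', text)

# Non-greedy: .*? stops at the FIRST "from" after each "copy"
lazy = re.sub('copy.*?from', 'copy from', text)

print(greedy)  # copy from 'b.dat';
print(lazy)    # copy from 'a.dat'; copy from 'b.dat';
```

For a single statement like the one in the question, both patterns give the same result.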

Answer 3

(?<=\bcopy\b)[\s\S]*?(?=\s*\bfrom\b)

Use \b and lookarounds. See the demo:

https://regex101.com/r/sS2dM8/11

import re
p = re.compile(r'(?<=\bcopy\b)[\s\S]*?(?=\s*\bfrom\b)', re.MULTILINE)
test_str = "copy table values from 'a.dat';"
subst = ""

# Replace everything between the two keywords with the empty string
result = p.sub(subst, test_str)
print(result)

Output: copy from 'a.dat';
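
None of the answers above reproduces the exact output requested in the question ("copy a from a.dat;"), which also keeps the table name and drops the quotes around the file name. One possible sketch uses capture groups instead of lookarounds. It assumes the input always has the shape copy table <name> (...) from '<file>', which the question does not state explicitly.

```python
import re

text = "copy table a (no = 1, name = xyz, city = c0nl ) from 'a.dat';"

# Assumed input shape: copy table <name> ( ... ) from '<file>'
# Group 1 captures the table name, group 2 the file name without quotes.
pattern = re.compile(r"copy\s+table\s+(\w+)\s*\(.*?\)\s*from\s+'([^']*)'")

# Rebuild the statement from the two captured pieces
result = pattern.sub(r"copy \1 from \2", text)
print(result)  # copy a from a.dat;
```

If the real input deviates from that shape (for example, no column list in parentheses), the pattern does not match and the string is returned unchanged.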

3 answers

Answer 1
5 2015-08-27 05:16:33

Answer 2
1 2015-08-27 07:44:01

Answer 3
0 2015-08-27 05:20:18
