Python检查是否存在行（两个关键字）而忽略了之间的空白

Question

My example file have a structure: 我的示例文件具有以下结构：

Library      Collections
Library      XML    use_lxml=True
Library      ute_wtssim
Library      libraries.ta_infomodel
Library      ute_tshark
Library      PDMLChecker.py
Library    libraries.ta_ue_configuration
Library      libraries.ta_tshark
Library    libraries.ta_file_system
Resource     environment_setup.robot
Variables    ../tshark_filters.py
Variables    ../global_variables.py
Variables    tracing_tshark_filters.py

I would like to find in file line: Library libraries.ta_infomodel ignoring whitespaces (column width, which is not constant). 我想在文件行中找到： Library libraries.ta_infomodel忽略空格（列宽，这不是恒定的）。

Could you give me advice? 你能给我建议吗？

EDIT: I would like to check if line exist... Line which contains exactly two keywords, ignoring white spaces between. 编辑：我想检查是否存在行...行，其中包含正好两个关键字，忽略之间的空白。

Answer 1

Just use \\s* . 只需使用\\s* 。 \\s stands for a whitespace. \\s代表空格。 * means any number of (including zero): *表示任意数量（包括零）：

import re
s = '''Library      Collections
Library      XML    use_lxml=True
Library      ute_wtssim
Library      libraries.ta_infomodel
Library      ute_tshark
Library      PDMLChecker.py
Library      resources.DevWro1.pdml_validation.PdmlValidation
Library    libraries.ta_ue_configuration
Library      libraries.ta_tshark
Library    libraries.ta_file_system
Resource     environment_setup.robot
Variables    ../tshark_filters.py
Variables    ../global_variables.py
Variables    tracing_tshark_filters.py'''

matches = re.findall('Library\s*libraries\.ta_infomodel', s)

for match in matches:
    print(match)

Answer 2

Just try this: 尝试一下：

content = """
Library      Collections
Library      XML    use_lxml=True
Library      ute_wtssim
Library      libraries.ta_infomodel
Library      ute_tshark
Library      PDMLChecker.py
Library      resources.DevWro1.pdml_validation.PdmlValidation
Library    libraries.ta_ue_configuration
Library      libraries.ta_tshark
Library    libraries.ta_file_system
Resource     environment_setup.robot
Variables    ../tshark_filters.py
Variables    ../global_variables.py
Variables    tracing_tshark_filters.py
"""

import re
regex = re.compile(r'Library\s+libraries\.ta_infomodel')
line = regex.search(content)

Some explanations. 一些解释。 In order to find one line by a regex, you should use re.search , for many re.findall . 为了通过正则表达式查找一行，您应该对许多re.findall使用re.search 。

\\s+ - for several spaces. \\s+ -多个空格。

\\. - for the dot. -点。 Use have to add the slash because just dot means any character. 使用必须添加斜杠，因为仅点表示任何字符。

Answer 3

Can be done with list comprehension too 也可以使用列表理解

import re
lines = open(fname, 'r').readlines()
found = [s for s in lines if re.match(".* Library\s+libraries\.ta_infomodel.*", s)]

Python检查是否存在行（两个关键字）而忽略了之间的空白

问题描述

3 个解决方案

解决方案1
1 2017-12-01 15:29:40

解决方案2
0 已采纳 2017-12-01 15:39:29

解决方案3
-1 2017-12-01 15:43:12

Python检查是否存在行（两个关键字）而忽略了之间的空白

问题描述

3 个解决方案

解决方案1 1 2017-12-01 15:29:40

解决方案2 0 已采纳 2017-12-01 15:39:29

解决方案3 -1 2017-12-01 15:43:12

解决方案1
1 2017-12-01 15:29:40

解决方案2
0 已采纳 2017-12-01 15:39:29

解决方案3
-1 2017-12-01 15:43:12