Parsing a multi-line string with a regular expression

Question

This is the complete string I want to parse:

Response
--------
{
  Return Code: 1
  Key        : <None>
  Files      : [
    {
      Return Code: 0
      Data       : 'Value' is 1
'Value' is two
This is third line of output
    }
  ]
}

This is what I want the parsed text to look like:

'Value' is 1
'Value' is two
This is third line of output

I have already tried re.findall(), but I can't get the result I want.
This is the Python script that tries to parse the output with a regular expression:

import subprocess,re
output = subprocess.check_output(['staf', 'server.com', 'PROCESS', 'START', 'SHELL', 'COMMAND', "'uname'", 'WAIT', 'RETURNSTDOUT', 'STDERRTOSTDOUT'])
result = re.findall(r'Data\s+:\s+(.*)', output, re.DOTALL)[0]
print result

Script output

[root@server ~]# python test.py 
''uname'' is not recognized as an internal or external command,
operable program or batch file.

    }
  ]
}

Answer 1

Option 1

If you want the three lines after Data:, you can do the following, which captures those three lines into group 1:

match = re.search(r"Data\s*:\s*((?:[^\n]*[\r\n]+){3})", subject)
if match:
    result = match.group(1)
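
To try Option 1 end to end, here is a minimal sketch. It assumes the sample response from the question is already held in a string named subject (in Python 3, the bytes returned by subprocess.check_output would need decoding first). The helper name first_three_data_lines is made up for illustration.

```python
import re

# Sample response copied from the question.
subject = """Response
--------
{
  Return Code: 1
  Key        : <None>
  Files      : [
    {
      Return Code: 0
      Data       : 'Value' is 1
'Value' is two
This is third line of output
    }
  ]
}"""


def first_three_data_lines(text):
    """Return the three lines that follow 'Data :' as a list, or [] if absent."""
    match = re.search(r"Data\s*:\s*((?:[^\n]*[\r\n]+){3})", text)
    if not match:
        return []
    # group(1) still carries the line terminators; splitlines() drops them.
    return match.group(1).splitlines()


lines = first_three_data_lines(subject)
print(lines)
```

Note that this only works when the data is exactly three lines long; with fewer lines it would also pick up the closing brace lines.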

Option 2

If you want all lines after Data:, up to (but not including) the first line that contains }, change the regular expression to:

Data\s*:\s*((?:[^\n]*(?:[\r\n]+(?!\s*}))?)+)
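
As a rough sketch of how this pattern could be used from Python, the snippet below compiles it and applies it to a shortened copy of the sample response. The names DATA_BLOCK, extract_data and sample are assumptions made for this example, not part of the original answer.

```python
import re

# Option 2 pattern: keep consuming lines until the next line would be a "}" line.
DATA_BLOCK = re.compile(r"Data\s*:\s*((?:[^\n]*(?:[\r\n]+(?!\s*}))?)+)")


def extract_data(text):
    """Return the text after 'Data :' up to the first '}' line, or None."""
    match = DATA_BLOCK.search(text)
    return match.group(1) if match else None


# Shortened copy of the response from the question.
sample = (
    "      Return Code: 0\n"
    "      Data       : 'Value' is 1\n"
    "'Value' is two\n"
    "This is third line of output\n"
    "    }\n"
    "  ]\n"
    "}\n"
)

result = extract_data(sample)
print(result)
```

Unlike Option 1, this does not depend on the number of data lines, and the captured text has no trailing newline.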

Answer 2

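With the following regular expression you will find the three strings you are after.

Note that this depends heavily on the exact format of the response.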

>>> import re
>>> response = """
Response
--------
{
  Return Code: 1
  Key        : <None>
  Files      : [
    {
      Return Code: 0
      Data       : 'Value' is 1
'Value' is two
This is third line of output
    }
  ]
}"""
>>> re.findall(r"('Value'.*)\n(.*)\n(.*)\n.*}",response)
[("'Value' is 1", "'Value' is two", 'This is third line of output')]

You can also include the newlines in the groups, like this:

>>> re.findall(r"('Value'.*\n)(.*\n)(.*\n).*}",response)
[("'Value' is 1\n", "'Value' is two\n", 'This is third line of output\n')]

Which form you choose depends on how you process the result afterwards.

Update

How about this?

>>> re.findall(r"Data\s*:\s*(.*?)}",response,re.DOTALL)
["'Value' is 1\n'Value' is two\nThis is third line of output\n    "]

This captures everything after "Data :", starting at the first 'Value', up to (but not including) the first "}".
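
The captured text above still ends with the newline and indentation that precede the closing brace. A small sketch of one way to clean that up is shown below; the variable names captured and cleaned are illustrative, and the response string is a shortened copy of the sample.

```python
import re

# Shortened copy of the response from the question.
response = (
    "      Data       : 'Value' is 1\n"
    "'Value' is two\n"
    "This is third line of output\n"
    "    }\n"
    "  ]\n"
    "}"
)

# Non-greedy match with DOTALL: stop at the first "}" after "Data :".
captured = re.findall(r"Data\s*:\s*(.*?)}", response, re.DOTALL)

# Drop the trailing newline and indentation left in front of the "}".
cleaned = [block.rstrip() for block in captured]
print(cleaned[0])
```

Keep in mind that this approach ends the match early if the data itself contains a "}" character.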

2 solutions

Solution 1
0 accepted 2014-05-22 10:36:17

Solution 2
0 2014-05-22 10:47:36
