解析字符串模式-Python

Question

I have a string pattern (for a xml test reporter) in the following pattern: 我在以下模式中有一个字符串模式（用于xml测试报告程序）：

'testsets.testcases.[testset].[testcase]-[date-stamp]'

For example: 例如：

a='testsets.testcases.test_different_blob_sizes.TestDifferentBlobSizes-20150430130436'

I know I always can parse the testset and testcase names by doing: 我知道我总是可以通过执行以下操作来解析testcase testset和testcase名称：

temp = a.split("-")[0]
current = temp.split(".")
testset = '.'.join(current[:-1]) + ".py"
testcase = current[-1]

However, I want to accomplish that using a more pythonic way, like regex or any other expression that I would do it in a single line. 但是，我想使用一种更Python的方式来实现这一点，例如regex或我将在一行中完成的任何其他表达式。 How can I accomplish that? 我该怎么做？

Answer 1

You can try: 你可以试试：

testset, testcase = re.search('(.*)\.(.*)-.*', a).group(1, 2)
testset += '.py'

re.search returns a MatchObject on matches, and it has a group method we can use to extract match groups for the regex ("()"s in the regex). re.search在匹配MatchObject上返回MatchObject ，它具有一个group方法，可用于为正则表达式（正则表达式中的“（）”）提取匹配组。

Answer 2

只需使用groups ，从正则表达式搜索组获得：

data = re.search(r'.+\..+\.(.+)\.(.+)-(\d+)', string).groups()

Answer 3

If you strictly want to pull out the testset and testcase, ie "test_different_blob_sizes" and "TestDifferentBlobSizes", as in the first part of your question, you can just do: 如您在问题的第一部分中一样，如果严格要提取测试集和测试用例，即“ test_different_blob_sizes”和“ TestDifferentBlobSizes”，则可以执行以下操作：

testset, testcase = re.split('[.-]',s)[2:4]

For compact regexp-based code based on what you have, see Ziyao Wei's response. 有关基于所拥有内容的紧凑型正则表达式代码的信息，请参见Ziyao Wei的回复。

解析字符串模式-Python

问题描述

3 个解决方案

解决方案1
3 已采纳 2015-05-19 16:33:16

解决方案2
2 2015-05-19 16:35:43

解决方案3
0 2015-05-19 16:46:31

解析字符串模式-Python

问题描述

3 个解决方案

解决方案1 3 已采纳 2015-05-19 16:33:16

解决方案2 2 2015-05-19 16:35:43

解决方案3 0 2015-05-19 16:46:31

解决方案1
3 已采纳 2015-05-19 16:33:16

解决方案2
2 2015-05-19 16:35:43

解决方案3
0 2015-05-19 16:46:31