Python正则表达式跨行匹配

Question

I have a bibtex file formatted like this: 我有一个bibtex文件，格式如下：

@inproceedings{baz,
    AUTHOR={{Baz}, {S}. and Bar, {G}. and
      Foo, {M}},
    year={2013}
}

I have managed to capture a single entry (the entire text shown above), but I want a regex in Python that matches everything inside the AUTHOR={} brackets (across the newline). 我设法捕获了一个条目（上面显示的整个文本），但是我想要Python中的正则表达式与AUTHOR={}括号内的所有内容匹配（跨换行符）。 How can I do this in Python? 如何在Python中执行此操作？

Answer 1

re.compile(r"AUTHOR={([\sA-Za-z{},\.]+)},$", re.MULTILINE)

Answer 2

You can use the following regex that checks for 1 level of nested curly braces: 您可以使用以下正则表达式检查1级嵌套花括号：

(?ims)author\s*=\s*[{"]((?:[^{}]+?|{[^}]+?})+?)[}"]

See demo 观看演示

Sample code on IDEONE : IDEONE上的示例代码：

import re
p = re.compile(r'(?ims)author\s*=\s*[{"]((?:[^{}]+?|{[^}]+?})+?)[}"]')
test_str = "@inproceedings{baz,\n    AUTHOR = {{Baz}, {S}. and Bar, {G}. and\n      Foo, {M}},\n    year={2013}\n}\n@inproceedings{baz,\n    AUTHOR={{%Baz%}, {S!}. and Bar, {^G^}. and\n      Foo, {<M>}},\n    year={2013}\n}\n"
print [x.group(1) for x in re.finditer(p, test_str)]

Python正则表达式跨行匹配

问题描述

2 个解决方案

解决方案1
2 2015-05-21 17:01:52

解决方案2
1 已采纳 2015-05-21 20:01:47

Python正则表达式跨行匹配

问题描述

2 个解决方案

解决方案1 2 2015-05-21 17:01:52

解决方案2 1 已采纳 2015-05-21 20:01:47

解决方案1
2 2015-05-21 17:01:52

解决方案2
1 已采纳 2015-05-21 20:01:47