來自乳膠線的REGEX解析命令 - Python

Question

我正在嘗試從加載的每一行解析並刪除任何\\command （ \\textit等...）（來自.tex文件或來自lilypond文件的其他命令為[\\clef, \\key, \\time] ）。

我怎么能這樣做？

我試過的

import re
f = open('example.tex')
lines = f.readlines()
f.close()

pattern = '^\\*([a-z]|[0-9])' # this is the wrong regex!!
clean = []
for line in lines:
    remove = re.match(pattern, line)
    if remove:
        clean.append(remove.group())

print(clean)

例

輸入

#!/usr/bin/latex

\item More things
\subitem Anything

預期產出

More things
Anything

Answer 1

您可以使用此模式使用簡單的正則表達式替換^\\\\[^\\s]* ：

python中的示例代碼：

import re
p = re.compile(r"^\\[^\s]*", re.MULTILINE)

str = '''
\item More things
\subitem Anything
'''

subst = ""

print re.sub(p, subst, str)

結果將是：

More things
Anything

Answer 2

這將有效：

'\\\w+\s'

它搜索反斜杠，然后搜索一個或多個字符和空格。

來自乳膠線的REGEX解析命令 - Python

問題描述

我試過的

例

2 個解決方案

解決方案1
2 已采納 2014-05-05 22:24:50

解決方案2
0 2014-05-05 22:15:23

來自乳膠線的REGEX解析命令 - Python

問題描述

我試過的

例

2 個解決方案

解決方案1 2 已采納 2014-05-05 22:24:50

解決方案2 0 2014-05-05 22:15:23

解決方案1
2 已采納 2014-05-05 22:24:50

解決方案2
0 2014-05-05 22:15:23