与xpath，Lxml等效的Findall

Question

I am extracting text with respect to tags and I need to get them in a list form wrt p tags. 我正在提取有关标签的文本，我需要以wrt p标签的列表形式获取它们。 I have this xpath expression as: 我有这个xpath表达式为：

 find =  etree.XPath("//w:p//.//*[local-name() = 'ins']//text()" ,namespaces={'w':"http://schemas.openxmlformats.org/wordprocessingml/2006/main"})

And i want to use it in a findall expression. 我想在findall表达式中使用它。 I tried: 我试过了：

inserted_list_1=[]
for p in lxml_tree.findall('.//{' + w + '}p'):
    inserted_list_1.append([t.text for t in p.findall('.//{' + w + '}ins')])

but all this returns is a list full of None values whilst the former xpath works perfectly. 但是所有返回的结果都是一个None值的列表，而以前的xpath可以完美运行。
I think there's some intermediate path missing. 我认为缺少一些中间路径。

Answer 1

You cannot use that expression with findall() ; 您不能将该表达式与findall() ； the findall() method deliberately keeps compatibility with the limited ElementTree API XPath support . findall()方法故意与有限的ElementTree API XPath支持保持兼容性。

Use the xpath() method instead: 使用xpath()方法代替：

for p in lxml_tree.xpath('.//w:p', namespaces={'w': w}):

and just use namespace prefixes for much more readable queries. 并仅使用名称空间前缀进行可读性更高的查询。

If you just wanted to extract all contained text, you can use: 如果您只想提取所有包含的文本，则可以使用：

[t for t in p.xpath('../w:p//w:ins//text()',namespaces={'w': w})]

与xpath，Lxml等效的Findall

问题描述

1 个解决方案

解决方案1
4 已采纳 2014-10-01 10:30:18

与xpath，Lxml等效的Findall

问题描述

1 个解决方案

解决方案1 4 已采纳 2014-10-01 10:30:18

解决方案1
4 已采纳 2014-10-01 10:30:18