繁体 English 中英

lxml findall div和span标签

[英]lxml findall div and span tags

原文 2013-03-15 12:41:21 8 2 python/ html-parsing/ lxml

如何找到保留订单的所有div和span标签。使用BeautifulSoup非常简单： soup.findAll(name=['span', 'div']) ，但我最近切换到lxml，因为它比BeautifulSoup快得多。

2 个解决方案

import lxml.html as LH
content = '''\
<tr>
<div>idend</div>
<span>Green<\span>
<tr>
'''
root = LH.fromstring(content)
for tag in root.xpath('//*[self::div or self::span]'):
    print(tag)

产量

<Element div at 0xb751f23c>
<Element span at 0xb751f11c>

import lxml.html
from lxml.cssselect import CSSSelector
content = result.read()
page_html = lxml.html.fromstring(content)

elements = page_html.xpath('//*[self::div or self::span]')

要么

sd_selector = CSSSelector('span,div')
elements = sd_selector(page_html)

使用 xpath 使用 lxml findall() 查找多种类型的标签？

[英]Finding multiple types of tags with lxml findall() with xpath?

Python lxml提取span标签的值

[英]Python lxml extract value of span tags

lxml - 在findall（）中使用正则表达式按属性值查找标签

[英]lxml - using regex in findall() to find tags by attribute values

lxml xpath-获取跨度标签内的所有文本

[英]lxml xpath - Get all text within span tags

在Findall，Lxml中添加OR条件

[英]Adding an OR condition in Findall, Lxml

与xpath，Lxml等效的Findall

[英]Findall equivalent for xpath , Lxml

lxml findall和逆序

[英]lxml findall and reverse order

控制搜索深度findall Lxml

[英]Control search depth findall Lxml

lxml findall 语法错误：谓词无效

[英]lxml findall SyntaxError: invalid predicate

Python LXML findall 然后给出路径

[英]Python LXML findall then give a path

暂无

暂无

声明:本站的技术帖子网页，遵循CC BY-SA 4.0协议，如果您需要转载，请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.

相关问题 使用 xpath 使用 lxml findall() 查找多种类型的标签？ Python lxml提取span标签的值 lxml - 在findall（）中使用正则表达式按属性值查找标签 lxml xpath-获取跨度标签内的所有文本在Findall，Lxml中添加OR条件与xpath，Lxml等效的Findall lxml findall和逆序控制搜索深度findall Lxml lxml findall 语法错误：谓词无效 Python LXML findall 然后给出路径

相关标签

粤ICP备18138465号 © 2020-2024 STACKOOM.COM