簡體 English 中英

lxml findall div和span標簽

[英]lxml findall div and span tags

原文 2013-03-15 12:41:21 5 2 python/ html-parsing/ lxml

如何找到保留訂單的所有div和span標簽。使用BeautifulSoup非常簡單： soup.findAll(name=['span', 'div']) ，但我最近切換到lxml，因為它比BeautifulSoup快得多。

2 個解決方案

import lxml.html as LH
content = '''\
<tr>
<div>idend</div>
<span>Green<\span>
<tr>
'''
root = LH.fromstring(content)
for tag in root.xpath('//*[self::div or self::span]'):
    print(tag)

產量

<Element div at 0xb751f23c>
<Element span at 0xb751f11c>

import lxml.html
from lxml.cssselect import CSSSelector
content = result.read()
page_html = lxml.html.fromstring(content)

elements = page_html.xpath('//*[self::div or self::span]')

要么

sd_selector = CSSSelector('span,div')
elements = sd_selector(page_html)

使用 xpath 使用 lxml findall() 查找多種類型的標簽？

[英]Finding multiple types of tags with lxml findall() with xpath?

Python lxml提取span標簽的值

[英]Python lxml extract value of span tags

lxml - 在findall（）中使用正則表達式按屬性值查找標簽

[英]lxml - using regex in findall() to find tags by attribute values

lxml xpath-獲取跨度標簽內的所有文本

[英]lxml xpath - Get all text within span tags

在Findall，Lxml中添加OR條件

[英]Adding an OR condition in Findall, Lxml

與xpath，Lxml等效的Findall

[英]Findall equivalent for xpath , Lxml

lxml findall和逆序

[英]lxml findall and reverse order

控制搜索深度findall Lxml

[英]Control search depth findall Lxml

lxml findall 語法錯誤：謂詞無效

[英]lxml findall SyntaxError: invalid predicate

Python LXML findall 然后給出路徑

[英]Python LXML findall then give a path

暫無

暫無

聲明:本站的技術帖子網頁，遵循CC BY-SA 4.0協議，如果您需要轉載，請注明本站網址或者原文地址。任何問題請咨詢:yoyou2525@163.com.

相關問題 使用 xpath 使用 lxml findall() 查找多種類型的標簽？ Python lxml提取span標簽的值 lxml - 在findall（）中使用正則表達式按屬性值查找標簽 lxml xpath-獲取跨度標簽內的所有文本在Findall，Lxml中添加OR條件與xpath，Lxml等效的Findall lxml findall和逆序控制搜索深度findall Lxml lxml findall 語法錯誤：謂詞無效 Python LXML findall 然后給出路徑

相關標簽

粵ICP備18138465號 © 2020-2024 STACKOOM.COM