lxml findall div and span tags

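How can I find all div and span tags while preserving their document order? With BeautifulSoup this is simple: soup.findAll(name=['span', 'div']), but I recently switched to lxml because it is much faster than BeautifulSoup.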

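import lxml.html as LH

# Sample markup: a table row holding one div and one span.
content = '''\
<tr>
<div>idend</div>
<span>Green</span>
</tr>
'''
root = LH.fromstring(content)
# One XPath query matches both tag names; results come back in document order.
for tag in root.xpath('//*[self::div or self::span]'):
    print(tag)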

yields

<Element div at 0xb751f23c>
<Element span at 0xb751f11c>
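
To check that this XPath expression returns matches in document order, including nested ones, here is a minimal self-contained sketch. The sample markup and the helper name tags_in_order are made up for illustration; only LH.fromstring and the xpath query come from the answer above.

```python
import lxml.html as LH

# Hypothetical sample page. It starts with <html>, so fromstring()
# returns the document root rather than a wrapper element.
SAMPLE = """\
<html><body>
<div>first</div>
<span>second</span>
<p>skipped</p>
<div>third <span>nested</span></div>
</body></html>
"""


def tags_in_order(html_text):
    """Return (tag, text) pairs for every div and span, in document order."""
    root = LH.fromstring(html_text)
    matches = root.xpath('//*[self::div or self::span]')
    return [(el.tag, (el.text or '').strip()) for el in matches]


if __name__ == '__main__':
    for tag, text in tags_in_order(SAMPLE):
        print(tag, text)
```

The p element is skipped, and the nested span comes right after the div that contains it, which is the order in which the tags open in the source.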

import lxml.html
from lxml.cssselect import CSSSelector  # requires the cssselect package

# result is assumed to be an already-open response or file object
content = result.read()
page_html = lxml.html.fromstring(content)

elements = page_html.xpath('//*[self::div or self::span]')

or

sd_selector = CSSSelector('span,div')
elements = sd_selector(page_html)
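
A further option, not taken from the answers above, is Element.iter(), which in lxml 3.0 and later accepts several tag names and walks the tree in document order without needing cssselect. The sketch below uses a made-up sample page and compares it with the XPath form; treat it as an illustration, not a benchmark.

```python
import lxml.html

# Hypothetical sample page used only for this sketch.
SAMPLE = """\
<html><body>
<span>a</span>
<div>b</div>
<span>c</span>
</body></html>
"""

page_html = lxml.html.fromstring(SAMPLE)

# iter() with multiple tag names yields matching elements in document order.
via_iter = [el.tag for el in page_html.iter('span', 'div')]

# The XPath form from the answer, for comparison.
via_xpath = [el.tag for el in page_html.xpath('//*[self::div or self::span]')]

print(via_iter)
print(via_xpath)
```

Note that iter() also yields the element it is called on if that element matches, so call it on the document root when only descendants are wanted.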
