简体繁体中英

Using BeautifulSoup to extract <span> WITH tags

原文 2015-04-02 19:20:20 3 1 python/ beautifulsoup

How can I properly extract the value of a <span> WITH the <br/> tags?

ie

from bs4 import BeautifulSoup

html_text = '<span id="spamANDeggs">This is<br/>what<br/>I want. WITH the <br/> tags.</span>'

soup = BeautifulSoup(html_text)

text_wanted = soup.find('span',{'id':'spamANDeggs'}).GetText(including<br/>...)

1 answers

You can use decode_contents() method just like this:

from bs4 import BeautifulSoup

html_text = '<span id="spamANDeggs">This is<br/>what<br/>I want. WITH the <br/> tags.</span>'
soup = BeautifulSoup(html_text)
text_wanted = soup.find('span', {'id': 'spamANDeggs'}).decode_contents(formatter="html")

Now text_wanted equals "This is<br/>what<br/>I want. WITH the <br/> tags."

Extracting span tags using Beautifulsoup

Python BeautifulSoup extract text from SPAN and A tags

Using BeautifulSoup to extract span text

Parsing nested span tags using beautifulsoup

How to scrape between span tags using beautifulsoup

Text extract from two consequent span tags with beautifulsoup

BeautifulSoup: How to extract text encapsulated in multiple div/span/id tags

Using BeautifulSoup to extract text without tags

Python: failed to get all the text in all the <span> tags using BeautifulSoup

Trying to find all of the text between multiple span tags using Beautifulsoup

暂无

暂无

The technical post webpages of this site follow the CC BY-SA 4.0 protocol. If you need to reprint, please indicate the site URL or the original address.Any question please contact:yoyou2525@163.com.

Related Question Extracting span tags using Beautifulsoup Python BeautifulSoup extract text from SPAN and A tags Using BeautifulSoup to extract span text Parsing nested span tags using beautifulsoup How to scrape between span tags using beautifulsoup Text extract from two consequent span tags with beautifulsoup BeautifulSoup: How to extract text encapsulated in multiple div/span/id tags Using BeautifulSoup to extract text without tags Python: failed to get all the text in all the <span> tags using BeautifulSoup Trying to find all of the text between multiple span tags using Beautifulsoup

Related Tags

粤ICP备18138465号 © 2020-2024 STACKOOM.COM