简体繁体中英

Select element based on text inside Beautiful Soup

原文 2018-08-07 17:39:26 1 2 python/ beautifulsoup

I scrapped a website and I want to find an element based on the text written in it. Let's say below is the sample code of the website:

code = bs4.BeautifulSoup("""<div>
<h1>Some information</h1>
<p>Spam</p>
<p>Some Information</p>
<p>More Spam</p>
</div>""")

I want some way to get a p element that has as a text value Some Information . How can I select an element like so?

2 answers

Just use text parameter:

code.find_all("p", text="Some Information")

If you need only the first element than use find instead of find_all .

You could use text to search all tags matching the string

import BeautifulSoup as bs
import re
code = bs.BeautifulSoup("""<div>
<h1>Some information</h1>
<p>Spam</p>
<p>Some Information</p>
<p>More Spam</p>
</div>""")


for elem in code(text='Some Information'):
    print elem.parent

Using page text to select `html` element using`Beautiful Soup`

Python beautiful soup select text

Python Beautiful Soup 4 Get Children of Element with .select()

Beautiful Soup find href inside a child element

Extracting text :after an element with Beautiful Soup

Python beautiful soup select text in attrs

Targetting a text element with no tag in Beautiful Soup

Python Beautiful Soup: get text from element

Beautiful Soup finding href based on hyperlink Text

Beautiful Soup (Python) not seeing text inside of span

暂无

暂无

The technical post webpages of this site follow the CC BY-SA 4.0 protocol. If you need to reprint, please indicate the site URL or the original address.Any question please contact:yoyou2525@163.com.

Related Question Using page text to select `html` element using`Beautiful Soup` Python beautiful soup select text Python Beautiful Soup 4 Get Children of Element with .select() Beautiful Soup find href inside a child element Extracting text :after an element with Beautiful Soup Python beautiful soup select text in attrs Targetting a text element with no tag in Beautiful Soup Python Beautiful Soup: get text from element Beautiful Soup finding href based on hyperlink Text Beautiful Soup (Python) not seeing text inside of span

Related Tags

粤ICP备18138465号 © 2020-2024 STACKOOM.COM