Selecting the second child in Beautiful Soup

Question

Let's say I have:

<div>
    <p>this is some text</p>
    <p>...and this is some other text</p>
</div>

How can I retrieve the text from the second paragraph with Beautiful Soup?

Answer 1

You can use a CSS selector to do this:

>>> from bs4 import BeautifulSoup
>>> soup = BeautifulSoup("""<div>
... <p>this is some text</p>
... <p>...and this is some other text</p>
... </div>""", "html.parser")
>>> soup.select('div > p')[1].get_text(strip=True)
'...and this is some other text'
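Indexing the result of select() raises IndexError when the div has fewer paragraphs than expected. The following is a minimal sketch of a guarded version of the same selector lookup; the helper name nth_match_text and its parameters are hypothetical and are not part of Beautiful Soup.

```python
from bs4 import BeautifulSoup


def nth_match_text(html, n, selector="div > p"):
    """Return the stripped text of the n-th (0-based) match, or None if absent.

    Hypothetical helper for illustration; only select() and get_text()
    come from Beautiful Soup itself.
    """
    soup = BeautifulSoup(html, "html.parser")
    matches = soup.select(selector)  # list of matching tags, in document order
    if not 0 <= n < len(matches):
        return None
    return matches[n].get_text(strip=True)


html = """<div>
    <p>this is some text</p>
    <p>...and this is some other text</p>
</div>"""

second = nth_match_text(html, 1)   # second paragraph
missing = nth_match_text(html, 5)  # no sixth paragraph, so None
print(second)
print(missing)
```

Returning None instead of raising is a design choice for this sketch; raising a descriptive error may suit stricter scrapers better.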

Answer 2

You can use the nth-of-type pseudo-class:

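from bs4 import BeautifulSoup

h = """<div>
    <p>this is some text</p>
    <p>...and this is some other text</p>
</div>"""

# Name the parser explicitly to avoid the "no parser was specified" warning
soup = BeautifulSoup(h, "html.parser")

# Second <p> among its siblings of the same type
print(soup.select_one("div p:nth-of-type(2)").text)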
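It may help to see how nth-of-type differs from nth-child once the div contains something other than paragraphs. The sketch below assumes an extra h2 heading that is not in the question's HTML, added purely for illustration.

```python
from bs4 import BeautifulSoup

# The <h2> is an assumption added for this example; the question's div has only <p> tags.
html = """<div>
    <h2>a heading</h2>
    <p>this is some text</p>
    <p>...and this is some other text</p>
</div>"""

soup = BeautifulSoup(html, "html.parser")

# nth-of-type(2): the second <p> among the <p> siblings
by_type = soup.select_one("div > p:nth-of-type(2)").get_text(strip=True)

# nth-child(2): a <p> that is the second element child of its parent,
# which here is the first paragraph because the <h2> comes first
by_child = soup.select_one("div > p:nth-child(2)").get_text(strip=True)

print(by_type)
print(by_child)
```

With only p tags inside the div, as in the question, both selectors return the same element.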

Answer 3

# soup is the BeautifulSoup object built from the question's HTML
paragraphs = soup.find('div').find_all('p')

In : paragraphs[1].text

Out : '...and this is some other text'

Or you can use findChildren (a legacy alias of find_all) directly:

# With no arguments this returns every descendant tag of the div
children = soup.find('div').findChildren()
for i, child in enumerate(children):
    if i == 1:
        print(child.text)
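Because findChildren() with no arguments walks all descendants, index 1 stops being the second paragraph as soon as the first paragraph contains a nested tag. The sketch below assumes a b tag inside the first paragraph, which is not in the question's HTML, and uses find_all with recursive=False to stay at the direct-child level.

```python
from bs4 import BeautifulSoup

# The nested <b> is an assumption added to show the pitfall.
html = """<div>
    <p>this is <b>some</b> text</p>
    <p>...and this is some other text</p>
</div>"""

soup = BeautifulSoup(html, "html.parser")
div = soup.find("div")

# All descendant tags, in document order: p, b, p
descendant_names = [tag.name for tag in div.find_all(True)]

# Only the <p> tags that are direct children of the div
direct_paragraphs = div.find_all("p", recursive=False)
second_text = direct_paragraphs[1].get_text(strip=True)

print(descendant_names)
print(second_text)
```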

Answer 4

You could solve this with gazpacho:

from gazpacho import Soup

html = """\
<div>
    <p>this is some text</p>
    <p>...and this is some other text</p>
</div>
"""

soup = Soup(html)
soup.find('p')[1].text

Which would output:

'...and this is some other text'

4 answers

solution1
28 ACCEPTED 2016-07-06 21:24:32

solution2
14 2016-07-06 21:28:06

solution3
3 2016-07-06 21:20:14

solution4
-1 2020-10-09 22:36:31
