Web scraping with Scrapy and BeautifulSoup

Question

<h2 class="result-item-name" data-nid="117" data-localisation="25.88872, -80.12488">
  Bal Harbour    </h2>

Hi everyone, I'm trying to collect the 'data-nid' and the 'data-localisation' but when I write my code :

'geocoordinates':['class','result-item-name','data-localisation']

I always get a None response.

Can you guys help me ? I am new to BeautifulSoup and I am not at ease with it.

Thank you very much !

Answer 1

In BeautifulSoup you can access the attributes using key-value. Just like a dictionary

Ex:

from bs4 import BeautifulSoup
html = """<h2 class="result-item-name" data-nid="117" data-localisation="25.88872, -80.12488">Bal Harbour    </h2>"""

soup = BeautifulSoup(html, "html.parser")
h2 = soup.find("h2", class_= "result-item-name")
print(h2["data-nid"])
print(h2["data-localisation"])

Output:

117
25.88872, -80.12488

Answer 2

from bs4 import BeautifulSoup

html = """
<h2 class="result-item-name" data-nid="117" data-localisation="25.88872, -80.12488">
Bal Harbour
</h2>
"""
soup = BeautifulSoup(html, 'html.parser')

h2 = soup.find('h2', {'class': 'result-item-name'})
print(h2['data-nid'])
print(h2['data-localisation'])

Output

117
25.88872, -80.12488

Web scraping with Scrapy and BeautifulSoup

Question

2 answers

solution1
0 ACCPTED 2018-07-09 09:50:33

solution2
0 2018-07-09 09:50:45

Web scraping with Scrapy and BeautifulSoup

Question

2 answers

solution1 0 ACCPTED 2018-07-09 09:50:33

solution2 0 2018-07-09 09:50:45

solution1
0 ACCPTED 2018-07-09 09:50:33

solution2
0 2018-07-09 09:50:45