i am unable to scrape the phone number . Can anyone help me

Question

Here is the html content of that part.

<p class="contact-info " onclick="_ct('clntphn', 'lspg');">
  <i class="contactNo spriteImg"></i>
  <span><a><b>+(91)-80-30805680</b></a></span>
</p>

I have tried extracting this by:

soup.find('p',{'class':'contact-info'})

but in vain . It is unable to do the same.

Answer 1

You can use the find method with tag:

from bs4 import BeautifulSoup as soup
s = soup(html_data, "lxml")
number = s.find('b').text

Output:

u'+(91)-80-30805680'

To get the first instance, you can try this:

new_s = s.findAll("p", {"class":"contact-info"})
new_data = [i.text for i in new_s]
print(new_data[0].replace("\n", ''))

Output:

+(91)-80-30805680

Answer 2

您需要将您的 find 方法加入链中：

soup.find('p',{'class':'contact-info'}).find('b').text

Answer 3

I would use a CSS selector. As there is some whitespace around it, we can easily strip it.

soup.select_one('p.contact-info').get_text(strip=True)