Beautiful Soup not finding CSS selector

Question

I am using Beautiful Soup and Requests to try to scrape data from a website, and am having difficulties pulling data with a certain CSS selector. I am using SelectorGadget ( https://selectorgadget.com/ ) to identify the selector I'm looking for at this site: https://www.oddsshark.com/ncaab/odds .op-bovada.\\lv is returned by it. However, this does not work, and neither does escaping the backslash. I've tried multiple variants of this, along with hunting around online, and haven't had any luck. I'm a bit of a python beginner, so I have a hunch I'm overlooking something obvious.

This code reproduces the issue that I'm running into.

import requests, bs4
res = requests.get('https://www.oddsshark.com/ncaab/odds')
odds = bs4.BeautifulSoup(res.text, 'html.parser')

# This, another identifier from the same site, works fine.
print(str(len(odds.select('.op-opening'))))

# However, this does not.
print(str(len(odds.select('.op-bovada.\lv'))))
print(str(len(odds.select('.op-bovada.\\lv'))))

I've had no problems doing this in R - it just needed double backslashes, so I know there's data there, but I'm beating my head against a wall in Python at the moment.

Answer 1

Why not use beautiful soups find method?

print(len(odds.find_all(class_='op-bovada.lv')))

The problem is arising from the way that select parses the string being passed in. the . identifies a class, however in this case the . is part of the class so the parser does not interpret it correctly. By passing the . into the class argument of find_all instead you get the desired effect.

Beautiful Soup not finding CSS selector

Question

1 answers

solution1
1 ACCPTED 2018-12-03 18:06:56

Beautiful Soup not finding CSS selector

Question

1 answers

solution1 1 ACCPTED 2018-12-03 18:06:56

solution1
1 ACCPTED 2018-12-03 18:06:56