简体   繁体   中英

Web Scraping using webbot

I am trying to create a simple program to login to a webpage using my credentials and grab the total amount left of flex dollars I have in my account for college. Starting at the log in page, I log in, and am redirected to the page of interest, and I simply want to grab that dollar amount and perform some manipulation on it.

I am currently using webbot for the login portion of this, which works, I have just redacted the credentials:

from webbot import Browser

web = Browser()
web.go_to('insert my url here')
#enter your username and password in the into fields below
web.type('insert email here', into='username')
web.type('insert password here', into='password')
web.click('Login', tag='span')

This works perfectly so far, creating an instance of Chrome and logging into the page I want to grab the dollar amount from. I imagine I might want to proceed using urllib, however, I don't think urllib benefits from my current logged in instance of Chrome. How can I work around this and grab a simple html element from the page?

You first need to get the html source code for the current webpage. You can do that using get_page_source() . You then need to pass the html source code to beautifulsoup

from webbot import Browser
from bs4 import BeautifulSoup
import time

web = Browser()
web.go_to('insert my url here')
#enter your username and password in the into fields below
web.type('insert email here', into='username')
web.type('insert password here', into='password')
web.click('Login', tag='span')
time.sleep(5)

content = web.get_page_source()
soup = BeautifulSoup(content)

#You can now find the element you want
samples = soup.find_all("a", "item-title")

The technical post webpages of this site follow the CC BY-SA 4.0 protocol. If you need to reprint, please indicate the site URL or the original address.Any question please contact:yoyou2525@163.com.

 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM