Python web data parse

Question

I've been trying to parse data from a website that requires a login, so I've been using the code below:

import requests
from lxml import html
session_requests = requests.session()
payload = {
    "login-username": "myusername", 
    "login-password": "mypassword"
}
login_url = "https://oprewards.com/login"
result = session_requests.get(login_url)

tree = html.fromstring(result.text)
result = session_requests.post(
    login_url, 
    data = payload, 
    headers = dict(referer=login_url)
)
url = 'https://oprewards.com/profile'
result = session_requests.get(
    url, 
    headers = dict(referer = url)
)

print(result.content)

But it isn't working. I'm not very experienced with Python, so I'd appreciate any help. Thanks.

Answer 1

Thanks for asking this question.

One thing to check right off the bat is where the login actually occurs. If you open your browser's network tab while logging in, you'll see that the form doesn't send its request to the page shown to the user, but to a different URL:

https://oprewards.com/ASEngine/ASAjax.php

I think once you investigate which URLs your data is actually sent to, you can construct a more accurate request to log yourself in.
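As a rough sketch of what that could look like, the snippet below posts the credentials to the AJAX endpoint instead of the login page. The form field names are simply copied from the question and are an assumption: the real request shown in the network tab may use different names or extra fields (an action name, a token, a hashed password), so copy those from the browser rather than trusting this example. The helper names `build_login_payload` and `fetch_profile` are made up for illustration.

```python
import requests

# Endpoint observed in the browser's network tab (from the answer above).
AJAX_LOGIN_URL = "https://oprewards.com/ASEngine/ASAjax.php"
LOGIN_PAGE_URL = "https://oprewards.com/login"
PROFILE_URL = "https://oprewards.com/profile"


def build_login_payload(username, password):
    """Build the form data for the login request.

    ASSUMPTION: the field names are taken from the question's code. Check the
    real request in the network tab and adjust them (and add any extra fields).
    """
    return {"login-username": username, "login-password": password}


def fetch_profile(username, password):
    """Log in through the AJAX endpoint, then return the profile page HTML."""
    with requests.Session() as session:
        # Load the login page first so the session picks up any initial cookies.
        session.get(LOGIN_PAGE_URL)

        # Send the credentials to the URL the login form really uses.
        response = session.post(
            AJAX_LOGIN_URL,
            data=build_login_payload(username, password),
            headers={
                "Referer": LOGIN_PAGE_URL,
                # Commonly sent by browser AJAX calls; may not be required here.
                "X-Requested-With": "XMLHttpRequest",
            },
        )
        response.raise_for_status()

        # The session now carries whatever cookies the login response set.
        profile = session.get(PROFILE_URL)
        profile.raise_for_status()
        return profile.text
```

Calling `print(fetch_profile("myusername", "mypassword"))` would then print the profile page, provided the payload matches what the site expects. If the output is still the login page, compare the request in the network tab with this one field by field.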

However, if you want to log in exactly as a normal user would (that is, by entering a username and password and clicking the "Login" button), I'd suggest using a browser-automation tool such as Selenium WebDriver for Python: https://selenium-python.readthedocs.io/getting-started.html
