Unable to open the link in proxy

Question

I am actually using a proxy to scrape data from some sites but the problem is sometimes some proy url returns nothing and programmed stopped after a few tries, I need some logic to overcome this issue so that even if IP does not respond program should renew the IP and try to open the page again, I am using TOR as a proxy in python.

Here is my website opening code:

mainPage = requests.get("http://proxy_IP/?link=http://example.com/")
mainTree = html.fromstring(mainPage.text)

Answer 1

You can simply put your code in while loop and give it certain condition, when that condition becomes TRUE, it means your page is properly opened.

mainPage = requests.get("http://proxy_IP/?link=http://example.com/")
mainTree = html.fromstring(mainPage.text)

mainTree
while (mainTree.xpath('boolean(some_xpath_to_be_true])') != True):
    mainPage = requests.get("http://proxy_IP/?link=http://example.com/")
    mainTree = html.fromstring(mainPage.text)

Now your mainTree contains the page source correctly.

Unable to open the link in proxy

Question

1 answers

solution1
0 ACCPTED 2016-07-27 10:57:51

Unable to open the link in proxy

Question

1 answers

solution1 0 ACCPTED 2016-07-27 10:57:51

solution1
0 ACCPTED 2016-07-27 10:57:51