How to download text file from website using Python?

Question

I need to write a function that downloads and stores the today's list of pre-release domains .txt file from http://www.namejet.com/pages/downloads.aspx. So as today is 8th of October you want to get the file "Monday, October 08, 2012". Tried with requests but didn't work. I'm having trouble because the file is not stored on a fixed URL but is hidden behind some Javascript.

Answer 1

This one's a little tricky as you're dealing with ASP.NET's postback system. If this is for anything other than a personal script, I'd be wary as you're effectively not only using another site's data, but reverse engineering their software as well (however, IANAL and have no idea about legalities around these issues in web systems).

What you're going to want to do is check the POST data (using Firebug, Chrome developer tools, etc) and look for the __EVENTTARGET and __VIEWSTATE attributes of the form object. You'll have to decode the __VIEWSTATE to be readable (check out http://ignatu.co.uk/ViewStateDecoder.aspx ). From there, I think you should be able to figure out how to get the data you're looking for.

From Python, it's as easy as:

from urllib2 import urlopen
from urllib import urlencode

data = urlopen('url', urlencode({
    '__VIEWSTATE': 'foo',
    '__EVENTTARGET': 'bar',
})).read()

Answer 2

Actually you get text file in response to a POST request with several base64-encoded request parameters. Feel free to play with it

use Firebug or any other debug tool to see the POST content and parameters

How to download text file from website using Python?

Question

2 answers

solution1
2 ACCPTED 2012-10-08 06:25:33

solution2
1 2012-10-08 05:30:38

How to download text file from website using Python?

Question

2 answers

solution1 2 ACCPTED 2012-10-08 06:25:33

solution2 1 2012-10-08 05:30:38

solution1
2 ACCPTED 2012-10-08 06:25:33

solution2
1 2012-10-08 05:30:38