How to fetch data from Skyscanner?

Question

I am new to Python and there has been a request for grabbing the dynamic data from www.skyscanner.net .

Can someone guide me on doing so?

import requests
import lxml.html as lh

url = 'http://www.skyscanner.net/transport/flights/sin/lhr/131231/140220/'
response = requests.post(url)

tree = lh.document_fromstring(response.content)
print(tree);

All I did was to find the pattern in URL and attempt to grab from there. However, no data were successfully pulled. I learnt that Python was the best language in doing such task, but the library seems too huge and I do not know where to start form.

Answer 1

My name is Piotr - I work for Skyscanner - in Data Acquisition team - which I assume that you are applying to join :-) As this is a part of your task I wouldn't like to give you a straight answer , however you might consider:

Understand how our site works - how the requests are built and what data you can find in the http response.
You could use some libraries that will help you parsing xml/json responses

I think that's all I can say :-)

Cheers, piotr

How to fetch data from Skyscanner?

Question

1 answers

solution1
0 2013-11-19 16:29:51

How to fetch data from Skyscanner?

Question

1 answers

solution1 0 2013-11-19 16:29:51

solution1
0 2013-11-19 16:29:51