
How to read Excel files from an online zip link in pandas

I have an online link that is updated with a new zip file every day. The zip file contains a folder, and inside that folder is the xls file I want to read into pandas.

I tried using the zipfile module:

zf = zipfile.ZipFile('http://xxxxx/xxxx/xxxxx/xxxxx.zip')

But it gave an error:

IOError: [Errno 22] invalid mode ('rb') or filename: ' http://xxxxx/xxxx/xxxxx/xxxxx.zip '

Also, only read_csv seems to have a compression parameter.

How do I achieve this?

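zipfile.ZipFile expects a local path or a file-like object, not a URL, which is why the call fails. You can use urllib and io to download the archive into memory first:

import io
import zipfile
from urllib.request import urlopen
# from urllib import urlopen  # for Python 2

url = 'http://xxxxx/xxxx/xxxxx/xxxxx.zip'  # the zip link from the question

# Download the whole archive and wrap the bytes in a seekable in-memory file
zf = zipfile.ZipFile(io.BytesIO(urlopen(url).read()))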
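To get from the ZipFile object to a DataFrame, one possible approach is to find the Excel member inside the archive and hand its bytes to pd.read_excel. The following is only a sketch: the helper names find_excel_member and read_excel_from_zip_url are made up for illustration, and it assumes the archive contains at least one .xls or .xlsx file and that a suitable Excel engine (xlrd for .xls, openpyxl for .xlsx) is installed.

```python
import io
import zipfile
from urllib.request import urlopen


def find_excel_member(zf):
    """Return the name of the first .xls/.xlsx entry in an open ZipFile.

    Hypothetical helper: member names include the folder prefix,
    e.g. 'folder/report.xls', so files inside folders are found too.
    """
    for name in zf.namelist():
        if name.lower().endswith((".xls", ".xlsx")):
            return name
    raise FileNotFoundError("no Excel file found in the archive")


def read_excel_from_zip_url(url):
    """Download a zip archive and load its first Excel file into a DataFrame."""
    # Imported here so find_excel_member stays usable without pandas installed.
    import pandas as pd

    with urlopen(url) as response:
        buffer = io.BytesIO(response.read())

    with zipfile.ZipFile(buffer) as zf:
        member = find_excel_member(zf)
        # Read the member fully into memory so pandas gets a seekable buffer.
        return pd.read_excel(io.BytesIO(zf.read(member)))
```

Calling read_excel_from_zip_url(url) with the link from the question should then return the sheet as a DataFrame. If the archive holds several Excel files, zf.namelist() can be inspected to pick a specific one instead of the first match.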

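As another option, if the archive held a single CSV file rather than an xls file, you could pass the compression argument to pd.read_csv. Note that compression='gzip' only applies to gzip-compressed files; for a zip archive the value would be compression='zip'.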
