Reading an xls file from a URL in Python

Question

I'm trying to read data in Python from this link: https://drive.google.com/file/d/16cp23cJxeyUfnBHMp-sNCuFNQxe8cqOV/view

I've tried this:

import pandas as pd

path = pd.read_excel('https://drive.google.com/file/d/16cp23cJxeyUfnBHMp-sNCuFNQxe8cqOV/view')

That returned this error:

XLRDError: Unsupported format, or corrupt file: Expected BOF record; found b'<!DOCTYP'

Then I tried reading it as CSV:

path = pd.read_csv('https://drive.google.com/file/d/16cp23cJxeyUfnBHMp-sNCuFNQxe8cqOV/view')

That returned:

ParserError: Error tokenizing data. C error: Expected 298 fields in line 133, saw 440

Finally, I tried this:

path = pd.read_csv("https://drive.google.com/file/d/16cp23cJxeyUfnBHMp-sNCuFNQxe8cqOV/view")

This did read some data, but not what I expected after looking at the file behind the link (283 rows, 7 columns). Photo below.

Error reading data

Any ideas on how I could read the data?

Thanks

Answer 1

The /view link returns the Google Drive viewer page (HTML) rather than the file itself, which is why the error shows b'<!DOCTYP'. Use this example to download the Excel file from Google Drive (the fileid is the ID after the /d/ part of your URL):

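import pandas as pd

# File ID: the part after /d/ in the share URL
fileid = "16cp23cJxeyUfnBHMp-sNCuFNQxe8cqOV"

# The uc?export=download URL serves the file itself, not the viewer page
df = pd.read_excel(
    "https://drive.google.com/uc?export=download&id={fileid}".format(
        fileid=fileid
    ),
    skiprows=17,  # skip the first 17 rows of the sheet
)
print(df)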

Prints:

     Unnamed: 0                                         Unnamed: 1                                         Unnamed: 2 Petajoules Gigajoules           %
0           NaN                                        Afghanistan                                        Afghanistan        321         10   78.669280
1           NaN                                            Albania                                            Albania        102         35  100.000000
2           NaN                                            Algeria                                            Algeria       1959         51    0.551010
3           NaN                                     American Samoa                                     American Samoa        ...        ...    0.641026
4           NaN                                            Andorra                                            Andorra          9        121   88.695650
5           NaN                                             Angola                                             Angola        642         27   70.909090

...and so on.
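
If you have several share links, the ID can be extracted programmatically instead of by hand. This is only a minimal sketch: the helper name drive_download_url and the regular expression are assumptions of this example, not part of pandas or any Google API, and it assumes the link has the /file/d/<id>/view shape shown in the question.

```python
import re

# Hypothetical helper (not from pandas or Google): matches the ID that
# follows "/d/" in a Google Drive share link.
_DRIVE_ID = re.compile(r"/d/([A-Za-z0-9_-]+)")


def drive_download_url(share_url: str) -> str:
    """Turn a .../file/d/<id>/view share link into a uc?export=download URL."""
    match = _DRIVE_ID.search(share_url)
    if match is None:
        raise ValueError(f"no file ID found in {share_url!r}")
    return f"https://drive.google.com/uc?export=download&id={match.group(1)}"


share_url = "https://drive.google.com/file/d/16cp23cJxeyUfnBHMp-sNCuFNQxe8cqOV/view"
download_url = drive_download_url(share_url)
print(download_url)
```

The resulting download_url can then be passed to pd.read_excel in place of the hand-built string above.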

Answer 1 (accepted, 2021-05-09 00:24:42)
