pandas.read_excel 列作为具有 NA 值的 int64

Question

I'm trying to import an excel spreadsheet onto pandas using:我正在尝试使用以下方法将 excel 电子表格导入到 Pandas 中：

df= pd.read_excel(excel_file)

This reads integer columns fine as int64, as long as they don't have missing or nan values.只要它们没有缺失值或 nan 值，这会将整数列读取为 int64。 When they do have nan values it is read as float64.当它们确实有 nan 值时，它被读取为 float64。

I have tried using:我试过使用：

df= pd.read_excel(excel_file, converters={'column_x': np.int64, 'column_y': np.int64})

and和

df= pd.read_excel(excel_file, dtype={'column_x': np.int64, 'column_y': np.int64})

I'd like to keep the missing values as nan but the other values as int.我想将缺失值保留为 nan，而将其他值保留为 int。 Is this possible?这可能吗？

Thanks谢谢

Answer 1

是的，在带有Nullable 整数数据类型的Pandas 0.24+ 中：

df= pd.read_excel(excel_file, dtype={'column_x': 'Int64', 'column_y': 'Int64'})

pandas.read_excel 列作为具有 NA 值的 int64

问题描述

1 个解决方案

解决方案1
2 已采纳 2020-02-05 14:04:32

pandas.read_excel 列作为具有 NA 值的 int64

问题描述

1 个解决方案

解决方案1 2 已采纳 2020-02-05 14:04:32

解决方案1
2 已采纳 2020-02-05 14:04:32