pandas read_csv 刪除空白行

Question

我在定義每列的數據類型時將 CSV 文件作為DataFrame讀取。 如果 CSV 文件中有空行，此代碼會給出錯誤。 如何在沒有空白行的情況下讀取 CSV？

dtype = {'material_id': object, 'location_id' : object, 'time_period_id' : int, 'demand' : int, 'sales_branch' : object, 'demand_type' : object }

df = pd.read_csv('./demand.csv', dtype = dtype)

我想到了一種解決方法來做這樣的事情，但不確定這是否是有效的方法：

df=pd.read_csv('demand.csv')
df=df.dropna()

然后重新定義df的列數據類型。

編輯：代碼-

import pandas as pd
dtype1 = {'material_id': object, 'location_id' : object, 'time_period_id' : int, 'demand' : int, 'sales_branch' : object, 'demand_type' : object }
df = pd.read_csv('./demand.csv', dtype = dtype1)
df

錯誤 - ValueError: Integer column has NA values in column 2

我的 CSV 文件的快照 -

Answer 1

這對我有用。

def delete_empty_rows(file_path, new_file_path):
    data = pd.read_csv(file_path, skip_blank_lines=True)
    data.dropna(how="all", inplace=True)
    data.to_csv(new_file_path, header=True)

Answer 2

像這樣嘗試：

data = pd.read_table(filenames,skip_blank_lines=True, a_filter=True)

Answer 3

解決方案可能是：

data = pd.read_table(filenames,skip_blank_lines=True, na_filter=True)

Answer 4

嘗試.csv

s,v,h,h
1,2,3,4

4,5,6,7



9,10,1,2

Python代碼

df = pd.read_csv('try.csv', delimiter=',')
print(df)

輸出

   s   v  h  h.1
0  1   2  3    4
1  4   5  6    7
2  9  10  1    2

Answer 5

我不確定它是否有效，但它有效。 此代碼在讀取 csv 時不會加載 nan 值。

df=pd.read_csv('demand.csv').dropna()

pandas read_csv 刪除空白行

問題描述

5 個解決方案

解決方案1
4 2019-07-20 20:10:57

解決方案2
1 2018-09-02 09:12:26

解決方案3
0 2020-06-29 15:46:27

解決方案4
-3 2018-09-02 09:13:08

解決方案5
-3 2020-07-12 17:50:42

pandas read_csv 刪除空白行

問題描述

5 個解決方案

解決方案1 4 2019-07-20 20:10:57

解決方案2 1 2018-09-02 09:12:26

解決方案3 0 2020-06-29 15:46:27

解決方案4 -3 2018-09-02 09:13:08

解決方案5 -3 2020-07-12 17:50:42

解決方案1
4 2019-07-20 20:10:57

解決方案2
1 2018-09-02 09:12:26

解決方案3
0 2020-06-29 15:46:27

解決方案4
-3 2018-09-02 09:13:08

解決方案5
-3 2020-07-12 17:50:42