Python Pandas: Skip rows by particular pattern (not row number) using pd.read_csv

Question

I'm trying to import the csv file into the Pandas DataFrame. However, here is the challenge, for example, I cannot use skiprows=9 , because the csv format is inconsistent from time to time, it will contain some useless information before the actual table begins.

Fortunately, before the table starts, there will always have a single row with the string "report field", and then the real table starts from the next line.

Is there any way I can skip all the rows until it catches the pattern "report field"?

Thanks.

Answer 1

df= pandas.read_csv("file.csv",header= None)
df_2= df.iloc[(df.loc[df[0]=='report field'].index[0]+1):, :].reset_index(drop = True)

So, the above line searches for "report field" value in "0" column of "df" dataframe and then picks up data from the next row to the last row in the "file.csv" file

Python Pandas: Skip rows by particular pattern (not row number) using pd.read_csv

Question

1 answers

solution1
0 2017-06-28 11:01:01

Python Pandas: Skip rows by particular pattern (not row number) using pd.read_csv

Question

1 answers

solution1 0 2017-06-28 11:01:01

solution1
0 2017-06-28 11:01:01