在熊猫中读取带有额外行的csv文件

Question

I'm trying to use pandas to manipulate a .txt file but I have extraline as shown in the picture below:我正在尝试使用 Pandas 来操作 .txt 文件，但我有如下图所示的 extraline：

When i read the file当我阅读文件时

import pandas as pd

df=pd.read_csv('movies.txt',sep='|')
print(df)

I got this as an output:我得到了这个作为输出：

                                                          +--------+--------------------------+------------+---------------+
NaN                                                id       MovieNAme                  Year         Author                                                         NaN       

+--------+--------------------------+----------... NaN      NaN                        NaN          NaN                                                            NaN       

NaN                                                1234     once upon deadpool                 2017 Alicia                                                         NaN       

+--------+--------------------------+----------... NaN      NaN                        NaN          NaN                                                            NaN       

NaN                                                1244     avengers: endgame                  2014 John                                                           NaN       

+--------+--------------------------+----------... NaN      NaN                        NaN          NaN                                                            NaN       

NaN                                                1245     The bird King                      2017 Mark                                                           NaN       

+--------+--------------------------+----------... NaN      NaN                        NaN          NaN                                                            NaN

How can i fix this please and remove this line "---------------------+----------"我该如何解决这个问题并删除这一行“---------------------+---------”

Answer 1

Try:尝试：

df = pd.read_csv(
    "name_of_your_file.txt",
    sep=r"\s*\|\s*",
    comment="+",
    usecols=range(1, 5),
    engine="python",
)
print(df)

Answer 2

Usually, when working with csv we create empty spaces but pandas consider it as a line.通常，在使用 csv 时，我们会创建空白空间，但 Pandas 将其视为一条线。 do it's always better to add DataFrameName.dropna(axis=0, inplace=False) to drop those empty cells or you can just go to csv and do it manually.添加DataFrameName.dropna(axis=0, inplace=False)来删除那些空单元格总是更好，或者您可以直接转到 csv 并手动执行。

在熊猫中读取带有额外行的csv文件

问题描述

2 个解决方案

解决方案1
1 已采纳 2021-07-01 11:38:40

解决方案2
-1 2021-07-01 11:38:50

在熊猫中读取带有额外行的csv文件

问题描述

2 个解决方案

解决方案1 1 已采纳 2021-07-01 11:38:40

解决方案2 -1 2021-07-01 11:38:50

解决方案1
1 已采纳 2021-07-01 11:38:40

解决方案2
-1 2021-07-01 11:38:50