Replace the dataframe header before printing

Question

I have a csv that looks like:

bookId,bookName,author,year,genre,bookCount
1,book1,au1,1989,gen1,89
2,book2,au2,788,gen2,55
3,book3,au3,9799,gen1,7

When I read it and print it to the terminal with the following commands:

import pandas as pd

df = pd.read_csv('some3.csv', index_col=0)
print(df)

I get:

       bookName author  year genre  bookCount
bookId
1         book1    au1  1989  gen1         89
2         book2    au2   788  gen2         55
3         book3    au3  9799  gen1          7

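(Note that bookId appears on a separate row. I am a beginner, so it would be helpful if someone could also explain this.)

However, I want to display the df as: (custom headers)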

Book ID  Book Name  Author  Published Year   Genre  Book Count
1        book1      au1     1989             gen1   89
2        book2      au2     788              gen2   55
3        book3      au3     9799             gen1   7

And sometimes as: (without the Genre column)

Book ID  Book Name  Author  Published Year  Book Count
1        book1      au1     1989            89
2        book2      au2     788             55
3        book3      au3     9799            7

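(That is, by replacing the header with a custom one and, when needed, omitting a few columns.)

Finally, I also want to write this df to a new csv file that should look like this: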

Book ID,Book Name,Author,Published Year,Genre,Book Count
1,book1,au1,1989,gen1,89
2,book2,au2,788,gen2,55
3,book3,au3,9799,gen1,7

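I am willing to add some arguments to pd.read_csv() to replace the header (or to change this statement entirely if necessary).

I could also create a new df that copies the values and adds the custom header, or make any other code adjustments.

But I cannot change the first (existing) csv file.

How can I achieve this?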

Answer 1

When you read the csv:

df = pd.read_csv('some3.csv')
# If you pass index_col=0, pandas uses the first column as the index,
# which is why its name is printed one row below the other headers.
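To make the index explanation above concrete, here is a minimal sketch that reads the question's sample data both ways. It assumes the data is held in an in-memory string (`CSV_TEXT` and `io.StringIO` are stand-ins for 'some3.csv', not part of the original post).

```python
import io
import pandas as pd

# Sample data copied from the question; io.StringIO stands in for 'some3.csv'.
CSV_TEXT = """bookId,bookName,author,year,genre,bookCount
1,book1,au1,1989,gen1,89
2,book2,au2,788,gen2,55
3,book3,au3,9799,gen1,7
"""

# With index_col=0 the first column becomes the row index,
# so bookId is no longer one of the regular columns.
indexed = pd.read_csv(io.StringIO(CSV_TEXT), index_col=0)

# Without index_col, bookId stays a regular column and pandas
# adds a default integer index starting at 0.
plain = pd.read_csv(io.StringIO(CSV_TEXT))

print(indexed.index.name)     # the index name is what print(df) shows on its own row
print(list(indexed.columns))  # bookId is absent here
print(list(plain.columns))    # bookId is present here
```

When printing, `print(plain.to_string(index=False))` hides the default integer index, which gives a layout close to the one requested in the question.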

Then replace the column names with rename:

df = df.rename(columns={'bookId' : 'Book ID',  ....})

Then write to csv:

# index=False keeps the default integer row index out of the file
df.to_csv('newfile.csv', index=False)
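The three steps above can be sketched end to end as follows. The full rename mapping is inferred from the desired header shown in the question, not taken from this answer, and `CSV_TEXT` is a hypothetical in-memory stand-in for 'some3.csv'. Calling `to_csv` without a path returns the CSV text, which is used here so the sketch does not write any file.

```python
import io
import pandas as pd

# Sample data copied from the question; io.StringIO stands in for 'some3.csv'.
CSV_TEXT = """bookId,bookName,author,year,genre,bookCount
1,book1,au1,1989,gen1,89
2,book2,au2,788,gen2,55
3,book3,au3,9799,gen1,7
"""

# Assumed mapping from the original names to the headers the question asks for.
NEW_NAMES = {
    "bookId": "Book ID",
    "bookName": "Book Name",
    "author": "Author",
    "year": "Published Year",
    "genre": "Genre",
    "bookCount": "Book Count",
}

# Read without index_col so bookId stays a regular column, then rename.
df = pd.read_csv(io.StringIO(CSV_TEXT)).rename(columns=NEW_NAMES)

# to_string(index=False) prints the table without the default integer index.
print(df.to_string(index=False))

# With no path argument, to_csv returns the CSV text instead of writing a file.
csv_out = df.to_csv(index=False)
print(csv_out)
```

To write a real file, pass a path as the first argument, for example `df.to_csv('newfile.csv', index=False)`.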

Answer 2

To change column names:

df = pd.DataFrame({'aa':[1,3], 'bb': [13,20]})
df.columns = ['a', 'b']
df
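Since the question asks whether pd.read_csv() itself can replace the header, here is a hedged sketch of that variant. It relies on the `header` and `names` parameters of `read_csv`; the `CUSTOM_HEADER` list and the in-memory `CSV_TEXT` are illustrative assumptions.

```python
import io
import pandas as pd

# Sample data copied from the question; io.StringIO stands in for 'some3.csv'.
CSV_TEXT = """bookId,bookName,author,year,genre,bookCount
1,book1,au1,1989,gen1,89
2,book2,au2,788,gen2,55
3,book3,au3,9799,gen1,7
"""

# Replacement names, in the same order as the columns in the file.
CUSTOM_HEADER = ["Book ID", "Book Name", "Author", "Published Year", "Genre", "Book Count"]

# header=0 marks the first row as the existing header so it is not read as data;
# names supplies the column names to use instead.
df = pd.read_csv(io.StringIO(CSV_TEXT), header=0, names=CUSTOM_HEADER)

print(df.to_string(index=False))
```

This approach depends on the column order in the file, whereas a rename mapping depends on the original column names, so which one is more robust depends on how the input file is likely to change.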

To delete a column:

del df['column_name']
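Note that `del` modifies the dataframe in place. Because the question only sometimes wants the genre column hidden, a non-destructive alternative may fit better. The sketch below uses `DataFrame.drop`, which returns a new frame; the small frame here is an illustrative subset of the question's data.

```python
import pandas as pd

# Small frame mirroring some of the question's columns (values copied from the question).
df = pd.DataFrame({
    "bookName": ["book1", "book2", "book3"],
    "genre": ["gen1", "gen2", "gen1"],
    "bookCount": [89, 55, 7],
})

# drop returns a new DataFrame; df itself still contains the genre column.
without_genre = df.drop(columns=["genre"])

print(without_genre.to_string(index=False))
print(list(df.columns))
```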

To write to CSV:

df.to_csv(r'Path where you want to store the exported CSV file\File Name.csv', index = False)
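If the dataframe should keep its original column names in memory, the custom header and the column selection can also be applied only at write time. This sketch uses the `columns` and `header` parameters of `to_csv`; the chosen columns and aliases are assumptions based on the question's "no genre" layout, and `CSV_TEXT` stands in for 'some3.csv'.

```python
import io
import pandas as pd

# Sample data copied from the question; io.StringIO stands in for 'some3.csv'.
CSV_TEXT = """bookId,bookName,author,year,genre,bookCount
1,book1,au1,1989,gen1,89
2,book2,au2,788,gen2,55
3,book3,au3,9799,gen1,7
"""

df = pd.read_csv(io.StringIO(CSV_TEXT))

# columns selects which columns to write; header gives aliases in the same order.
# With no path argument, to_csv returns the CSV text instead of writing a file.
csv_out = df.to_csv(
    index=False,
    columns=["bookId", "bookName", "author", "year", "bookCount"],
    header=["Book ID", "Book Name", "Author", "Published Year", "Book Count"],
)

print(csv_out)
```

The dataframe itself is left unchanged, so later code can still refer to columns such as `bookId` and `genre`.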

Answer 1: 1 2020-08-14 01:22:31
Answer 2: 0 2020-08-14 01:24:08
