Python hdfs writer adds an extra index column to the CSV output

Question

I need to write a CSV to HDFS. Currently I am using the hdfs module to do that.

import pandas as pd
from hdfs import InsecureClient

# data and path are defined elsewhere
df = pd.DataFrame(data, columns=['FirstName', 'LastName', 'City'])
d = InsecureClient('http://localhost:50070')
with d.write(path, encoding='utf-8', overwrite=True) as writer:
    df.to_csv(writer)

The file is successfully generated, but it adds an extra index column at the start of the CSV. I need the file to contain only the columns I have specified. How can I remove this index? I could not find any parameter for that.

#current output:
,"FirstName","LastName","City"
0,"John","Doo","New York" 
1,"Jane","Doo","San Francisco"

#expected:
"FirstName","LastName","City"
"John","Doo","New York"
"Jane","Doo","San Francisco"

Thanks in advance, clairvoyant

Update: I am using the pandas package to create my CSV.

Answer 1

Set index=False when calling to_csv (the default is True).
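A minimal sketch of that fix, under a few assumptions: the sample rows are copied from the question's output, and an in-memory `io.StringIO` buffer stands in for the HDFS writer so the snippet runs without a cluster. The names `data`, `buffer`, and `csv_text` are illustrative only.

```python
import io

import pandas as pd

# Sample rows taken from the output shown in the question (illustrative data).
data = [["John", "Doo", "New York"], ["Jane", "Doo", "San Francisco"]]
df = pd.DataFrame(data, columns=["FirstName", "LastName", "City"])

# io.StringIO is a stand-in for the writer returned by InsecureClient.write(),
# used here only so the sketch runs without an HDFS cluster.
buffer = io.StringIO()

# index=False tells pandas not to emit the row index as the first column.
df.to_csv(buffer, index=False)

csv_text = buffer.getvalue()
print(csv_text)
```

With the hdfs client from the question, the same keyword would be passed inside the existing `with d.write(...) as writer:` block, i.e. `df.to_csv(writer, index=False)`.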

Solution 1
0 Accepted 2017-11-26 20:43:55
