Pyspark: how to read a .csv file?

Question

I am trying to read a .csv file that has a strange format.

This is what I am doing:

df = spark.read.format("csv").option("header", "true").option("delimiter", ",").load("muyFile.csv")
df.show(5)

I do not understand why the lonlat entry of the third id is transposed. It seems that the file has two different delimiters. Your help would be much appreciated!

Answer 1

Your tag field probably contains a comma as part of its value, and that comma is being treated as the delimiter. Enclose such values in double quotes, or in any other quote character (in that case, remember to pass the character via .option('quote', ...)), and read the data again. It should work.
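A minimal sketch of why quoting helps, using only Python's standard csv module rather than Spark itself. The column names (id, lonlat, tag) and all values are made up for illustration, guessed from the question; the real file may look different.

```python
import csv
import io

# Hypothetical rows: column names and values are assumptions, not the asker's data.
rows = [
    ["id", "lonlat", "tag"],
    ["1", "POINT (4.35 50.85)", "shop"],
    ["3", "POINT (4.40 50.90)", "bar, restaurant"],  # this value contains a comma
]

# Unquoted file: fields are simply joined with commas.
unquoted = "\n".join(",".join(row) for row in rows)

# Quoted file: csv.writer wraps any field containing the delimiter in double quotes.
buffer = io.StringIO()
csv.writer(buffer, quoting=csv.QUOTE_MINIMAL).writerows(rows)
quoted = buffer.getvalue()

unquoted_rows = list(csv.reader(io.StringIO(unquoted)))
quoted_rows = list(csv.reader(io.StringIO(quoted)))

# The embedded comma splits the unquoted row into an extra field.
print([len(row) for row in unquoted_rows])  # [3, 3, 4]
# With quotes, every row keeps three fields.
print([len(row) for row in quoted_rows])    # [3, 3, 3]
print(quoted_rows[2][2])                    # bar, restaurant
```

If the file is rewritten with quotes in this way, the read from the question should keep the columns aligned, because the double quote is the default quote character of Spark's CSV reader; adding .option("quote", '"') just makes that explicit.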
