
Using case class versus StructType in Spark Scala

When should I use StructType, and when should I use a case class? I am trying to create a Spark Dataset. I have an input CSV file, and I am trying to create a DataFrame first and then convert it to a Dataset using df.as[]. Now, in order to generate the schema, should I use StructType or a case class? Please help.

You don't have to use StructType when reading your CSV file, but:

  • By default all fields are read as strings unless you enable the inferSchema option
  • If the file has no header, you have to name every field yourself, like this (a fuller sketch follows the example below):

    sparkSession.read.csv("my/csv/path.csv").toDF("id","product","customer","time").as[Transaction]
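Here is a minimal sketch contrasting the two approaches. The Transaction field names and types, the file path, and the assumption that the CSV has no header are all illustrative, not taken from the question:

    import org.apache.spark.sql.SparkSession
    import org.apache.spark.sql.types._

    // Hypothetical case class matching the four CSV columns.
    case class Transaction(id: Int, product: String, customer: String, time: String)

    object CsvToDataset {
      def main(args: Array[String]): Unit = {
        val spark = SparkSession.builder()
          .appName("csv-to-dataset")
          .master("local[*]")
          .getOrCreate()
        import spark.implicits._

        // Option 1: explicit StructType schema. No inference pass over the
        // file; column names and types are pinned down up front.
        val schema = StructType(Seq(
          StructField("id", IntegerType, nullable = false),
          StructField("product", StringType),
          StructField("customer", StringType),
          StructField("time", StringType)
        ))
        val ds1 = spark.read.schema(schema).csv("my/csv/path.csv").as[Transaction]

        // Option 2: let Spark infer the types, then name the columns and
        // convert to a typed Dataset via the case class encoder.
        val ds2 = spark.read
          .option("inferSchema", "true")
          .csv("my/csv/path.csv")
          .toDF("id", "product", "customer", "time")
          .as[Transaction]

        ds1.show()
        ds2.show()
        spark.stop()
      }
    }

Either way, .as[Transaction] only needs the DataFrame's column names and types to line up with the case class fields; the explicit StructType avoids the extra scan of the file that inferSchema performs, which can matter on large inputs.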


 