I have a JSON file I want to read with Spark (Scala), but when I read it as a DataFrame, all I get is a "_corrupt_record" column. I have tried every option I could find.
val df = spark.read
.format("json")
.option("multiline","true")
.load("PATH")
+--------------------+
| _corrupt_record|
+--------------------+
| {|
| "name": "Candace|
| "phone": "1-355...|
| "email": "egest...|
| "address": "160...|
| "postalZip": "1...|
| "rankings": "9,...|
| "alphanumeric":...|
| },|
| {|
| "name": "Grant ...|
| "phone": "(884)...|
| "email": "magna...|
| "address": "P.O...|
| "postal Zip": "6...|
| "rankings": "9,...|
| "alphanumeric":...|
| },|
| {|
| "name": "Patrice|
+--------------------+
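A record lands in _corrupt_record when Spark's JSON parser cannot make sense of it, which usually means the file itself is malformed (note, for example, the apparently unterminated string after "name": "Candace and the inconsistent "postalZip" / "postal Zip" keys in the output above). A hedged debugging sketch, assuming a running SparkSession named `spark` and using standard Spark JSON reader options, for isolating the unparseable records so you can inspect the raw text:

```scala
import org.apache.spark.sql.functions.col

// Keep the raw text of records that fail to parse in a dedicated column.
val df = spark.read
  .option("multiLine", "true")
  .option("mode", "PERMISSIVE")                        // default: bad records go to the corrupt column
  .option("columnNameOfCorruptRecord", "_corrupt_record")
  .json("PATH")

// Spark requires caching the DataFrame before querying _corrupt_record on its own.
df.cache()
df.filter(col("_corrupt_record").isNotNull).show(false)
```

With multiLine enabled, a parse failure typically puts the entire file into a single corrupt row, so seeing the content split line by line as above can also indicate the multiLine option was not in effect for that read.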
Using the JSON example you provided in the comments,
[ { "firstName": "Joe", "lastName": "Jackson", "gender": "male", "age": 28, "address": { "streetAddress": "101", "city": "San Diego", "state": "CA" }, "phoneNumbers": { "type": "home", "number": "7349282382" } } ]
the following line works well:
val df = spark.read.json("your/path")
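This works because the sample is a JSON array on a single line, which Spark's default line-delimited JSON reader expands into one row per element. A slightly fuller sketch, assuming a running SparkSession named `spark` (the path is a placeholder), showing how the nested address struct can then be queried:

```scala
import org.apache.spark.sql.functions.col

val df = spark.read.json("your/path")

// Inspect the inferred schema, including the nested address/phoneNumbers structs.
df.printSchema()

// Nested fields are addressed with dot notation.
df.select(col("firstName"), col("age"), col("address.city")).show()
```

If the same array were pretty-printed across multiple lines, you would need to add `.option("multiLine", "true")` before `.json(...)` for it to parse.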