Spark 2.4.0 writing empty DataFrame to Parquet on AWS S3

After a Spark 2.4.0 EMR job writes an empty DataFrame to AWS S3:

import org.apache.spark.sql.SaveMode

df
  .repartition(1) // collapse to a single write task
  .write
  .mode(SaveMode.Append)
  .partitionBy(/* some_partitions */)
  .parquet(target)

There is no output at the target S3 location. However, this is not what I would expect based on this resolved issue. There is no exception, but there are also no metadata files and no _SUCCESS file in the target folder.
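For reference, a minimal self-contained reproduction sketch; the schema, the dt partition column, and the bucket path are hypothetical placeholders, not taken from the actual job:

import org.apache.spark.sql.{SaveMode, SparkSession}

val spark = SparkSession.builder().appName("empty-df-write").getOrCreate()
import spark.implicits._

// An empty DataFrame with a hypothetical two-column schema,
// to be partitioned by the hypothetical column dt.
val emptyDf = Seq.empty[(String, String)].toDF("id", "dt")

emptyDf
  .repartition(1)
  .write
  .mode(SaveMode.Append)
  .partitionBy("dt")
  .parquet("s3://my-bucket/target/") // placeholder bucket/prefix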

Thanks in advance!

How about writing to the core node's HDFS? Do you see files written there?
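A sketch of that check, assuming the question's df and partition columns are in scope (the HDFS path below is a hypothetical placeholder):

// Write the same DataFrame to the core node's HDFS instead of S3.
df
  .repartition(1)
  .write
  .mode(org.apache.spark.sql.SaveMode.Append)
  .partitionBy(/* some_partitions */)
  .parquet("hdfs:///tmp/empty-df-check/") // placeholder HDFS path

// Then compare the two locations, e.g. from the master node:
//   hdfs dfs -ls -R /tmp/empty-df-check/
//   aws s3 ls --recursive s3://my-bucket/target/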
