
Spark avoid partition overwrite

I am writing a Spark application that saves log data into the directory /logroot.

My code is

myDF.write.mode('overwrite').partitionBy('date', 'site').save('/logroot')

I want to use overwrite mode because I re-process all the daily data several times a week.

My concern is that overwrite wipes the entire /logroot directory, not just the partitions involved.

How can I solve this problem?

At the moment of writing, the best solution seems to be the following (a sketch is given after the list):

  • Extract from the initial DataFrame the partition values that need to be cleaned
  • Delete those partition directories using the Hadoop FileSystem API
  • Save the DataFrame using append mode
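A minimal PySpark sketch of these three steps, assuming a SparkSession named spark, the myDF DataFrame and /logroot path from the question, and going through the internal spark._jvm gateway (a common but unofficial way to reach the Hadoop FileSystem API from PySpark):

# Collect the distinct (date, site) pairs present in the new data
partitions = myDF.select('date', 'site').distinct().collect()

# Reach the Hadoop FileSystem through the SparkSession's JVM gateway
hadoop_conf = spark._jsc.hadoopConfiguration()
fs = spark._jvm.org.apache.hadoop.fs.FileSystem.get(hadoop_conf)
Path = spark._jvm.org.apache.hadoop.fs.Path

# Delete only the partition directories that are about to be rewritten
for row in partitions:
    part_path = Path('/logroot/date=%s/site=%s' % (row['date'], row['site']))
    if fs.exists(part_path):
        fs.delete(part_path, True)  # True = recursive delete

# Append the new data; untouched partitions are left as they are
myDF.write.mode('append').partitionBy('date', 'site').save('/logroot')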

Thanks to all for the help; I hope the Spark developers will provide a more elegant option.

Roberto
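Note for later readers: Spark 2.3 added a dynamic partition overwrite mode that does roughly the above out of the box. A minimal sketch, assuming the same myDF and /logroot as in the question:

spark.conf.set('spark.sql.sources.partitionOverwriteMode', 'dynamic')
# With dynamic mode, overwrite replaces only the partitions present in myDF
myDF.write.mode('overwrite').partitionBy('date', 'site').save('/logroot')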
