
Write Parquet data to a file on S3 with Scala

Is there a way to write Parquet data to S3 quickly and reliably using Scala?

Yes, using Spark. Once you have a DataFrame df, you can call df.write.parquet("s3a://newfilename").
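A minimal sketch of that call in a complete program might look like the following. The object name, the sample data, and the default output path are all made up for illustration. It assumes Spark is on the classpath; writing to an s3a:// URL additionally assumes the hadoop-aws module and working AWS credentials, neither of which is set up here. Note that the path names a directory: Spark writes one or more part files under it rather than a single file.

```scala
import org.apache.spark.sql.{DataFrame, SaveMode, SparkSession}

object WriteParquetExample {
  // Writes the DataFrame as Parquet under `path`, replacing any existing output.
  // `path` can be a local URI or an s3a:// URI (the latter needs hadoop-aws).
  def writeParquet(df: DataFrame, path: String): Unit =
    df.write.mode(SaveMode.Overwrite).parquet(path)

  def main(args: Array[String]): Unit = {
    // Local master for the sketch; on a cluster this is usually set by spark-submit.
    val spark = SparkSession.builder()
      .appName("write-parquet-example")
      .master("local[*]")
      .getOrCreate()
    import spark.implicits._

    // Hypothetical sample data.
    val df = Seq((1, "a"), (2, "b")).toDF("id", "label")

    // Hypothetical destination; pass e.g. s3a://<bucket>/<prefix> as the first argument.
    val target = args.headOption.getOrElse("file:///tmp/parquet-example")
    writeParquet(df, target)

    spark.stop()
  }
}
```

SaveMode.Overwrite is a choice made for the sketch so that reruns do not fail; the default mode raises an error if the destination already exists.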

You can work with s3a over a long-haul connection from a standalone installation if you want to, though I'd recommend against using it as the direct destination of work because of performance issues. It's better to write the output locally and then copy it there (hadoop distcp does this).
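As a rough illustration of the "write locally, then copy" approach, here is a sketch that does the copy step from Scala with Hadoop's FileUtil.copy. This is not distcp: distcp runs the copy as a distributed job, while this is a single-process copy that may only be reasonable for small outputs. The object and method names are hypothetical, and an s3a:// destination again assumes hadoop-aws and credentials are configured.

```scala
import org.apache.hadoop.conf.Configuration
import org.apache.hadoop.fs.{FileUtil, Path}

object CopyOutput {
  // Copies `src` (a file or a directory tree) to `dst` and returns true on success.
  // Both arguments are URIs, e.g. file:///tmp/out and s3a://<bucket>/<prefix>.
  def copyTree(src: String, dst: String, conf: Configuration): Boolean = {
    val srcPath = new Path(src)
    val dstPath = new Path(dst)
    // Resolve each filesystem from its own URI scheme (file, s3a, hdfs, ...).
    val srcFs = srcPath.getFileSystem(conf)
    val dstFs = dstPath.getFileSystem(conf)
    // deleteSource = false keeps the local copy in place after the upload.
    FileUtil.copy(srcFs, srcPath, dstFs, dstPath, false, conf)
  }
}
```

A call such as CopyOutput.copyTree("file:///tmp/parquet-example", "s3a://<bucket>/<prefix>", new Configuration()) would then follow the local write; for large datasets the hadoop distcp command mentioned above is the more usual tool.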
