[英]How to write Spark data frame to xml file?
Sample : 样品:
scala> Frame.show()
|year| make|model| comment|blank|
|2012|Tesla| S| No comment| R|
|1997| Ford| E350|Go get one now th...| L|
|2015|Chevy| Volt| Try| M|
to 至
<item>
<'year'>2012<'/year'>
<'make'>Tesla<'/make'>
<'model'>S<'/mode'>
</item>
The simplest approach is to use XML writer from spark-xml
: 最简单的方法是使用
spark-xml
:
val path: String = ???
df.write.format("com.databricks.spark.xml")
.option("rootTag", "items")
.option("rowTag", "item")
.save(path)
If for some reason it doesn't fit your needs you can dump records individually and saveAsTextFile
: 如果由于某种原因它不符合您的需求,您可以单独转储记录并
saveAsTextFile
:
def dumpXML(row: Row): String = ???
df.rdd.map(dumpXML).saveAsTextFile(path)
You can add root element using for example mapPartitions
. 您可以使用例如
mapPartitions
添加根元素。
声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.