简体   繁体   English

使用Scala Spark将EDI格式转换为CSV?

[英]Convert EDI format to csv using scala spark?

如何使用spark或scala将EDI格式文件转换为CSV文件?

You can use a tool like this to create a mapping from EDI format to CSV and then generate a code in that tool. 您可以使用一个工具,像这样来创建EDI格式CSV映射,然后生成在该工具中的代码。 This code then can be used to convert EDI to CSV in Spark. 然后,可以使用此代码在Spark中将EDI转换为CSV。

For open source solutions, I think your best bet is EDI Reader from BerryWorks . 对于开源解决方案,我认为最好的选择是BerryWorks的EDI Reader Haven't tried it myself, but apparently this is what Hortonworks recommends , and I'd trust their judgement in the Big Data area. 我自己还没有尝试过,但是显然这是Hortonworks建议的 ,我相信他们在大数据领域的判断。 I'm not involved with either, for the matters of disclosure. 就披露而言,我也不参与。

From there, it's still a matter of converting EDI XML representation to CSV. 从那里开始,仍然需要将EDI XML表示转换为CSV。 Given that XML processing is not part of vanilla Spark, again, your options are rather limited here. 同样,由于XML处理不属于Vanilla Spark的一部分,因此,此处的选择相当有限。 Try Databricks spark-xml maybe? 试试Databricks spark-xml吗?

声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.

 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM