简体   繁体   English

使用 azure 数据工厂将多个 xml 文件合并到 csv 文件

[英]Merging multiple xml files to a csv file using azure Data Factory

I have established a data factory pipeling to download zip file(open https linked source Url reference: https://clinicaltrials.gov/AllPublicXML.zip ). I have established a data factory pipeling to download zip file(open https linked source Url reference: https://clinicaltrials.gov/AllPublicXML.zip ). Unzip the same files to a folder that has multiple files of xml format.将相同的文件解压缩到包含多个 xml 格式文件的文件夹中。 Copy and Extracting Zipped XML files from HTTP Link Source to Azure Blob Storage using Azure Data Factory Now I want to merge all XML files in the folder to a CSV file. Copy and Extracting Zipped XML files from HTTP Link Source to Azure Blob Storage using Azure Data Factory Now I want to merge all XML files in the folder to a CSV file. I am using copy activity inside the data factory.我在数据工厂内使用复制活动。 While importing schema I am getting the following error message导入架构时,我收到以下错误消息

Input XML file 'input/ALLPublicXML.zip' is invalid with parsing error 'Data at the root level is invalid. Line 1, position 1.'. Data at the root level is invalid. Line 1, position 1. . Activity ID: 358b3545-f49c-4bba-8adc-13fb3de9bb20 在此处输入图像描述

Not exactly sure what is going wrong, but it would be really helpful if someone can guide me with the procedure.不完全确定出了什么问题,但是如果有人可以指导我进行该程序,那将非常有帮助。

This seems to be an encoding issue, can you check the file once for encoding, it should follow the below encoding formats:这似乎是一个编码问题,你可以检查一次文件的编码,它应该遵循以下编码格式:

在此处输入图像描述

There might be some extra characters in file which are not allowing to read, more information can be found here文件中可能有一些额外的字符不允许读取,更多信息可以在这里找到

声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.

 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM