简体   繁体   中英

Merging multiple xml files to a csv file using azure Data Factory

I have established a data factory pipeling to download zip file(open https linked source Url reference: https://clinicaltrials.gov/AllPublicXML.zip ). Unzip the same files to a folder that has multiple files of xml format. Copy and Extracting Zipped XML files from HTTP Link Source to Azure Blob Storage using Azure Data Factory Now I want to merge all XML files in the folder to a CSV file. I am using copy activity inside the data factory. While importing schema I am getting the following error message

Input XML file 'input/ALLPublicXML.zip' is invalid with parsing error 'Data at the root level is invalid. Line 1, position 1.'. Data at the root level is invalid. Line 1, position 1. . Activity ID: 358b3545-f49c-4bba-8adc-13fb3de9bb20 在此处输入图像描述

Not exactly sure what is going wrong, but it would be really helpful if someone can guide me with the procedure.

This seems to be an encoding issue, can you check the file once for encoding, it should follow the below encoding formats:

在此处输入图像描述

There might be some extra characters in file which are not allowing to read, more information can be found here

The technical post webpages of this site follow the CC BY-SA 4.0 protocol. If you need to reprint, please indicate the site URL or the original address.Any question please contact:yoyou2525@163.com.

 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM