简体   繁体   中英

How to uncompress rar files using Azure DataFactory

We have a new client, while landing the project we gave them a blob storage for them to leave files so we could later automate and process the information.

The idea is to use Azure Datafactory but we find no way of dealing with.rar files, and even.zip, being it files from windows, are giving us trouble. And since it is the clien giving the.rar format, we wanted to make absolutely sure there is no way to process before asking them to change it, or deploying a databricks or similar service just for the purpose of transforming the file.

Is there any way to get a.rar file from a blob storage, uncompress it, then process it?

I have been looking in posts like this and related official documentation and closest we have come is using ZipDeflate, but it does not seem to fill our requirement.

Thanks in advance!

Data factory compression only supported types are GZip, Deflate, BZip2, and ZipDeflate.

For the Unsupported file types and compression formats , Data Factory provides some workarounds for us:

You can use the extensibility features of Azure Data Factory to transform files that aren't supported. Two options include Azure Functions and custom tasks by using Azure Batch.

You can see a sample that uses an Azure function to extract the contents of a tar file . For more information, see Azure Functions activity .

You can also build this functionality using a custom dotnet activity. Further information is available here .

Next way, you may need to figure out how to using Azure function to extract the contents of a rar file.

you can use logic apps you can use webhook activity calling a runbook

both are easiee than using a custom activity

The technical post webpages of this site follow the CC BY-SA 4.0 protocol. If you need to reprint, please indicate the site URL or the original address.Any question please contact:yoyou2525@163.com.

 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM