简体繁体 English

使用 Azure 数据工厂处理 zip 文件

[英]Processing zip files using Azure Data Factory

原文 2021-04-08 23:27:03 8 1 azure/ azure-data-factory

In Data Lake I am looking for information on how to process.zip file which holds a single.在 Data Lake 中，我正在寻找有关如何处理的信息。zip 文件包含一个文件。 Not sure if I can directly process a contents inside a zip file into Azure Database without unzipping the contents into separate folder.不确定我是否可以直接将 zip 文件中的内容处理到 Azure 数据库中，而无需将内容解压缩到单独的文件夹中。

Another question is the text file is having variable number of column counts in pipe delimiter,as single text file is holding data from multiple tables.另一个问题是文本文件在 pipe 分隔符中的列数是可变的，因为单个文本文件包含来自多个表的数据。 Not sure what are the possible ways to process this variable column count text file.不确定处理此可变列计数文本文件的可能方法是什么。

1 个解决方案

Data Factory supports read the file/data in the.zip file, we can set the Compression type for the.zip file and we don't need unzipping the contents into separate folder.数据工厂支持读取.zip文件中的文件/数据，我们可以为.zip文件设置压缩类型，我们不需要将内容解压缩到单独的文件夹中。 See the example bellow:请参见下面的示例：

For you another question, if the text file has variable number of column counts in pipe delimiter.对于您的另一个问题，如果文本文件在 pipe 分隔符中具有可变的列数。 Data Factory can't do well with this kind of file, it will always consider the first row as the schema according the column delimiter number.数据工厂不能很好地处理这种文件，它总是会根据列分隔符编号将第一行视为架构。 Just for now, there isn't a good way to process it.就目前而言，没有很好的方法来处理它。

HTH. HTH。

将 zip 文件解压缩到 Azure 数据工厂中的 zip 文件中 - Unzip zip files within a zip file in Azure Data Factory

如何使用Azure Data Factory GUI将许多blob文件压缩成一个.gzip？ - How to zip many blob files into one .gzip using Azure Data Factory GUI?

Azure数据工厂将多个文件压缩为一个zip文件 - Azure Data Factory Compress Several Files to one single zip file

使用 Azure 数据工厂从 SFTP 获取 ZIP 文件到 Azure Datalake 并将其存储为 ZCC8D68D68C551C4ADEAFDE6 格式 - Get ZIP file from SFTP to Azure Datalake using Azure data factory and store it into CSV format

通过 Azure 数据工厂从 ftp 站点获取 zip 和非压缩文件 - 解压 - Getting zip and non-zipped files from ftp site via Azure Data Factory - unzipping

Azure 数据工厂管道中的批处理 - Batch Processing in Azure Data Factory Pipelines

Azure 数据工厂复制文件 - Azure Data Factory Copy Files

Azure 数据工厂 - 使用带有文件路径的 CSV 复制文件 - Azure Data Factory - Copy files using a CSV with filepaths

如何使用 VM 上的文件使用 Azure 数据工厂运行 python 脚本？ - How to run python script with Azure Data Factory using files on a VM?

Azure 逻辑应用程序/数据工厂，使用 FTPS 获取文件 - Azure Logic Apps/Data Factory, getting files using FTPS

暂无

暂无

声明:本站的技术帖子网页，遵循CC BY-SA 4.0协议，如果您需要转载，请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.

相关问题 将 zip 文件解压缩到 Azure 数据工厂中的 zip 文件中 - Unzip zip files within a zip file in Azure Data Factory 如何使用Azure Data Factory GUI将许多blob文件压缩成一个.gzip？ - How to zip many blob files into one .gzip using Azure Data Factory GUI? Azure数据工厂将多个文件压缩为一个zip文件 - Azure Data Factory Compress Several Files to one single zip file 使用 Azure 数据工厂从 SFTP 获取 ZIP 文件到 Azure Datalake 并将其存储为 ZCC8D68D68C551C4ADEAFDE6 格式 - Get ZIP file from SFTP to Azure Datalake using Azure data factory and store it into CSV format 通过 Azure 数据工厂从 ftp 站点获取 zip 和非压缩文件 - 解压 - Getting zip and non-zipped files from ftp site via Azure Data Factory - unzipping Azure 数据工厂管道中的批处理 - Batch Processing in Azure Data Factory Pipelines Azure 数据工厂复制文件 - Azure Data Factory Copy Files Azure 数据工厂 - 使用带有文件路径的 CSV 复制文件 - Azure Data Factory - Copy files using a CSV with filepaths 如何使用 VM 上的文件使用 Azure 数据工厂运行 python 脚本？ - How to run python script with Azure Data Factory using files on a VM? Azure 逻辑应用程序/数据工厂，使用 FTPS 获取文件 - Azure Logic Apps/Data Factory, getting files using FTPS

相关标签

粤ICP备18138465号 © 2020-2024 STACKOOM.COM