简体   繁体   English

如何使用 Azure 数据工厂对日期或键列执行增量加载

[英]How to perform Incremental Load with date or key column using Azure data factory

I wanted to achieve an incremental load from oracle to Azure SQL data warehouse using azure data factory.我想使用 azure 数据工厂实现从 oracle 到 Azure SQL 数据仓库的增量加载。 The Issue that I am facing is I don't have any date column or any key column to perform Incremental load Is there any other way to achieve this.我面临的问题是我没有任何日期列或任何关键列来执行增量加载有没有其他方法可以实现这一点。

You will either have to:您要么必须:

A. Identify a field in each table you want to use to determine if the row has changed B. Implement some kind of change capture feature on the source data A. 确定每个表中要用于确定行是否已更改的字段 B. 对源数据实施某种更改捕获功能

Those are really the only the only two ways to limit the amount of data you pull from the source.这实际上是限制从源提取的数据量的仅有的两种方法。

It wouldn't be very efficient, but if you are just trying not to update rows that haven't changed in your destination, you can hash your source values and hash the values in the destination, and only insert/update rows where the hashes don't match.它的效率不会很高,但是如果您只是想不更新目标中未更改的行,则可以散列源值并散列目标中的值,并且仅插入/更新散列的行不匹配。 Here's an example of how this works in T-SQL.下面是一个在 T-SQL 中如何工作的示例

There is a section of the Data Factory documentation dedicated to incrementally loading data. 数据工厂文档中有一部分专门用于增量加载数据。 Please check it out if you haven't.如果你还没有,请检查一下。

声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.

 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM