
Using an incremental ID as the watermark for copying data in an Azure Data Factory pipeline instead of a datetime

I'm able to incrementally load data from a source Azure MSSQL DB to a sink Azure MSSQL DB using a timestamp. For certain reasons, I want to load the data incrementally using an incremental ID in the source database instead of a timestamp. Is this possible?

I need to run the Copy Data activity only once a day, so I would also like to store the batch ID of each Copy Data run in a batch_details table.

I'm using ADF v2.

I'm new to Azure. How do I do this?

What you are asking for is basically what this tutorial from the official documentation explains: https://docs.microsoft.com/en-us/azure/data-factory/tutorial-incremental-copy-overview

You can use any column whose value only ever increases as the watermark. The tutorial uses a datetime value, but an incremental ID works too.

To run the pipeline once a day, use a schedule trigger: https://docs.microsoft.com/en-us/azure/data-factory/concepts-pipeline-execution-triggers#schedule-trigger
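As a rough illustration of what a once-a-day schedule looks like, the Python sketch below builds only the recurrence part of a schedule trigger definition and prints it as JSON. The helper name daily_trigger_fragment, the start time, and the 02:00 UTC run time are assumptions chosen for the example. The trigger name and the reference to your pipeline are deliberately left out, so check the linked documentation for the full definition before using it.

```python
import json


def daily_trigger_fragment(start_time_utc: str, hour: int = 2, minute: int = 0) -> dict:
    """Return the type and recurrence of a schedule trigger that fires once a day.

    Hypothetical helper: it omits the trigger name and the pipeline reference.
    """
    return {
        "type": "ScheduleTrigger",
        "typeProperties": {
            "recurrence": {
                "frequency": "Day",   # unit of the recurrence
                "interval": 1,        # every 1 day
                "startTime": start_time_utc,
                "timeZone": "UTC",
                # Time of day at which the daily run fires.
                "schedule": {"hours": [hour], "minutes": [minute]},
            }
        },
    }


if __name__ == "__main__":
    print(json.dumps(daily_trigger_fragment("2024-01-01T00:00:00Z"), indent=2))
```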

Hope this helps!

