简体   繁体   English

云数据融合管道中的 SCD 类型 2 实现

[英]SCD type 2 implementation in Cloud Data Fusion Pipeline

I have requirement to apply SCD2 in my data fusion pipeline and my source data is GCS and target is BigQuery.我需要在我的数据融合管道中应用 SCD2,我的源数据是 GCS,目标是 BigQuery。 I have created the pipeline and able to load the data into target BigQery so now I want to implement merge and update process as part of SCD2.我已经创建了管道并能够将数据加载到目标 BigQery 中,所以现在我想将合并和更新过程作为 SCD2 的一部分来实现。 Can you please help me as I'm new to Google cloud data fusion.你能帮我吗,因为我是谷歌云数据融合的新手。

@phaneendra's approach is a possible solution. @phaneendra 的方法是一种可能的解决方案。 Anyhow, Data Fusion is designed to be used in a code-free visual way.无论如何,Data Fusion 旨在以无代码的可视化方式使用。 If you want to take advantage of its potential, I would recommend you to use Wrangler for your data transformations.如果您想利用它的潜力,我建议您使用 Wrangler 进行数据转换。 In Google's docs you can find an interesting and quick tutorial on how to use it to clean, transform, and process customer data.在 Google 的文档中,您可以找到有关如何使用它来清理、转换和处理客户数据的有趣且快速的教程 Then you would just have to sink the transformed data with the BigQuery plugin as you are doing now.然后,您只需像现在一样使用BigQuery 插件接收转换后的数据。

声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.

 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM