简体   繁体   English

使用 Cloud Datapreb 与使用 DBT 在层之间转换数据有什么区别?

[英]What are the differences between using Cloud Datapreb vs using DBT for transforming data between layers?

I've worked on two projects where DBT is used to transform data between bronze(raw) silver(refined) and gold(serving) layers.我从事过两个项目,其中 DBT 用于在青铜(原始)银(精制)和金(服务)层之间转换数据。 I know that cloud Dataprep can be also used to transform data between layers and prepare it for visualization and ML/AI.我知道 Cloud Dataprep 也可用于在层之间转换数据并为可视化和 ML/AI 做好准备。

So what are the differences between using these two in terms of skills, budget, ease of use, and setup, what are use cases where one can't be substituted with the other?那么,在技能、预算、易用性和设置方面使用这两者之间有什么区别,哪些用例不能用另一个替代?

The more direct analogue to what DBT does is a different GCP service called Dataform .与 DBT 所做的更直接的类似是称为Dataform的不同 GCP 服务。 Both of these services can be used to execute version controlled, templated SQL queries to transform data in stages.这两种服务都可用于执行版本控制、模板化 SQL 查询以分阶段转换数据。 To use them you need to have a good understanding of your data so that you know what transformations are appropriate.要使用它们,您需要对数据有很好的了解,以便知道哪些转换是合适的。

My understanding is that DataPrep is a fully fledged data exploration and manipulation took;我的理解是 DataPrep 是一个完全成熟的数据探索和操作; it's more for working with data that you don't yet understand and transforming it for use.它更多地用于处理您尚不了解的数据并将其转换为使用。

声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.

 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM