简体   繁体   中英

Data Pipeline Solution

We have a use-case to build data pipeline solution in which we need following things:

  1. Ability to have multiple steps (outputs from one step should feed as input to next)
  2. Ability to have multiple algorithms (SQL Query or probably invoke REST endpoint) in each step.

Input to first step can be anything. We have DW tables, but we can pre-process and keep the relevant information in AWS S3 or other data store.

Something like this: 数据管道

Is there an existing solution that already provides functionalities similar to this or can be modified to support this?

Having something in AWS would be easier to integrate.

How about AWS Glue? Sounds like a fit to your goals...

The technical post webpages of this site follow the CC BY-SA 4.0 protocol. If you need to reprint, please indicate the site URL or the original address.Any question please contact:yoyou2525@163.com.

 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM