
Sync data from Amazon Aurora to Redshift

I am trying to set up a sync between AWS Aurora and Redshift. What is the best way to achieve this?

Possible approaches:

  1. Query the table for new rows (I only do inserts, so updates don't matter), export them to a flat file in an S3 bucket, and load that file into Redshift with the COPY command.

  2. Use a Python publisher with Boto3 to push changes into a Kinesis stream, then consume that stream with Firehose, which can copy directly into Redshift.

  3. Use Kinesis Agent to detect changes in the binlog (is it even possible to detect binlog changes with Kinesis Agent?), publish them to Firehose, and copy from there into Redshift.

I haven't explored AWS Data Pipeline yet.
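For illustration, options 1 and 2 above could be sketched roughly as follows. This is only a sketch under assumptions: the table `orders`, its auto-increment column `id`, the bucket, the IAM role and the stream name are all hypothetical, and the helpers only build the SQL text and the `put_record` arguments rather than talking to AWS.

```python
import json
from typing import Any, Dict


def build_extract_query(table: str, id_column: str, last_id: int) -> str:
    """Select only rows inserted since the previous run (insert-only table).

    Assumes id_column increases monotonically; table and column names are
    trusted identifiers, not user input.
    """
    return (
        f"SELECT * FROM {table} "
        f"WHERE {id_column} > {int(last_id)} ORDER BY {id_column}"
    )


def build_copy_statement(table: str, s3_uri: str, iam_role_arn: str) -> str:
    """Redshift COPY statement that loads one CSV file from S3 (option 1)."""
    return f"COPY {table} FROM '{s3_uri}' IAM_ROLE '{iam_role_arn}' FORMAT AS CSV"


def build_kinesis_record(
    stream_name: str, row: Dict[str, Any], key_column: str
) -> Dict[str, Any]:
    """Keyword arguments for the boto3 Kinesis put_record call (option 2)."""
    return {
        "StreamName": stream_name,
        # One JSON document per line, encoded as bytes.
        "Data": (json.dumps(row, sort_keys=True) + "\n").encode("utf-8"),
        "PartitionKey": str(row[key_column]),
    }


if __name__ == "__main__":
    # All names below are placeholders for illustration.
    print(build_extract_query("orders", "id", 1200))
    print(
        build_copy_statement(
            "orders",
            "s3://example-bucket/orders/batch-0001.csv",
            "arn:aws:iam::123456789012:role/example-redshift-copy",
        )
    )
    record = build_kinesis_record("example-stream", {"id": 1201, "total": 9.5}, "id")
    # With credentials configured: boto3.client("kinesis").put_record(**record)
    print(record["PartitionKey"])
```

In option 1, the highest `id` exported in each batch would be stored somewhere durable and passed back in as `last_id` on the next run.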

As pointed out by @Mark B, the AWS Database Migration Service can migrate data between databases. This can be done as a one-off exercise, or it can run continuously, keeping the two databases in sync.

The documentation shows that Amazon Aurora can be a source and Amazon Redshift can be a target.
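As a rough sketch of what a continuously running task might look like with Boto3: the helper below only assembles the arguments for the DMS `create_replication_task` call. The task name, schema name and all ARNs are hypothetical placeholders, and it assumes the source endpoint, target endpoint and replication instance already exist.

```python
import json
from typing import Any, Dict


def build_table_mappings(schema: str, table: str = "%") -> str:
    """DMS selection rule, as a JSON string, that includes matching tables."""
    rules = {
        "rules": [
            {
                "rule-type": "selection",
                "rule-id": "1",
                "rule-name": "include-source-tables",
                "object-locator": {"schema-name": schema, "table-name": table},
                "rule-action": "include",
            }
        ]
    }
    return json.dumps(rules)


def build_replication_task(
    task_id: str,
    source_endpoint_arn: str,
    target_endpoint_arn: str,
    replication_instance_arn: str,
    schema: str,
    continuous: bool = True,
) -> Dict[str, Any]:
    """Keyword arguments for boto3's dms.create_replication_task."""
    return {
        "ReplicationTaskIdentifier": task_id,
        "SourceEndpointArn": source_endpoint_arn,
        "TargetEndpointArn": target_endpoint_arn,
        "ReplicationInstanceArn": replication_instance_arn,
        # "full-load" is the one-off copy; adding CDC keeps the target in sync.
        "MigrationType": "full-load-and-cdc" if continuous else "full-load",
        "TableMappings": build_table_mappings(schema),
    }


if __name__ == "__main__":
    # Placeholder ARNs; real ones come from your own DMS resources.
    kwargs = build_replication_task(
        "aurora-to-redshift",
        "arn:aws:dms:us-east-1:123456789012:endpoint:SOURCEEXAMPLE",
        "arn:aws:dms:us-east-1:123456789012:endpoint:TARGETEXAMPLE",
        "arn:aws:dms:us-east-1:123456789012:rep:INSTANCEEXAMPLE",
        schema="shop",
    )
    # With credentials configured:
    #   boto3.client("dms").create_replication_task(**kwargs)
    print(kwargs["MigrationType"])
```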

AWS has just announced this new feature: Amazon Aurora zero-ETL integration with Amazon Redshift.

This natively provides near-real-time synchronization (on the order of seconds) from Aurora to Redshift.
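If you want to create such an integration from code rather than the console, one possible shape is below. Treat it as a sketch: it assumes the Boto3 RDS client's `create_integration` call with its `IntegrationName`, `SourceArn` and `TargetArn` parameters, and the integration name and both ARNs are made-up placeholders. Prerequisites on the Aurora cluster and the Redshift namespace are not covered here.

```python
from typing import Dict


def build_integration_request(
    name: str, aurora_cluster_arn: str, redshift_namespace_arn: str
) -> Dict[str, str]:
    """Keyword arguments for boto3's rds.create_integration (zero-ETL)."""
    for label, arn in (
        ("source", aurora_cluster_arn),
        ("target", redshift_namespace_arn),
    ):
        # Cheap sanity check only; it does not verify that the resource exists.
        if not arn.startswith("arn:"):
            raise ValueError(f"{label} must be an ARN, got {arn!r}")
    return {
        "IntegrationName": name,
        "SourceArn": aurora_cluster_arn,
        "TargetArn": redshift_namespace_arn,
    }


if __name__ == "__main__":
    # Placeholder ARNs for illustration only.
    request = build_integration_request(
        "aurora-to-redshift",
        "arn:aws:rds:us-east-1:123456789012:cluster:example-aurora",
        "arn:aws:redshift-serverless:us-east-1:123456789012:namespace/example-id",
    )
    # With credentials configured:
    #   boto3.client("rds").create_integration(**request)
    print(request["IntegrationName"])
```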
