简体   繁体   English

私有云数据融合可以连接到互联网吗?

[英]Can a private Cloud data fusion connect to the internet?

Our application is made of a spring-bot app server deployed through "cloud run" and a "cloud sql postgres" database.我们的应用程序由一个通过“cloud run”部署的spring-bot应用服务器和一个“cloud sql postgres”数据库组成。

The database is private and connected to a private VPC .该数据库是私有的,并连接到私有 VPC
The app server can connect to the database through a gateway to this private VPC provided by the "cloud run" configuration.应用服务器可以通过“云运行”配置提供的此私有 VPC 的网关连接到数据库。

We'd like to feed this database with "cloud data fusion" (CDF) periodically.我们希望定期向这个数据库提供“云数据融合”(CDF)。 CDF should fetch data from AWS S3 and push it into our database. CDF 应该从 AWS S3 获取数据并将其推送到我们的数据库中。

We've designed and validated a pipeline for that purpose but we're facing a network paradox:我们为此目的设计并验证了一个管道,但我们正面临一个网络悖论:

  • Either CDF is public, can read from S3 over internet, but can't reach the cloud database CDF 是公共的,可以通过 Internet 从 S3 读取,但无法访问云数据库
  • or CDF is private, can reach our database but can't reach internet for S3 fetching...或者 CDF 是私有的,可以访问我们的数据库,但无法访问 Internet 进行 S3 获取...

How can CDF both write to the private database and read data from the internet ? CDF 如何既可以写入私有数据库又可以从 Internet读取数据?
I'm surprised that a CDF instance, even being private, can't establish an EGRES connection to an internet resouce.我很惊讶 CDF 实例,即使是私有的,也无法建立与 Internet 资源的 EGRES 连接。

Cloud Data fusion is a tool that help you to build pipeline (based on CDAP ). Cloud Data fusion 是一个帮助您构建管道的工具(基于CDAP )。 If you set the Data Fusion private, it's the access to the tool that is private, not the runtime, On Google Cloud, the pipeline runs on Dataproc cluster .如果您将 Data Fusion 设置为私有,则对工具的访问是私有的,而不是运行时,在 Google Cloud 上,管道在Dataproc cluster上运行。

So now, the question is: Can your Dataproc cluster reach internet and your database?所以现在的问题是:您的 Dataproc 集群能否访问互联网和您的数据库?

  1. If your cluster run in the same VPC as your Cloud SQL database private IP connection, and there is no firewall rule that prevent the communication, it's OK如果您的集群与您的云在同一 VPC 中运行 SQL 数据库私有 IP 连接,并且没有阻止通信的防火墙规则,则可以
  2. If your Compute Engines that compose your cluster have public IP, no problem, you can access to public URL.如果组成集群的 Compute Engine 具有公共 IP,没问题,您可以访问公共 URL。 Else, as said by John Hanley, you can create a Cloud NAT to allow your Compute Engine to initiate call to external URL.否则,正如 John Hanley 所说,您可以创建一个 Cloud NAT 以允许您的 Compute Engine 发起对外部 URL 的调用。

声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.

 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM