简体   繁体   English

kubeflow 部署 gcp 端点 controller 失败

[英]kubeflow deploy gcp endpoints controller fails

I am trying to deploy kubeflow on gcp using official guidelines https://www.kubeflow.org/docs/distributions/gke/deploy/deploy-cli/我正在尝试使用官方指南https://www.kubeflow.org/docs/distributions/gke/deploy/deploy-cli/ 在 gcp 上部署 kubeflow

I tried three times but it seems there is a problem with endpoints controller.我尝试了 3 次,但似乎端点 controller 有问题。 When checked by: kubectl -n kubeflow get all检查时:kubectl -n kubeflow get all

All pods are running except the所有 pod 都在运行,除了

    NAME                                                             READY   STATUS             RESTARTS   AGE
pod/admission-webhook-deployment-667bd68d94                1/1     Running            
pod/cache-deployer-deployment-75ccdc98b4                   2/2     Running            
pod/cache-server-56f78bf64b                                2/2     Running            
pod/centraldashboard-5fdbd9b744                            1/1     Running            
pod/cloud-endpoints-controller-5f7dbc6fc8                  0/1     ImagePullBackOff

Pod desciption says that it failed to resolve reference "gcr.io/cloud-solutions-group/cloud-endpoints-controller:0.2.1": unexpected status code [manifests 0.2.1]: 403 Forbidden Pod deciption 说它无法解析引用“gcr.io/cloud-solutions-group/cloud-endpoints-controller:0.2.1”:意外状态代码 [manifests 0.2.1]:403 Forbidden

I am new to kubeflow but despite retrying this three times it always results in the same issue.我是 kubeflow 的新手,但尽管重试了 3 次,但它总是会导致同样的问题。

You can clone the repo and build the image yourself and push it to your container registry.您可以克隆存储库并自己构建映像并将其推送到您的容器注册表。 This is one workaround to fix this until the official image is back.这是在官方图像恢复之前解决此问题的一种解决方法。

git clone https://github.com/jlewi/cloud-endpoints-controller.git
cd cloud-endpoints-controller
git checkout 0.2.1
docker build . -t <YOUR DOCKER REGISTRY>/cloud-endpoints-controller:0.2.1
docker push <YOUR DOCKER REGISTRY>/cloud-endpoints-controller:0.2.1

And this use the new image in your pod spec.这会在您的 pod 规范中使用新图像。

Thank you for posting this issue!感谢您发布此问题! I have posted a mitigation solution here in https://github.com/kubeflow/gcp-blueprints/issues/343#issuecomment-1028488756 .我在https://github.com/kubeflow/gcp-blueprints/issues/343#issuecomment-1028488756中发布了缓解解决方案。 I am planning to fix this issue in the coming release.我计划在即将发布的版本中解决此问题。

声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.

 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM