部署 Vertex AI 匹配引擎索引时超出配额错误

Question

我正在关注这个示例笔记本并使用了我自己的自定义词嵌入。

在此步骤部署索引时出现以下错误

r = index_endpoint_client.deploy_index(
    index_endpoint=INDEX_ENDPOINT_NAME, deployed_index=deploy_ann_index
)

ResourceExhausted：429 超出以下配额：MatchingEngineDeployedIndexNodes

我已经部署了在上一步中完成的列表端点。

ListIndexEndpointsPager<index_endpoints {
  name: "projects/xxxxxxx/locations/us-central1/indexEndpoints/xxxxxxxx"
  display_name: "index_endpoint_for_demo"
  etag: "AMEw9yPT2hoRkYEEApekieucfedTYyEb5prjgG60WL1pVUfxxxxxxxxxxx"
  create_time {
    seconds: 1645125657
    nanos: 225990000
  }
  update_time {
    seconds: 1645125657
    nanos: 794785000
  }
  network: "projects/xxxxx/global/networks/ucaip-haystack-vpc-network"
}

Answer 1

如果索引使用多个分片，则需要将可用配额除以用于 1 个副本计数的分片数来计算可用配额。 例如，如果您的索引分布在 10 个分片上并且您的配额为 50，则最多可以将 max_replica_count 设置为 5 (50/10)。

您可以通过转到 Metrics Explorer 并查找 Matching Engine 指标来检查您的索引使用了多少分片，请在此处查看更多信息。

部署 Vertex AI 匹配引擎索引时超出配额错误

问题描述

1 个解决方案

解决方案1
0 2022-04-19 20:47:24

部署 Vertex AI 匹配引擎索引时超出配额错误

问题描述

1 个解决方案

解决方案1 0 2022-04-19 20:47:24

解决方案1
0 2022-04-19 20:47:24