Deploying a custom handler on Google Cloud Vertex AI

Question

I am trying to deploy a TorchServe instance on the Google Vertex AI platform, but according to their documentation ( https://cloud.google.com/vertex-ai/docs/predictions/custom-container-requirements#response_requirements ), the response must have the following shape:

{
  "predictions": PREDICTIONS
}

where PREDICTIONS is an array of JSON values representing the predictions that your container has generated.

Unfortunately, when I try to return such a shape from the postprocess() method of my custom handler, like this:

def postprocess(self, data):
    return {
        "predictions": data
    }

TorchServe returns:

{
  "code": 503,
  "type": "InternalServerException",
  "message": "Invalid model predict output"
}

Note that data is a list of lists, for example: [[1, 2, 1], [2, 3, 3]]. (Basically, I am generating embeddings from sentences.)

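Now, if I simply return data (rather than a Python dictionary), it works with TorchServe, but when I deploy the container on Vertex AI, it returns the following error: ModelNotFoundException. I assume Vertex AI throws this error because the returned shape does not match what is expected (see the documentation).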

Has anyone successfully deployed a TorchServe instance with a custom handler on Vertex AI?

Answer 1

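Actually, making sure that TorchServe correctly processes the input dictionary (instances) solved the issue. What is described in the article did not seem to work for me.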
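For anyone landing here, a minimal sketch of what handling the instances dictionary might look like. It assumes the request body follows the Vertex AI shape {"instances": [...]}, that TorchServe passes each request as a dict with the payload under "body" or "data", and that the TorchServe batch size is 1. As far as I know, TorchServe expects a handler to return a list with one entry per request, which would explain why returning a bare dictionary was rejected. The helper names extract_instances and wrap_predictions are hypothetical, not part of TorchServe or Vertex AI.

```python
import json


def extract_instances(requests):
    """Collect the "instances" entries from a TorchServe request batch.

    Assumption: each request is a dict whose payload sits under "body"
    or "data", either as raw bytes/str JSON or as an already parsed dict.
    """
    instances = []
    for request in requests:
        body = request.get("body") or request.get("data")
        if isinstance(body, (bytes, bytearray)):
            body = body.decode("utf-8")
        if isinstance(body, str):
            body = json.loads(body)
        instances.extend(body["instances"])
    return instances


def wrap_predictions(predictions):
    """Wrap the predictions for a single request.

    Assumption: batch size 1, so the returned list has exactly one entry,
    and that entry is the dictionary Vertex AI expects.
    """
    return [{"predictions": predictions}]
```

In a custom handler, preprocess(self, data) could call extract_instances(data) before building the model input, and postprocess(self, data) could return wrap_predictions(data) instead of the bare dictionary.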

Solution 1: accepted, 2021-10-03 16:29:02
