
Is there a way to get the config.pbtxt file from triton inferencing server

Recently, I came across the Triton serving config flag --strict-model-config=false, which can be set while running the inference server. This enables Triton to create a model configuration on its own while loading the model from the model repository.

sudo docker run --rm --net=host -p 8000:8000 -p 8001:8001 -p 8002:8002 \
-v /home/rajesh/custom_repository:/models nvcr.io/nvidia/tritonserver:22.06-py3 \
tritonserver --model-repository=/models --strict-model-config=false

I would like to get the generated config file from the Triton inference server, since it lets us play around with the batch config and other parameters. Is there a way to get the auto-generated config.pbtxt file for the models I have loaded in the server, so that I can tune the batch size and other parameters?

As per the reference linked below, after loading the model repository into the Triton server, the loaded model configuration can be fetched with the following curl command.

https://github.com/triton-inference-server/server/blob/main/docs/model_configuration.md#minimal-model-configuration

Command:

curl localhost:8000/v2/models/<model_name>/config

The curl command above returns the model configuration as a JSON response.
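For reference, the JSON response has roughly the following shape (the name, platform, shapes, and data types here are illustrative placeholders, not output from an actual server):

{
  "name": "mymodel",
  "platform": "onnxruntime_onnx",
  "max_batch_size": 8,
  "input": [{"name": "input__0", "data_type": "TYPE_FP32", "dims": [3, 224, 224]}],
  "output": [{"name": "output__0", "data_type": "TYPE_FP32", "dims": [1000]}]
}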

If the results should be in protobuf format, try loading the model on the Triton inference server with strict model config set to false, and fetch the configuration with the Python script below, which returns it in the required protobuf format. Use this to get the model's configuration format and edit it as needed in the config.pbtxt file, instead of converting the JSON results to protobuf.

import tritonclient.grpc as grpcclient

# Connect to the Triton server's gRPC endpoint (usually port 8001)
triton_client = grpcclient.InferenceServerClient(url="<triton_server_url>")

# Fetch the model configuration as a protobuf message
model_config = triton_client.get_model_config(model_name="<model_name>", model_version="<model_version>")
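If the goal is to write the fetched configuration out as a config.pbtxt file, the protobuf message can be serialized with google.protobuf.text_format. A minimal sketch, assuming the default protobuf response (the server URL, model name, and version below are placeholder values):

import tritonclient.grpc as grpcclient
from google.protobuf import text_format

# Placeholder server address and model identifiers
triton_client = grpcclient.InferenceServerClient(url="localhost:8001")
response = triton_client.get_model_config(model_name="mymodel", model_version="1")

# The response wraps the actual ModelConfig message in its `config` field;
# text_format renders it in the same textual format that config.pbtxt uses
with open("config.pbtxt", "w") as f:
    f.write(text_format.MessageToString(response.config))

The written file can then be edited, for example to change max_batch_size, and placed back in the model repository.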
