
What does “max_batch_size” mean in tensorflow-serving batching_config.txt?

I'm using tensorflow-serving on GPUs with --enable-batching=true.

However, I'm a little confused by max_batch_size in batching_config.txt.

My client sends an input tensor with shape [-1, 1000] in a single gRPC request, where dim0 ranges over (0, 200]. I set max_batch_size = 100 and receive errors like:

"gRPC call return code: 3:Task size 158 is larger than maximum batch size 100"

"gRPC call return code: 3:Task size 162 is larger than maximum batch size 100"

It looks like max_batch_size limits dim0 of a single request. But since tensorflow-serving batches multiple requests into one batch, I thought it meant the maximum number of requests per batch.

Here is the description from the docs:

max_batch_size: The maximum size of any batch. This parameter governs the throughput/latency tradeoff, and also avoids having batches that are so large they exceed some resource constraint (eg GPU memory to hold a batch's data).
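For reference, the batching config file passed via --batching_parameters_file is a text-format protobuf; a minimal sketch might look like the following (the specific values here are illustrative, not a recommendation):

```
max_batch_size { value: 200 }
batch_timeout_micros { value: 1000 }
max_enqueued_batches { value: 100 }
num_batch_threads { value: 4 }
```

With max_batch_size set to 200, a single request whose first dimension is 158 would fit within one batch.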

In ML the first dimension usually represents the batch. So, based on my understanding, tensorflow-serving treats the value of the first dimension as the batch size and raises an error whenever it exceeds the allowed value. Note that max_batch_size bounds the total number of rows in a batch, not the number of requests, so a single request with dim0 = 158 already exceeds a limit of 100. You can verify this by issuing some requests where you manually keep the first dimension below 100; I expect the error to disappear.

After that, you can modify your inputs to be sent in the proper format.
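One way to do that on the client side is to split any request whose first dimension exceeds max_batch_size into several smaller requests. A minimal sketch (the function name and the chunk limit are illustrative, not part of the tensorflow-serving API):

```python
MAX_BATCH_SIZE = 100  # assumed to match max_batch_size in batching_config.txt

def split_into_chunks(rows, max_batch_size=MAX_BATCH_SIZE):
    """Yield consecutive slices of `rows`, each at most max_batch_size rows long.

    Each chunk would then be sent as its own gRPC request, so no single
    request's dim0 exceeds the server's max_batch_size.
    """
    for start in range(0, len(rows), max_batch_size):
        yield rows[start : start + max_batch_size]

# Example: 158 rows of 1000 features each -> two requests of 100 and 58 rows.
rows = [[0.0] * 1000 for _ in range(158)]
chunks = list(split_into_chunks(rows))
print([len(chunk) for chunk in chunks])  # [100, 58]
```

The server can still merge these smaller requests with others into larger batches (up to max_batch_size total rows), so splitting on the client does not necessarily cost throughput.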
