简体繁体中英

x264 threading latency

原文 2012-07-22 11:40:03 0 3 latency/ x264

I wonder why sliceless threading ( http://akuvian.org/src/x264/sliceless_threads.txt ) in x264 leads to latency? If I have for example 2 threads the first encode one frame and the second encode one frame. The seconds have to wait for the first in some cases. But they can be encoded in parallel.

So two threads should be faster than only one, right?

3 answers

Frame-threading add latency in frames not in seconds because you need to feed encoder with more input frames before you start getting output frames (to fill pipeline). Encoding one frame itself will take about near same processor time as with one thread but threading allow pipeline process by encoding different frames parallel . From other hand sliced-threading decrease latency because all threads encode one frame parallel so it would be finished faster than encoding it with one thread (also sliced-threading don't need latency in frames for pipepining).

It took me quite a while to reason through it, but the answer is Queuing Theory.

Each frame can be started when half of the previous frame has been encoded. But if parallelization is going to provide any benefit most (preferably all) threads should have a frame to work on. 5 threads means 5 frames. That is the pipeline. Any time the pipeline is not completely full, parallelization is giving you less of a benefit. If the pipeline contains only one frame, only one thread is working and therefore you get no benefit from parallelization. But if your pipeline is usually full, what is it full of? Unencoded frames. Unencoded frames are frames that must have been captured and therefore they represent that many frames worth of latency. The latency might be slightly less by a small constant portion of a frame because some of those frames in the pipeline are partially encoded but in general each item in the pipeline contributes to the latency.

One reason for added latency with more threads is that the consecutive frames use each other for motion prediction and compensation. That means in order to compress a frame you need info from previous motion estimation details. That means the frames are dependant on each other and sometimes they have to wait for at least some data from other threads as well. This is in contrast with the slice threading when threads slicing up the frame and each one works on one slice and all on the same frame and they have all the needed info from previous frames, or next in case of B frames.

x86: latency and throughput of transcendental functions

Estimating of interrupt latency on the x86 CPUs

Bluetooth Low Energy Lag / Latency on OS X 10.11 El Capitan

Is there latency of an application?

Latency is not identified

EC2 latency, and latency in general

Transaction size and latency between: CPU and RAM, RAM and PCIE2.0 16x device

Processing Latency

Elasticsearch Latency

wrk --latency the mean of latency distribution

暂无

The technical post webpages of this site follow the CC BY-SA 4.0 protocol. If you need to reprint, please indicate the site URL or the original address.Any question please contact:yoyou2525@163.com.

Related Question x86: latency and throughput of transcendental functions Estimating of interrupt latency on the x86 CPUs Bluetooth Low Energy Lag / Latency on OS X 10.11 El Capitan Is there latency of an application? Latency is not identified EC2 latency, and latency in general Transaction size and latency between: CPU and RAM, RAM and PCIE2.0 16x device Processing Latency Elasticsearch Latency wrk --latency the mean of latency distribution

Related Tags

x264 threading latency

Question

3 answers

solution1
3 2012-07-24 11:43:10

solution2
2 2012-07-22 20:02:25

solution3
0 2019-10-08 13:35:39

x264 threading latency

Question

3 answers

solution1 3 2012-07-24 11:43:10

solution2 2 2012-07-22 20:02:25

solution3 0 2019-10-08 13:35:39

solution1
3 2012-07-24 11:43:10

solution2
2 2012-07-22 20:02:25

solution3
0 2019-10-08 13:35:39