简体   繁体   中英

Huffman minimum variance coding

it is well known that Huffman code with minimum variance is preferable. I've digged through entire Polish/English internet and this is what I found: to build Huffman code with minimum variance you need to break ties with one of the following methods (of course probability of node is the most important):

  • Select node that represents shortest tree
  • Select node that was created earliest (consider leafs as created at start).

the problem is, that I couldn't find any proof of correctness of any of these methods. Can someone proof any of these?

I will gladly clarify anything.

Some systems have an even stronger constraint than "when there's a tie, make the choice that minimizes the maximum depth of a tree" -- they set a hard limit on the maximum depth of the tree ( length-limited, also called minimum variance Huffman coding ):

My understanding is that people have developed several heuristic algorithms for limiting Huffman codeword lengths that work OK, but the heuristics don't always give exactly the best possible compression .

Several people mention "optimal length-limited Huffman codes", and apparently there exist more than one algorithm to find them:

The technical post webpages of this site follow the CC BY-SA 4.0 protocol. If you need to reprint, please indicate the site URL or the original address.Any question please contact:yoyou2525@163.com.

 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM