关于执行 Java HashMap

Question

为什么容量必须是倍数或2？ 为什么在 indexFor 函数中使用“&”？ 为什么要重新计算 hash function 中的 hash 而不是直接使用密钥的 hash 编码？

我认为这个实现和“算法简介”中的描述有一些重要的区别。

“>>>”是什么意思？

static int hash(int h) {
        // This function ensures that hashCodes that differ only by
        // constant multiples at each bit position have a bounded
        // number of collisions (approximately 8 at default load factor).
        h ^= (h >>> 20) ^ (h >>> 12);
        return h ^ (h >>> 7) ^ (h >>> 4);
}

谁能给我一些指导？ 我很感激如果有人能解释 hash 算法。 非常感谢！

Answer 1

这是一个性能优化。 map hash 代码到表索引的通常方法是

table_index = hash_code % table_length;

%运算符很昂贵。 如果table_length是2的幂，那么计算：

table_index = hash_code & (table_length - 1);

相当于（多）更昂贵的模运算。

Answer 2

不要注意窗帘后面的那个人。

实际的算法无疑是开发人员“感觉良好”的组合，对一些奇怪的退化情况的修复，以及简单的传统（用户经常为此开发模糊的依赖关系）。

并注意这一点：

 * Applies a supplemental hash function to a given hashCode, which * defends against poor quality hash functions. This is critical * because HashMap uses power-of-two length hash tables, that * otherwise encounter collisions for hashCodes that do not differ * in lower bits. Note: Null keys always map to hash 0, thus index 0.

网：只要能用，性能好，你无所谓。

关于执行 Java HashMap

问题描述

2 个解决方案

解决方案1
5 已采纳 2012-04-03 21:34:28

解决方案2
0 2012-04-03 21:46:02

关于执行 Java HashMap

问题描述

2 个解决方案

解决方案1 5 已采纳 2012-04-03 21:34:28

解决方案2 0 2012-04-03 21:46:02

解决方案1
5 已采纳 2012-04-03 21:34:28

解决方案2
0 2012-04-03 21:46:02