
Maximizing SHA-1 Hash Performance in Java

I'm writing a Java library that needs to compute SHA-1 hashes. During a common task, the JVM spends about 70% of its time in sun.security.provider.SHA.implCompress, 10% in java.util.zip.Inflater.inflate, and 2% in sun.security.provider.ByteArrayAccess.b2iBig64 (according to the NetBeans profiler).

I can't seem to find the right Google search keywords to turn up relevant results, and I'm not very familiar with the SHA-1 algorithm itself. How can I get the most performance out of an SHA-1 MessageDigest? Is there a certain chunk size I should be digesting in, or multiples of certain sizes I should try?

To answer some questions you're thinking about asking:

  • Yes, I'm digesting as I read the files (MessageDigest.update), so bytes are only digested once (see the sketch after this list).
  • The SHA-1 digests are being used as checksums, usually for files that also need to be zlib-inflated.
  • No, I can't use a different hash.
  • Yes, I know zlib already uses checksums, but external requirements specify the use of SHA-1 hashes on top of that. I can't come up with a good reason why (+1 if you can) :-)
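For concreteness, here is a minimal sketch of that digest-as-you-read pattern using DigestInputStream; the class name, method, and path argument are hypothetical, and the 8192-byte buffer is an arbitrary choice:

```java
import java.io.FileInputStream;
import java.io.IOException;
import java.io.InputStream;
import java.security.DigestInputStream;
import java.security.MessageDigest;
import java.security.NoSuchAlgorithmException;

public class StreamingSha1 {
    public static byte[] sha1(String path) throws IOException, NoSuchAlgorithmException {
        MessageDigest md = MessageDigest.getInstance("SHA-1");
        // DigestInputStream updates the digest as bytes pass through,
        // so each byte is read and hashed exactly once.
        try (InputStream in = new DigestInputStream(new FileInputStream(path), md)) {
            byte[] buf = new byte[8192];
            while (in.read(buf) != -1) {
                // Discard the data; we only want the side effect of digesting.
            }
        }
        return md.digest();
    }
}
```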

Maybe you could call native code written in C. There must be plenty of super-optimized SHA-1 libraries out there.

SHA-1 has a block size of 64 bytes, so multiples of that are probably best; otherwise the implementation will need to copy partial blocks into buffers.
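To illustrate what that means for the read loop: a buffer sized as a whole number of 64-byte blocks means update() never has to carry a partial block over between calls. The helper below is a hypothetical sketch (all names are made up):

```java
import java.io.IOException;
import java.io.InputStream;
import java.security.MessageDigest;

final class BlockAlignedDigest {
    // 8192 = 128 * 64: a whole number of SHA-1 blocks per update() call,
    // so the digest never has to buffer a trailing partial block.
    private static final int BUF_SIZE = 64 * 128;

    static void digestStream(InputStream in, MessageDigest md) throws IOException {
        byte[] buf = new byte[BUF_SIZE];
        int n;
        while ((n = in.read(buf)) != -1) {
            // For file streams n is usually the full buffer, except the last chunk.
            md.update(buf, 0, n);
        }
    }
}
```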

Are you running on a multi-core computer? You could run the zlib decompression and SHA-1 hashing in separate threads, using something like java.util.concurrent.SynchronousQueue to hand off each decompressed block from one thread to the other. That way you can have one core hashing one block while another core is decompressing the next block.

(You could try one of the other BlockingQueue implementations that have some storage capacity, but I don't think it would help much. The decompression is much faster than the hashing, so the zlib thread would quickly fill up the queue and then have to wait to put each new block, just like with the SynchronousQueue.)
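A rough sketch of that hand-off, assuming the decompressed data is exposed as an InputStream (for example an InflaterInputStream); all names here are hypothetical. Note the sketch hands off 8 KB chunks rather than single 64-byte blocks, since per-block synchronization would likely cost more than it saves:

```java
import java.io.InputStream;
import java.security.MessageDigest;
import java.util.Arrays;
import java.util.concurrent.SynchronousQueue;

public class PipelinedSha1 {
    private static final byte[] EOF = new byte[0]; // sentinel marking end of stream

    public static byte[] hashPipelined(InputStream inflated) throws Exception {
        SynchronousQueue<byte[]> queue = new SynchronousQueue<>();
        MessageDigest md = MessageDigest.getInstance("SHA-1");

        // Consumer thread: takes blocks off the queue and digests them.
        Thread hasher = new Thread(() -> {
            try {
                byte[] block;
                while ((block = queue.take()) != EOF) {
                    md.update(block);
                }
            } catch (InterruptedException e) {
                Thread.currentThread().interrupt();
            }
        });
        hasher.start();

        // Producer (this thread): reads decompressed data and hands it off.
        byte[] buf = new byte[8192];
        int n;
        while ((n = inflated.read(buf)) != -1) {
            queue.put(Arrays.copyOf(buf, n)); // the queue needs its own copy
        }
        queue.put(EOF);
        hasher.join(); // join() makes the hasher's updates visible here
        return md.digest();
    }
}
```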

I know you said you've optimized I/O already, but are you using asynchronous I/O? For maximum performance you don't want to hash one block and then ask the OS to read the next block, you want to ask the OS to read the next block and then hash the one you already have while the disk is busy fetching the next one. However, the OS probably does some readahead already, so this may not make a big difference.
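One way to overlap disk and CPU explicitly is double buffering with AsynchronousFileChannel (Java 7+). This is only a sketch under that assumption, not a tuned implementation, and the 64 KB buffer size is an arbitrary choice:

```java
import java.nio.ByteBuffer;
import java.nio.channels.AsynchronousFileChannel;
import java.nio.file.Paths;
import java.nio.file.StandardOpenOption;
import java.security.MessageDigest;
import java.util.concurrent.Future;

public class OverlappedSha1 {
    public static byte[] sha1(String path) throws Exception {
        MessageDigest md = MessageDigest.getInstance("SHA-1");
        try (AsynchronousFileChannel ch =
                 AsynchronousFileChannel.open(Paths.get(path), StandardOpenOption.READ)) {
            ByteBuffer reading = ByteBuffer.allocate(64 * 1024); // buffer the channel fills
            ByteBuffer hashing = ByteBuffer.allocate(64 * 1024); // buffer we digest
            long pos = 0;
            Future<Integer> pending = ch.read(reading, pos);
            int n;
            while ((n = pending.get()) > 0) {
                pos += n;
                // Swap roles: the buffer just filled becomes the one we hash,
                // and the spare buffer becomes the target of the next read.
                ByteBuffer filled = reading;
                reading = hashing;
                hashing = filled;
                reading.clear();
                pending = ch.read(reading, pos); // disk fetches ahead...
                hashing.flip();
                md.update(hashing);              // ...while the CPU hashes
            }
        }
        return md.digest();
    }
}
```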

But beyond all that, a cryptographic hash function is a complex thing; it's just going to take time to run. Maybe you need a faster computer. :-)

Have you tried switching the file processing to a memory-mapped file? Performance for those tends to be significantly better than regular I/O and stream-based NIO.
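A minimal sketch of that approach, assuming the input is an ordinary file on disk; the 64 MB mapping window is an arbitrary choice (a single map() call is limited to 2 GB):

```java
import java.io.RandomAccessFile;
import java.nio.MappedByteBuffer;
import java.nio.channels.FileChannel;
import java.security.MessageDigest;

public class MappedSha1 {
    public static byte[] sha1(String path) throws Exception {
        MessageDigest md = MessageDigest.getInstance("SHA-1");
        try (RandomAccessFile raf = new RandomAccessFile(path, "r");
             FileChannel ch = raf.getChannel()) {
            long size = ch.size();
            long window = 64L * 1024 * 1024; // map in 64 MB windows
            for (long pos = 0; pos < size; ) {
                long len = Math.min(window, size - pos);
                MappedByteBuffer buf = ch.map(FileChannel.MapMode.READ_ONLY, pos, len);
                md.update(buf); // reads straight from the OS page cache
                pos += len;
            }
        }
        return md.digest();
    }
}
```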
