简体   繁体   English

JIT没有优化涉及Integer.MAX_VALUE的循环

[英]JIT not optimizing loop that involves Integer.MAX_VALUE

While writing an answer to another question , I noticed a strange border case for JIT optimization. 在写另一个问题的答案时,我注意到JIT优化的一个奇怪的边界情况。

The following program is not a "Microbenchmark" and not intended to reliably measure an execution time (as pointed out in the answers to the other question). 以下程序不是 “Microbenchmark”, 也不是为了可靠地测量执行时间(如另一个问题的答案中所指出的)。 It is solely intended as an MCVE to reproduce the issue: 它仅用作MCVE来重现该问题:

class MissedLoopOptimization
{
    public static void main(String args[])
    {
        for (int j=0; j<3; j++)
        {
            for (int i=0; i<5; i++)
            {
                long before = System.nanoTime();
                runWithMaxValue();
                long after = System.nanoTime();
                System.out.println("With MAX_VALUE   : "+(after-before)/1e6);
            }
            for (int i=0; i<5; i++)
            {
                long before = System.nanoTime();
                runWithMaxValueMinusOne();
                long after = System.nanoTime();
                System.out.println("With MAX_VALUE-1 : "+(after-before)/1e6);
            }
        }
    }

    private static void runWithMaxValue()
    {
        final int n = Integer.MAX_VALUE;
        int i = 0;
        while (i++ < n) {}
    }

    private static void runWithMaxValueMinusOne()
    {
        final int n = Integer.MAX_VALUE-1;
        int i = 0;
        while (i++ < n) {}
    }
}

It basically runs the same loop, while (i++ < n){} , where the limit n is once set to Integer.MAX_VALUE , and once to Integer.MAX_VALUE-1 . 它基本上运行相同的循环, while (i++ < n){} ,其中限制n一次设置为Integer.MAX_VALUE ,一次设置为Integer.MAX_VALUE-1

When executing this on Win7/64 with JDK 1.7.0_21 and 在Win7 / 64上使用JDK 1.7.0_21和。执行此操作时

java -server MissedLoopOptimization

the timing results are as follows: 时间结果如下:

...
With MAX_VALUE   : 1285.227081
With MAX_VALUE   : 1274.36311
With MAX_VALUE   : 1282.992203
With MAX_VALUE   : 1292.88246
With MAX_VALUE   : 1280.788994
With MAX_VALUE-1 : 6.96E-4
With MAX_VALUE-1 : 3.48E-4
With MAX_VALUE-1 : 0.0
With MAX_VALUE-1 : 0.0
With MAX_VALUE-1 : 3.48E-4

Obviously, for the case of MAX_VALUE-1 , the JIT does what one could expect: It detects that the loop is useless, and completely eliminates it. 显然,对于MAX_VALUE-1的情况,JIT会做出人们所期望的:它检测到循环是无用的,并完全消除它。 However, it does not remove the loop when it is running up to MAX_VALUE . 然而,当它运行到它不会删除环MAX_VALUE

This observation is confirmed by a look at the JIT assembly output when starting with 开始时查看JIT组件输出可以确认这一观察结果

java -server -XX:+UnlockDiagnosticVMOptions -XX:+TraceClassLoading -XX:+LogCompilation -XX:+PrintAssembly MissedLoopOptimization

The log contains the following assembly for the method that runs up to MAX_VALUE : 该日志包含运行最多MAX_VALUE的方法的以下程序集:

Decoding compiled method 0x000000000254fa10:
Code:
[Entry Point]
[Verified Entry Point]
[Constants]
  # {method} &apos;runWithMaxValue&apos; &apos;()V&apos; in &apos;MissedLoopOptimization&apos;
  #           [sp+0x20]  (sp of caller)
  0x000000000254fb40: sub    $0x18,%rsp
  0x000000000254fb47: mov    %rbp,0x10(%rsp)    ;*synchronization entry
                                                ; - MissedLoopOptimization::runWithMaxValue@-1 (line 29)
  0x000000000254fb4c: mov    $0x1,%r11d
  0x000000000254fb52: jmp    0x000000000254fb63
  0x000000000254fb54: nopl   0x0(%rax,%rax,1)
  0x000000000254fb5c: data32 data32 xchg %ax,%ax
  0x000000000254fb60: inc    %r11d              ; OopMap{off=35}
                                                ;*goto
                                                ; - MissedLoopOptimization::runWithMaxValue@11 (line 30)
  0x000000000254fb63: test   %eax,-0x241fb69(%rip)        # 0x0000000000130000
                                                ;*goto
                                                ; - MissedLoopOptimization::runWithMaxValue@11 (line 30)
                                                ;   {poll}
  0x000000000254fb69: cmp    $0x7fffffff,%r11d
  0x000000000254fb70: jl     0x000000000254fb60  ;*if_icmpge
                                                ; - MissedLoopOptimization::runWithMaxValue@8 (line 30)
  0x000000000254fb72: add    $0x10,%rsp
  0x000000000254fb76: pop    %rbp
  0x000000000254fb77: test   %eax,-0x241fb7d(%rip)        # 0x0000000000130000
                                                ;   {poll_return}
  0x000000000254fb7d: retq   
  0x000000000254fb7e: hlt    
  0x000000000254fb7f: hlt    
[Exception Handler]
[Stub Code]
  0x000000000254fb80: jmpq   0x000000000254e820  ;   {no_reloc}
[Deopt Handler Code]
  0x000000000254fb85: callq  0x000000000254fb8a
  0x000000000254fb8a: subq   $0x5,(%rsp)
  0x000000000254fb8f: jmpq   0x0000000002528d00  ;   {runtime_call}
  0x000000000254fb94: hlt    
  0x000000000254fb95: hlt    
  0x000000000254fb96: hlt    
  0x000000000254fb97: hlt    

One can clearly see the loop, with the comparison to 0x7fffffff and the jump back to inc . 人们可以清楚地看到循环,与0x7fffffff的比较和跳回到inc In contrast to that, the assembly for the case where it is running up to MAX_VALUE-1 : 与此相反,它运行到MAX_VALUE-1的情况下的程序集:

Decoding compiled method 0x000000000254f650:
Code:
[Entry Point]
[Verified Entry Point]
[Constants]
  # {method} &apos;runWithMaxValueMinusOne&apos; &apos;()V&apos; in &apos;MissedLoopOptimization&apos;
  #           [sp+0x20]  (sp of caller)
  0x000000000254f780: sub    $0x18,%rsp
  0x000000000254f787: mov    %rbp,0x10(%rsp)    ;*synchronization entry
                                                ; - MissedLoopOptimization::runWithMaxValueMinusOne@-1 (line 36)
  0x000000000254f78c: add    $0x10,%rsp
  0x000000000254f790: pop    %rbp
  0x000000000254f791: test   %eax,-0x241f797(%rip)        # 0x0000000000130000
                                                ;   {poll_return}
  0x000000000254f797: retq   
  0x000000000254f798: hlt    
  0x000000000254f799: hlt    
  0x000000000254f79a: hlt    
  0x000000000254f79b: hlt    
  0x000000000254f79c: hlt    
  0x000000000254f79d: hlt    
  0x000000000254f79e: hlt    
  0x000000000254f79f: hlt    
[Exception Handler]
[Stub Code]
  0x000000000254f7a0: jmpq   0x000000000254e820  ;   {no_reloc}
[Deopt Handler Code]
  0x000000000254f7a5: callq  0x000000000254f7aa
  0x000000000254f7aa: subq   $0x5,(%rsp)
  0x000000000254f7af: jmpq   0x0000000002528d00  ;   {runtime_call}
  0x000000000254f7b4: hlt    
  0x000000000254f7b5: hlt    
  0x000000000254f7b6: hlt    
  0x000000000254f7b7: hlt    

So my question is: What is so special about the Integer.MAX_VALUE that prevents the JIT from optimizing it in the same way as it does for Integer.MAX_VALUE-1 ? 所以我的问题是: Integer.MAX_VALUE有什么特别之处,它阻止JIT以与Integer.MAX_VALUE-1相同的方式对其进行优化? My guess would be that has to do with the cmp instruction, which is intended for signed arithmetic, but that alone is not really a convincing reason. 我猜这将与cmp指令有关, cmp指令用于签名算术,但仅此一点并不是一个令人信服的理由。 Can anybody explain this, and maybe even give a pointer to the OpenJDK HotSpot code where this case is treated? 任何人都可以解释这一点,甚至可能会给出一个指向这个案例的OpenJDK HotSpot代码的指针吗?

(An aside: I hope that the answer will also explain the different behavior between i++ and ++i that was asked for in the other question, assuming that the reason for the missing optimization is (obviously) actually caused by the Integer.MAX_VALUE loop limit) (旁白:我希望答案也能解释i++++i之间在另一个问题中要求的不同行为,假设缺失优化的原因(显然) 实际上是由Integer.MAX_VALUE循环引起的限制)

I have not dug up the Java Language Specification, but I'd guess that it has to do with this difference: 我没有挖出Java语言规范,但我猜它与这种差异有关:

  • i++ < (Integer.MAX_VALUE - 1) never overflows. i++ < (Integer.MAX_VALUE - 1)永不溢出。 Once i reaches Integer.MAX_VALUE - 1 it is incremented to Integer.MAX_VALUE and then the loop terminates. 一旦i到达Integer.MAX_VALUE - 1它就会增加到Integer.MAX_VALUE ,然后循环终止。

  • i++ < Integer.MAX_VALUE contains an integer overflow. i++ < Integer.MAX_VALUE包含整数溢出。 Once i reaches Integer.MAX_VALUE , it is incremented by one causing an overflow and then the loop terminates. 一旦i到达Integer.MAX_VALUE ,它会加1,导致溢出, 然后循环终止。

I assume that the JIT compiler is "reluctant" to optimize-out loops with such corner conditions - there was a whole bunch of bugs wrt loop optimization in integer overflow conditions, so that reluctance is probably quite warranted. 我假设JIT编译器“不愿意”在这种极端条件下优化输出循环 - 在整数溢出条件下有一大堆错误和循环优化,因此可能非常需要不情愿。

There may also be some hard requirement that does not allow integer overflows to be optimized-out, although I somehow doubt that since integer overflows are not directly detectable or otherwise handled in Java. 可能还有一些硬性要求不允许对整数溢出进行优化,尽管我有点怀疑,因为整数溢出不能直接检测到或以其他方式在Java中处理。

声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.

 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM