CR for RFR 8153998
Berg, Michael C
michael.c.berg at intel.com
Wed Apr 13 06:26:24 UTC 2016
I would like to contribute Programmable SIMD as implemented on multi-versioned post loops. See https://bugs.openjdk.java.net/browse/JDK-8151573 for the first half of the implementation.
This component delivers mask-programmed post loops that execute in a single iteration. They replace the scalar fixup loops, which previously took many iterations to complete the remaining work of a user loop.
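
To illustrate the idea, here is a minimal sketch in plain Java. It is not the C2 implementation; it only emulates in scalar code what a masked vector post loop does. The class name MaskedPostLoopSketch, the helper maskedAdd, and the width LANES = 16 (16 ints per 512-bit register) are assumptions made for the example. The main loop handles full vectors, and the leftover elements are handled by one masked step instead of a scalar loop that runs once per element.

```java
import java.util.Arrays;

public class MaskedPostLoopSketch {
    // Assumed vector width in int lanes (16 ints fit one 512-bit register).
    static final int LANES = 16;

    // Emulates one masked vector add: only lanes whose mask bit is set
    // are read and written, so disabled lanes never touch memory.
    static void maskedAdd(int[] a, int[] b, int[] c, int base, boolean[] mask) {
        for (int lane = 0; lane < LANES; lane++) {
            if (mask[lane]) {
                c[base + lane] = a[base + lane] + b[base + lane];
            }
        }
    }

    // Computes c[i] = a[i] + b[i] and returns the number of vector steps taken.
    static int add(int[] a, int[] b, int[] c) {
        int n = c.length;
        int i = 0;
        int vectorSteps = 0;

        // Main vector loop: every lane is active.
        boolean[] allLanes = new boolean[LANES];
        Arrays.fill(allLanes, true);
        for (; i + LANES <= n; i += LANES) {
            maskedAdd(a, b, c, i, allLanes);
            vectorSteps++;
        }

        // Masked post loop: one step covers the remaining (n - i) elements,
        // where a scalar fixup loop would need (n - i) iterations.
        int remaining = n - i;
        if (remaining > 0) {
            boolean[] tailMask = new boolean[LANES];
            Arrays.fill(tailMask, 0, remaining, true);
            maskedAdd(a, b, c, i, tailMask);
            vectorSteps++;
        }
        return vectorSteps;
    }
}
```

For arrays of length 37, this sketch takes two full vector steps plus one masked step for the last 5 elements, where a scalar fixup loop would take 5 further iterations.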
Currently I have enabled this optimization for x86 only, specifically for machines that implement masked data predication, that is, targets with EVEX fully enabled. It delivers up to a 2x speedup and has been modeled over a large number of loop lengths and loop forms.
This code was tested as follows (see jbs entry below):