RFR (XS): Optimize branch frequency of G1's write post-barrier in C2
manc at google.com
Fri Jun 14 05:13:52 UTC 2019
Can I have reviews for this improvement to G1's write post-barrier? More
details are on the JBS page. Highlight: it reduces CPU cost per query by 1%
for the production workload of Google's search frontend.
The RFE could be a duplicate of JDK-8130918. However, this patch does not
improve the performance of the microbenchmark in JDK-8130918, so I'm not
sure whether it fully addresses that issue.
Chuck Rasbold helped me figure out the proper fix for the basic block
ordering by looking at the CFG before and after C2's PhaseBlockLayout. The
outermost if branch (the xor check) must have a frequency greater than 0.5
for the BBs to be laid out correctly. It is also more conventional to use
than PROB_LIKELY in C2.
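
For readers unfamiliar with the barrier, here is a minimal sketch of the filter sequence that the post-barrier's nested branches implement, with the xor check as the outermost test. It is a toy model in plain C++, not C2 IR and not the patch itself: ToyHeap, Exit, post_barrier and the constants are hypothetical names and values chosen for illustration, and the StoreLoad fence of the real barrier is left out.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical constants for illustration; not taken from HotSpot.
constexpr unsigned kLogRegionBytes = 20;  // assume 1 MiB heap regions
constexpr unsigned kLogCardBytes = 9;     // assume 512-byte cards
enum Card : std::uint8_t { kDirty = 0, kYoung = 1, kClean = 2 };

// Which branch of the barrier a given store left through.
enum class Exit { SameRegion, NullStore, YoungCard, AlreadyDirty, Enqueued };

struct ToyHeap {
    std::vector<std::uint8_t> cards;       // one byte per card
    std::vector<std::size_t> dirty_queue;  // stand-in for the dirty card queue
    explicit ToyHeap(std::size_t bytes)
        : cards(bytes >> kLogCardBytes, kClean) {}
};

// Models the checks performed after storing new_val into the field at field_addr.
Exit post_barrier(ToyHeap& heap, std::uintptr_t field_addr, std::uintptr_t new_val) {
    // Outermost branch: xor the addresses and shift away the in-region bits.
    // A zero result means both addresses lie in the same region.
    if (((field_addr ^ new_val) >> kLogRegionBytes) == 0) return Exit::SameRegion;
    // Storing null creates no cross-region reference.
    if (new_val == 0) return Exit::NullStore;
    std::size_t idx = field_addr >> kLogCardBytes;
    // Cards covering young regions never need to be recorded.
    if (heap.cards[idx] == kYoung) return Exit::YoungCard;
    // (The real barrier issues a StoreLoad fence here; omitted in this model.)
    if (heap.cards[idx] == kDirty) return Exit::AlreadyDirty;
    // Slow path: dirty the card and hand it to the refinement queue.
    heap.cards[idx] = kDirty;
    heap.dirty_queue.push_back(idx);
    return Exit::Enqueued;
}
```

Each early return corresponds to one conditional branch in the compiled barrier, so the probability attached to each test determines which successor block the layout pass treats as the fall-through path.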
More information about the hotspot-gc-dev mailing list