Native wrapper optimization
changren at taobao.com
Thu Nov 17 22:39:06 PST 2011
The attached patch (a diff against hsx20) is intended to speed up
native invocation. It rearranges the compiled-to-native wrapper code
to straighten branches, which improves spatial locality. A micro
benchmark (500 million consecutive JNI invocations after warm-up)
shows that the CPU cycles stalled on instruction fetch due to L1
ICache misses decrease by 3.4% on the Intel Nehalem microarchitecture
and by 9.6% on the Core microarchitecture. The real execution time of
the micro benchmark also decreases by 5-10% across the two
microarchitectures, which reflects the improvement.
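
For readers who want to try a similar measurement, here is a minimal
sketch of a timing harness of the kind described above: a warm-up
phase followed by a timed run of consecutive calls. It is not the
benchmark used for the numbers in this post, which is not shown here.
The class name NativeCallBench, the method timeCalls, the warm-up
count, and placeholderCall are all hypothetical; placeholderCall is a
plain Java stand-in and would have to be replaced by a real `native`
method, loaded with System.loadLibrary, for the run to exercise the
compiled-to-native wrapper at all.

```java
import java.util.function.IntSupplier;

public class NativeCallBench {
    // Written once per run so the JIT cannot discard the call results.
    static volatile int sink;

    // Hypothetical stand-in for the method under test. Replace it with
    // a JNI method declared `native` to measure the native wrapper.
    static int placeholderCall() {
        return 1;
    }

    // Runs `warmup` untimed calls so the caller gets JIT-compiled, then
    // times `iterations` consecutive calls. Returns elapsed nanoseconds.
    static long timeCalls(IntSupplier call, long warmup, long iterations) {
        int acc = 0;
        for (long i = 0; i < warmup; i++) {
            acc += call.getAsInt();
        }
        long start = System.nanoTime();
        for (long i = 0; i < iterations; i++) {
            acc += call.getAsInt();
        }
        long elapsed = System.nanoTime() - start;
        sink = acc;
        return elapsed;
    }

    public static void main(String[] args) {
        long warmup = 1_000_000L;         // assumed value, not from the post
        long iterations = 500_000_000L;   // "500m" invocations, as in the post
        long ns = timeCalls(NativeCallBench::placeholderCall, warmup, iterations);
        System.out.println("elapsed ms: " + ns / 1_000_000);
    }
}
```

Wall-clock time from such a harness corresponds only to the real
execution time figure. The ICache stall numbers would need hardware
performance counters, which this sketch does not read. The IntSupplier
indirection also adds a small overhead of its own, so calling the
native method directly in the loop may give a cleaner measurement.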