8358329: AArch64: emit direct branches in static stubs for small code caches #25702

Closed
wants to merge 3 commits into from

Contributor

@mikabl-arm mikabl-arm commented Jun 9, 2025

In the A64 ISA, the B (direct branch) instruction can encode a target within a ±128MB range relative to the instruction. Due to this limitation, when generating static stubs, HotSpot conservatively emits indirect branches for calls to c2i interface stubs. These indirect branches are implemented using a four-instruction sequence: three instructions to materialize the target address in a register, followed by a BR instruction to perform the jump.
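
As a rough illustration of the range limit described above, the following Python sketch encodes an A64 `B` instruction and checks reachability. The helper names `b_reachable` and `encode_b` are hypothetical and are not HotSpot APIs; only the bit layout (opcode `000101` in bits 31:26, a signed 26-bit word offset in bits 25:0) comes from the A64 encoding.

```python
INSN_BYTES = 4                          # every A64 instruction is 4 bytes
B_OPCODE = 0b000101 << 26               # bits [31:26] of an unconditional B
B_MIN_OFFSET = -(1 << 27)               # -128 MiB
B_MAX_OFFSET = (1 << 27) - INSN_BYTES   # +128 MiB minus one instruction


def b_reachable(pc: int, target: int) -> bool:
    """Return True if a B placed at `pc` can reach `target` directly."""
    offset = target - pc
    return offset % INSN_BYTES == 0 and B_MIN_OFFSET <= offset <= B_MAX_OFFSET


def encode_b(pc: int, target: int) -> int:
    """Encode `B target` for an instruction located at `pc` (hypothetical helper)."""
    if not b_reachable(pc, target):
        raise ValueError("target is outside the direct branch range")
    # imm26 holds the signed byte offset divided by 4, in two's complement.
    imm26 = ((target - pc) >> 2) & 0x3FFFFFF
    return B_OPCODE | imm26
```

For example, `hex(encode_b(0x1000, 0x1008))` gives `0x14000002`, while a target 128 MiB ahead of the instruction is rejected and would need the indirect sequence.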

This patch optimizes static stub generation when the code cache is small enough to guarantee that the target entry point of the c2i interface stub lies within the direct branch range. In such cases, a single direct B instruction can be used instead of the indirect sequence, saving 3 instructions (12 bytes) per static stub.
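
The decision described above can be sketched as follows. This is a simplified model, not the code in the patch: the function names and the idea of passing the code cache size as a plain integer are assumptions made for illustration. It relies on the observation that if the whole code cache is no larger than 128 MiB, any two instructions in it are less than 128 MiB apart, so a direct `B` always reaches.

```python
B_RANGE_BYTES = 128 * 1024 * 1024   # reach of a direct B in either direction
INSN_BYTES = 4
FAR_BRANCH_INSNS = 4                # 3 to materialize the address + BR
NEAR_BRANCH_INSNS = 1               # a single direct B


def needs_far_branch(code_cache_bytes: int) -> bool:
    """Hypothetical predicate: must the stub use the indirect sequence?"""
    return code_cache_bytes > B_RANGE_BYTES


def stub_branch_bytes(code_cache_bytes: int) -> int:
    """Size in bytes of the branch part of one static stub."""
    insns = FAR_BRANCH_INSNS if needs_far_branch(code_cache_bytes) else NEAR_BRANCH_INSNS
    return insns * INSN_BYTES


def bytes_saved_per_stub() -> int:
    """Saving when the direct branch replaces the indirect sequence."""
    return (FAR_BRANCH_INSNS - NEAR_BRANCH_INSNS) * INSN_BYTES
```

With a 64 MiB code cache, `stub_branch_bytes` returns 4; with a 240 MiB one it returns 16. The difference is the 12 bytes per stub mentioned above.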

Below is an example of the optimization's impact, measured using the movie-lens benchmark from the Renaissance benchmark suite:

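| Metric      | Before        | After         | Difference |
|-------------|---------------|---------------|------------|
| totalInHeap | Avg: 1883.875 | Avg: 1871.667 | -0.65%     |
|             | Sum: 6653848  | Sum: 6616344  | -0.56%     |
| stubCode    | Avg: 103.164  | Avg: 87.285   | -15.38%    |
|             | Sum: 364376   | Sum: 308552   | -15.33%    |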

Full jtreg passed on AArch64.


Progress

  • Change must be properly reviewed (1 review required, with at least 1 Reviewer)
  • Change must not contain extraneous whitespace
  • Commit message must refer to an issue

Issue

  • JDK-8358329: AArch64: emit direct branches in static stubs for small code caches (Enhancement - P4)

Reviewers

Reviewing

Using git

Checkout this PR locally:
$ git fetch https://git.openjdk.org/jdk.git pull/25702/head:pull/25702
$ git checkout pull/25702

Update a local copy of the PR:
$ git checkout pull/25702
$ git pull https://git.openjdk.org/jdk.git pull/25702/head

Using Skara CLI tools

Checkout this PR locally:
$ git pr checkout 25702

View PR using the GUI difftool:
$ git pr show -t 25702

Using diff file

Download this PR as a diff file:
https://git.openjdk.org/jdk/pull/25702.diff

Using Webrev

Link to Webrev Comment


bridgekeeper bot commented Jun 9, 2025

👋 Welcome back mablakatov! A progress list of the required criteria for merging this PR into master will be added to the body of your pull request. There are additional pull request commands available for use with this pull request.


openjdk bot commented Jun 9, 2025

@mikabl-arm This change now passes all automated pre-integration checks.

ℹ️ This project also has non-automated pre-integration requirements. Please see the file CONTRIBUTING.md for details.

After integration, the commit message for the final commit will be:

8358329: AArch64: emit direct branches in static stubs for small code caches

Reviewed-by: aph, eastigeevich

You can use pull request commands such as /summary, /contributor and /issue to adjust it as needed.

At the time when this comment was updated there had been 100 new commits pushed to the master branch:

As there are no conflicts, your changes will automatically be rebased on top of these commits when integrating. If you prefer to avoid this automatic rebasing, please check the documentation for the /integrate command for further details.

As you do not have Committer status in this project an existing Committer must agree to sponsor your change. Possible candidates are the reviewers of this PR (@theRealAph, @eastig) but any other Committer may sponsor as well.

➡️ To flag this PR as ready for integration with the above commit message, type /integrate in a new comment. (Afterwards, your sponsor types /sponsor in a new comment to perform the integration).

openjdk bot added the rfr label (pull request is ready for review) on Jun 9, 2025

openjdk bot commented Jun 9, 2025

@mikabl-arm The following label will be automatically applied to this pull request:

  • hotspot

When this pull request is ready to be reviewed, an "RFR" email will be sent to the corresponding mailing list. If you would like to change these labels, use the /label pull request command.


mlbridge bot commented Jun 9, 2025

Webrevs

MacroAssembler::pd_patch_instruction can distinguish between a single `b`
and the `movz movk movk br` sequence. Strictly speaking, the method
patches not a single instruction but a sequence of instructions that
together form one branch. Use it directly instead of the `NativeJump` and
`NativeGeneralJump` wrapper classes to simplify the implementation and
get rid of an extra icache invalidation.

Other changes in the patch simply clean up code that became redundant.
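
To make the idea of one patch routine handling both forms concrete, here is a minimal Python model. It is not HotSpot's `pd_patch_instruction`; the function `patch_branch`, the list-of-words representation, and the assumption that the address is materialized in the order movz, movk, movk (16 bits each, 48 bits in total) are illustrative choices. Only the A64 opcode bit patterns are taken from the architecture.

```python
B_OPCODE = 0b000101            # bits [31:26] of an unconditional B
MOV_WIDE_MASK = 0xFF800000     # sf, opc and the fixed bits of a move-wide insn
MOVZ_X = 0xD2800000            # 64-bit MOVZ with hw, imm16 and Rd cleared
IMM16_MASK = 0xFFFF << 5       # imm16 field of MOVZ/MOVK


def patch_branch(code: list[int], index: int, pc: int, target: int) -> int:
    """Retarget the branch starting at code[index], located at address pc.

    Returns the number of instruction words that were rewritten.
    """
    insn = code[index]
    if insn >> 26 == B_OPCODE:
        # Direct form: rewrite the signed 26-bit word offset.
        offset = target - pc
        if offset % 4 or not -(1 << 27) <= offset < (1 << 27):
            raise ValueError("target is outside the direct branch range")
        code[index] = (B_OPCODE << 26) | ((offset >> 2) & 0x3FFFFFF)
        return 1
    if insn & MOV_WIDE_MASK == MOVZ_X:
        # Indirect form: rewrite the three 16-bit immediates and keep the
        # shift and register fields; the trailing BR is left untouched.
        for i in range(3):
            imm16 = (target >> (16 * i)) & 0xFFFF
            code[index + i] = (code[index + i] & ~IMM16_MASK) | (imm16 << 5)
        return 3
    raise ValueError("unrecognized branch sequence")
```

A caller does not need to know which form the stub uses: the first instruction word is enough to select the patching strategy, which is the property the commit message relies on.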
Contributor

@theRealAph theRealAph left a comment

Looks good. Please fix the copyright date.

@mikabl-arm
Contributor Author

The error in java/lang/Thread/virtual/stress/GetStackTraceALotWhenBlocking.java#id0 looks similar to what has been previously reported here: https://bugs.openjdk.org/browse/JDK-8344577. @theRealAph, do you think the patch may cause the error? Or should I open a similar JBS ticket to report it?

Contributor

@theRealAph theRealAph left a comment

Thanks.

openjdk bot added the ready label (pull request is ready to be integrated) on Jun 12, 2025
@theRealAph
Contributor

> The error in java/lang/Thread/virtual/stress/GetStackTraceALotWhenBlocking.java#id0 looks similar to what has been previously reported here: https://bugs.openjdk.org/browse/JDK-8344577. @theRealAph, do you think the patch may cause the error? Or should I open a similar JBS ticket to report it?

That bug is macOS/x86. So, is the failure you're seeing repeatable?

@mikabl-arm
Contributor Author

/integrate

openjdk bot added the sponsor label (pull request is ready to be sponsored) on Jun 13, 2025

openjdk bot commented Jun 13, 2025

@mikabl-arm
Your change (at version 2eae70e) is now ready to be sponsored by a Committer.

@mikabl-arm
Contributor Author

Hey @eastig, when you have a moment, could you take a look at this as a second reviewer? I'd appreciate your feedback!

Member

@eastig eastig left a comment

LGTM

Member

eastig commented Jun 18, 2025

/sponsor


openjdk bot commented Jun 18, 2025

Going to push as commit ba32b78.
Since your change was applied there have been 114 commits pushed to the master branch:

Your commit was automatically rebased without conflicts.

openjdk bot added the integrated label (pull request has been integrated) on Jun 18, 2025
openjdk bot closed this on Jun 18, 2025
openjdk bot removed the ready, rfr, and sponsor labels on Jun 18, 2025

openjdk bot commented Jun 18, 2025

@eastig @mikabl-arm Pushed as commit ba32b78.

💡 You may see a message that your pull request was closed with unmerged commits. This can be safely ignored.

@mikabl-arm mikabl-arm requested a review from theRealAph June 18, 2025 13:16
@mikabl-arm
Contributor Author

Sorry @theRealAph, I re-requested a review by mistake. Please ignore it.

@dean-long
Member

This is causing failures in Oracle tier5 testing. See JDK-8359963.
