8372460: Use EnumMap instead of HashMap for DateTimeFormatter parsing to improve performance #28471

wenshao · 2025-11-24T08:02:35Z

This PR optimizes the parsing performance of DateTimeFormatter by replacing HashMap with EnumMap in scenarios where the keys are exclusively ChronoField enum values.

When parsing date/time strings, DateTimeFormatter creates HashMaps to store intermediate parsed values. HashMap has more overhead for operations compared to specialized map implementations.

Since ChronoField is an enum and all keys in these maps are ChronoField instances, we can use EnumMap instead, which provides better performance for enum keys due to its optimized internal structure.

Parsing scenarios show improvements from 12% to 95%

Progress

Change must not contain extraneous whitespace
Commit message must refer to an issue
Change must be properly reviewed (2 reviews required, with at least 2 Reviewers)

Issue

JDK-8372460: Use EnumMap instead of HashMap for DateTimeFormatter parsing to improve performance (Enhancement - P4)

Reviewers

Chen Liang (@liach - Reviewer)

Reviewers without OpenJDK IDs

@khanbilal732 (no known openjdk.org user name / role)

Reviewing

Using git

Checkout this PR locally:
$ git fetch https://git.openjdk.org/jdk.git pull/28471/head:pull/28471
$ git checkout pull/28471

Update a local copy of the PR:
$ git checkout pull/28471
$ git pull https://git.openjdk.org/jdk.git pull/28471/head

Using Skara CLI tools

Checkout this PR locally:
$ git pr checkout 28471

View PR using the GUI difftool:
$ git pr show -t 28471

Using diff file

Download this PR as a diff file:
https://git.openjdk.org/jdk/pull/28471.diff

Using Webrev

Link to Webrev Comment

bridgekeeper · 2025-11-24T08:04:36Z

👋 Welcome back swen! A progress list of the required criteria for merging this PR into master will be added to the body of your pull request. There are additional pull request commands available for use with this pull request.

openjdk · 2025-11-24T08:05:49Z

@wenshao This change is no longer ready for integration - check the PR body for details.

openjdk · 2025-11-24T08:06:35Z

@wenshao The following labels will be automatically applied to this pull request:

core-libs
i18n

When this pull request is ready to be reviewed, an "RFR" email will be sent to the corresponding mailing lists. If you would like to change these labels, use the /label pull request command.

wenshao · 2025-11-24T16:10:59Z

1. Shell

We run the following Shell command

# master
git checkout b6495573e9dc5470df268b63f8e7a93f38406cd2
make test TEST="micro:java.time.format.DateTimeFormatterParse"

# this pr
git checkout d8742d7514abfe0e36f105fa7310fdb1755ae546
make test TEST="micro:java.time.format.DateTimeFormatterParse"

2. Raw Benchmark Data

Performance data running on a MacBook M1 Pro:

# b649557 (master)
Benchmark                                           Mode  Cnt     Score     Error   Units
DateTimeFormatterParse.parseInstant                thrpt   15  2066.130 ± 126.134  ops/ms
DateTimeFormatterParse.parseLocalDate              thrpt   15  5014.987 ± 424.759  ops/ms
DateTimeFormatterParse.parseLocalDateTime          thrpt   15  3821.083 ± 390.928  ops/ms
DateTimeFormatterParse.parseLocalDateTimeWithNano  thrpt   15  3529.090 ± 209.195  ops/ms
DateTimeFormatterParse.parseLocalTime              thrpt   15  4275.904 ± 335.752  ops/ms
DateTimeFormatterParse.parseLocalTimeWithNano      thrpt   15  4596.255 ± 195.175  ops/ms
DateTimeFormatterParse.parseOffsetDateTime         thrpt   15  2330.924 ± 152.061  ops/ms
DateTimeFormatterParse.parseZonedDateTime          thrpt   15  1837.753 ± 107.873  ops/ms

# d8742d7 (this pr)
Benchmark                                           Mode  Cnt     Score     Error   Units
DateTimeFormatterParse.parseInstant                thrpt   15  2900.168 ±  56.079  ops/ms
DateTimeFormatterParse.parseLocalDate              thrpt   15  9787.592 ± 384.437  ops/ms
DateTimeFormatterParse.parseLocalDateTime          thrpt   15  5046.838 ± 271.451  ops/ms
DateTimeFormatterParse.parseLocalDateTimeWithNano  thrpt   15  3963.050 ± 434.662  ops/ms
DateTimeFormatterParse.parseLocalTime              thrpt   15  8196.707 ± 329.547  ops/ms
DateTimeFormatterParse.parseLocalTimeWithNano      thrpt   15  8387.213 ± 652.292  ops/ms
DateTimeFormatterParse.parseOffsetDateTime         thrpt   15  3291.076 ± 294.889  ops/ms
DateTimeFormatterParse.parseZonedDateTime          thrpt   15  2069.595 ± 293.385  ops/ms

3. Performance Comparison

Performance Comparison: b649557 vs d8742d7

Benchmark	`b649557`	`d8742d7`	Improvement Factor
DateTimeFormatterParse.parseInstant	2066.130 ± 126.134	2900.168 ± 56.079	1.404x
DateTimeFormatterParse.parseLocalDate	5014.987 ± 424.759	9787.592 ± 384.437	1.952x
DateTimeFormatterParse.parseLocalDateTime	3821.083 ± 390.928	5046.838 ± 271.451	1.321x
DateTimeFormatterParse.parseLocalDateTimeWithNano	3529.090 ± 209.195	3963.050 ± 434.662	1.123x
DateTimeFormatterParse.parseLocalTime	4275.904 ± 335.752	8196.707 ± 329.547	1.919x
DateTimeFormatterParse.parseLocalTimeWithNano	4596.255 ± 195.175	8387.213 ± 652.292	1.825x
DateTimeFormatterParse.parseOffsetDateTime	2330.924 ± 152.061	3291.076 ± 294.889	1.412x
DateTimeFormatterParse.parseZonedDateTime	1837.753 ± 107.873	2069.595 ± 293.385	1.126x

src/java.base/share/classes/java/time/format/DateTimeFormatterBuilder.java

src/java.base/share/classes/java/time/format/Parsed.java

wenshao · 2025-11-25T06:18:45Z

java/time/tck/java/time/temporal/TCKWeekFields.java
java/time/tck/java/time/temporal/TCKIsoFields.java
java/time/tck/java/time/temporal/TCKJulianFields.java
java/time/tck/java/time/format/TCKDateTimeParseResolver.java
java/time/tck/java/time/format/TCKLocalizedFieldParser.java
java/time/tck/java/time/format/TCKDateTimeFormatters.java
java/time/tck/java/time/format/TCKDateTimeFormatterBuilder.java

The existing tests above can cover the cases where there are no non-ChronoFields, so no additional tests are needed.

mlbridge · 2025-11-25T06:32:33Z

Webrevs

liach

I think instead of checking each component printer parser, we should check the public methods on DateTimeFormatterBuilder that can take a TemporalField and track the onlyChronoField there.

This is better because this is where users can actaully pass in non-ChronoField. For example, I last time discovered text printer parser, and now have discovered DefaultValueParser is problematic too.

So I believe guarding where users can pass custom TemporalField and adding a boolean field on a DateTimeFormatterBuilder to keep track of this is better.

RogerRiggs · 2025-12-01T14:12:42Z

Spreading out and duplicating the state across multiple classes isn't very satisfactory.
Since non-ChronoField is very unlikely, I'd suggest a more localized change confined to Parsed.
Always create the initial EnumMap and refactor the fieldValues.put() calls to a private utility method to catch the ClassCatchException and upgrade the map to a HashMap.
That should retain the performance improvements without any extra overhead or non-local code changes for all of the normal cases.

naotoj · 2025-12-01T17:21:35Z

Since non-ChronoField is very unlikely, I'd suggest a more localized change confined to Parsed.

+1. Never seen non-ChronoField in the wild

wenshao · 2025-12-02T01:51:33Z

Spreading out and duplicating the state across multiple classes isn't very satisfactory. Since non-ChronoField is very unlikely, I'd suggest a more localized change confined to Parsed. Always create the initial EnumMap and refactor the fieldValues.put() calls to a private utility method to catch the ClassCatchException and upgrade the map to a HashMap. That should retain the performance improvements without any extra overhead or non-local code changes for all of the normal cases.

I also plan to upgrade EnumMap to a custom ChronoFieldMap, like this: wenshao@b1cbc62 Keeping the current implementation would be easier.

If we upgrade to ChronoFieldMap, it will throw a ClassClastException not only in put, but also in other methods such as get/constainsKey, which would require too many changes.

wenshao · 2025-12-02T02:16:57Z

We should place more processing logic in the pattern parsing stage, rather than the text parsing stage.

liach

I think from your experiments, maintaining onlyChronoField is indeed way too painful. So I support updating the map in Parsed to use a custom implemented map. This should be not as risky as that map is never exposed to the public users.

src/java.base/share/classes/java/time/format/DateTimeFormatter.java

This reverts commit b2b19e1.

wenshao · 2025-12-09T13:32:29Z

As shown in the image above, the DateTimeFormatterBuilder#appendValue method does not need to call checkField; it only needs to be called within appendInternal.

liach

This looks reasonable in principle. However, we need to verify we indeed won't run into putting non-chronofield into an enum map by accident, and this is a bit hard...

/reviewers 2 reviewer

openjdk · 2025-12-16T01:59:01Z

@liach
The total number of required reviews for this PR (including the jcheck configuration and the last /reviewers command) is now set to 2 (with at least 2 Reviewers).

jdlib · 2025-12-17T09:01:09Z

src/java.base/share/classes/java/time/format/Parsed.java

     */
-    Parsed() {
+    @SuppressWarnings("unchecked")
+    Parsed(boolean onlyChronoField) {


If you know that only ChronoFields are used then imho the loop over the entries of fieldValues in method resolveFields can be skipped (line 290ff).

Good suggestion, but that should be a separate PR

jdlib · 2025-12-17T12:11:14Z

src/java.base/share/classes/java/time/format/DateTimeFormatter.java

+     * Flag indicating whether this formatter only uses ChronoField instances.
+     * This is used to optimize the storage of parsed field values in the Parsed class.
+     */
+    final boolean onlyChronoField;


If you add to DateTimePrinterParser the method:

public default boolean onlyChronoFields() { return true; }

and override in CompositePrinterParser, NumberPrinterParser, TextPrinterParser, DefaultValueParser with obvious implementations you should be able to get rid of this field, same in DateTimeFormatterBuilder. (Or keep the field, but initialize in the constructor from printerParser).

d8742d7

The initial version was similar to what you suggested. In the discussion above, I accepted liach's suggestion and modified it into the current implementation. I prefer the current implementation, and it will be easier to calculate chronoFieldsBitSet in the next step.

RogerRiggs · 2025-12-19T21:42:14Z

This version isn't well encapsulated and has changes across multiple files.
What was suggested at the beginning of December is prototyped in PR #28936.

wenshao · 2025-12-20T05:23:08Z

This version isn't well encapsulated and has changes across multiple files. What was suggested at the beginning of December is prototyped in PR #28936.

Using exceptions for logic control seems like a bad practice.
A DateTimeFormatter is reused multiple times. Our approach of calculating onlyChronoField only runs once during construction. However, using a runtime approach with try-catch ClassCastException executes the corresponding code every time parse is called. This is a choice between executing once and executing multiple times.
In the DateTimeFormatter::checkField method, we could later add an int chronoFieldBitSet field to record which ChronoFields are used. This could further optimize other methods of Parsed.

liach · 2025-12-20T15:42:52Z

I just noted that custom TemporalField implementations must be able to put any TemporalField they like into this map:

jdk/src/java.base/share/classes/java/time/temporal/TemporalField.java

Lines 364 to 379 in 2d09284

    
                * @param fieldValues  the map of fields to values, which can be updated, not null 
        
                * @param partialTemporal  the partially complete temporal to query for zone and 
        
                *  chronology; querying for other things is undefined and not recommended, not null 
        
                * @param resolverStyle  the requested type of resolve, not null 
        
                * @return the resolved temporal object; null if resolving only 
        
                *  changed the map, or no resolve occurred 
        
                * @throws ArithmeticException if numeric overflow occurs 
        
                * @throws DateTimeException if resolving results in an error. This must not be thrown 
        
                *  by querying a field on the temporal without first checking if it is supported 
        
                */ 
        
               default TemporalAccessor resolve( 
        
                       Map<TemporalField, Long> fieldValues, 
        
                       TemporalAccessor partialTemporal, 
        
                       ResolverStyle resolverStyle) { 
        
                   return null; 
        
               }

Roger's model will fail if a non-TemporalField puts into this map. This PR's model is safe because all TemporalField here will be ChronoField which won't try to do dangerous stuff to the map.

RogerRiggs

This PR has been through too many incremental changes.
I suspect a better solution is to implement a fit-for-purpose Map, optimized for ChronoFields but taking into account the possibility of unknown TemporalFields. All within the implementation of a Map<TemporalField, Long>.
I'd like to see this PR closed and take a fresh look with all that is learned by the attempt.

wenshao · 2025-12-21T03:09:08Z

This PR has been through too many incremental changes. I suspect a better solution is to implement a fit-for-purpose Map, optimized for ChronoFields but taking into account the possibility of unknown TemporalFields. All within the implementation of a Map<TemporalField, Long>. I'd like to see this PR closed and take a fresh look with all that is learned by the attempt.

I believe that tasks that can be performed during the build process should not be done during the parse process.

The process of building a DateTimeBuilder is executed once, while the parsing process is executed N times.

For example, a pattern like yyyy-MM-dd HH:mm:ss.SSS requires calling the put method of the Map 7 times during the parsing process.

Therefore, I think we should check whether chronoFieldOnly is used in DateTimeFormatterBuilder.

RogerRiggs · 2025-12-22T14:31:16Z

Given early comments about parsing, I'd expect further work to allow queries of the Map testing for the fields needed by common patterns. A specialized Map could use a bitmap/array for the ChronoFields and test for multiple fields at a time.
A specialized Map could have a putChronoField method that would bypass extra testing on the type, it would be used by the implementation in Parsed maintaining encapsulation.
There is an edge case that could be used for an custom implementation of TemporalField in which the TemporField.resolve implementation for the new custom field could put a new non-ChronoField field into the map.

liach · 2025-12-22T15:57:54Z

I think this work may be accepted for now for its immediate performance gain. Properly implementing a Map is a more complex task that is no less error prone compared to this onlyChronoField boolean tracker field.

RogerRiggs · 2025-12-22T16:10:23Z

I primarily object to the spread of state across multiple classes where it is not needed.
Accepting short term gains, just puts off final solutions and tends to muddy the implementation.

wenshao added 2 commits November 24, 2025 11:18

add benchmark

b649557

use EnumMap

d8742d7

wenshao changed the title ~~Improve DateTimeFormatter::parse performance by using EnumMap~~ Use EnumMap to improve DateTimeFormatter parse performance Nov 24, 2025

openjdk bot added core-libs [email protected] i18n [email protected] labels Nov 24, 2025

wenshao changed the title ~~Use EnumMap to improve DateTimeFormatter parse performance~~ Use EnumMap instead of HashMap for DateTimeFormatter parsing to improve performance Nov 24, 2025

liach suggested changes Nov 25, 2025

View reviewed changes

src/java.base/share/classes/java/time/format/DateTimeFormatterBuilder.java Outdated Show resolved Hide resolved

src/java.base/share/classes/java/time/format/Parsed.java Outdated Show resolved Hide resolved

bug fix, form @liach

5a050fe

wenshao changed the title ~~Use EnumMap instead of HashMap for DateTimeFormatter parsing to improve performance~~ 8372460: Use EnumMap instead of HashMap for DateTimeFormatter parsing to improve performance Nov 25, 2025

wenshao marked this pull request as ready for review November 25, 2025 06:26

openjdk bot added the rfr Pull request is ready for review label Nov 25, 2025

copyright

7137d9e

wenshao changed the title ~~8372460: Use EnumMap instead of HashMap for DateTimeFormatter parsing to improve performance~~ Use EnumMap instead of HashMap for DateTimeFormatter parsing to improve performance Nov 25, 2025

wenshao changed the title ~~Use EnumMap instead of HashMap for DateTimeFormatter parsing to improve performance~~ 8372460: Use EnumMap instead of HashMap for DateTimeFormatter parsing to improve performance Nov 25, 2025

wenshao requested a review from liach November 28, 2025 10:59

liach suggested changes Nov 30, 2025

View reviewed changes

wenshao added 2 commits December 1, 2025 08:09

from @liach

9a0ad61

bug fix

073e2b8

liach suggested changes Dec 3, 2025

View reviewed changes

src/java.base/share/classes/java/time/format/DateTimeFormatter.java Show resolved Hide resolved

src/java.base/share/classes/java/time/format/DateTimeFormatter.java Show resolved Hide resolved

resolverFields, from @liach

b2b19e1

wenshao requested a review from liach December 5, 2025 07:49

wenshao added 2 commits December 9, 2025 14:09

Revert "resolverFields, from @liach"

90dca37

This reverts commit b2b19e1.

remove redundant checkField

9a13e51

liach approved these changes Dec 16, 2025

View reviewed changes

openjdk bot added the ready Pull request is ready to be integrated label Dec 16, 2025

openjdk bot removed the ready Pull request is ready to be integrated label Dec 16, 2025

wenshao mentioned this pull request Dec 16, 2025

Speed up DateTime parse & format #28790

Draft

3 tasks

jdlib reviewed Dec 17, 2025

View reviewed changes

RogerRiggs mentioned this pull request Dec 19, 2025

8372460: Use EnumMap instead of HashMap for DateTimeFormatter parsing to improve performance #28936

Closed

3 tasks

khanbilal732 approved these changes Dec 20, 2025

View reviewed changes

RogerRiggs suggested changes Dec 20, 2025

View reviewed changes

8372460: Use EnumMap instead of HashMap for DateTimeFormatter parsing to improve performance #28471

Are you sure you want to change the base?

8372460: Use EnumMap instead of HashMap for DateTimeFormatter parsing to improve performance #28471

Conversation

wenshao commented Nov 24, 2025 • edited by openjdk bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Progress

Issue

Reviewers

Reviewers without OpenJDK IDs

Reviewing

Uh oh!

bridgekeeper bot commented Nov 24, 2025

Uh oh!

openjdk bot commented Nov 24, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

openjdk bot commented Nov 24, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

wenshao commented Nov 24, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

1. Shell

2. Raw Benchmark Data

3. Performance Comparison

Uh oh!

Uh oh!

Uh oh!

wenshao commented Nov 25, 2025

Uh oh!

mlbridge bot commented Nov 25, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Webrevs

Uh oh!

liach left a comment

Choose a reason for hiding this comment

Uh oh!

RogerRiggs commented Dec 1, 2025

Uh oh!

naotoj commented Dec 1, 2025

Uh oh!

wenshao commented Dec 2, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

wenshao commented Dec 2, 2025

Uh oh!

liach left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

wenshao commented Dec 9, 2025

Uh oh!

liach left a comment

Choose a reason for hiding this comment

Uh oh!

openjdk bot commented Dec 16, 2025

Uh oh!

jdlib Dec 17, 2025

Choose a reason for hiding this comment

Uh oh!

wenshao Dec 19, 2025

Choose a reason for hiding this comment

Uh oh!

jdlib Dec 17, 2025

Choose a reason for hiding this comment

Uh oh!

wenshao Dec 19, 2025

Choose a reason for hiding this comment

Uh oh!

RogerRiggs commented Dec 19, 2025

Uh oh!

wenshao commented Dec 20, 2025

Uh oh!

liach commented Dec 20, 2025

Uh oh!

RogerRiggs left a comment

Choose a reason for hiding this comment

Uh oh!

wenshao commented Dec 21, 2025

wenshao commented Nov 24, 2025 •

edited by openjdk bot

Loading

openjdk bot commented Nov 24, 2025 •

edited

Loading

openjdk bot commented Nov 24, 2025 •

edited

Loading

wenshao commented Nov 24, 2025 •

edited

Loading

mlbridge bot commented Nov 25, 2025 •

edited

Loading

wenshao commented Dec 2, 2025 •

edited

Loading