OCaml 5.4 syntax support #2720

Octachron · 2025-08-08T15:19:11Z

This PR updates the vendored parsers and related types to mirror the 5.4 version of the OCaml parser and adds basic support for labeled tuples and bivariance.

The parser update means that it is now possible to locate comments between Longident.t components, and that it is always possible to distinguish between (module M:S) and ((module M):(module S)), at least for expressions.

The support for labeled expressions is a bit ad hoc and hackish in this version because of the way labeled tuple elements interact with parentheses, for instance in:

let t = ~x:(Fun.id 0), 1

and it could definitely be improved.
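For readers who have not used the new syntax yet, here is a minimal sketch of labeled tuples, including the partially labeled form quoted above. It assumes an OCaml 5.4 (or later) compiler; the names t, u, x and y are illustrative only and are not taken from the PR or its tests.

```ocaml
(* Requires OCaml >= 5.4 (labeled tuples). Illustrative names only. *)

(* First component labeled x, second unlabeled: inferred as x:int * int.
   The parentheses belong to the labeled element, not to the tuple. *)
let t = ~x:(Fun.id 0), 1

(* Punned element: ~x stands for ~x:x. *)
let x = 2
let u = ~x, ~y:3

let () =
  (* Labels are matched by name when destructuring. *)
  let (~x:a, b) = t in
  let (~x, ~y) = u in
  Printf.printf "%d %d %d %d\n" a b x y
```

A formatter has to decide, for each labeled element, whether parentheses belong to the element's expression or to the enclosing tuple, which is the interaction described above.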

- update vendored parsers to mirror upstream at 5.4:
  * introduce locations for Longident.t components
  * distinguish (module M:S) and ((module M):(module S)) for expressions
- support for new syntaxes:
  * bivariance
  * labelled tuples

Julow

Sorry for the slow reply. This is awesome :) Thanks a lot!

vendor/parser-extended/parsetree.mli

Julow · 2025-08-29T10:37:50Z

vendor/parser-extended/parser.mly

@@ -3185,10 +3316,12 @@ type_variance:
  | MINUS BANG  { [ mkvarinj "-" $loc($1); mkvarinj "!" $loc($2) ] }
  | BANG MINUS  { [ mkvarinj "!" $loc($1); mkvarinj "-" $loc($2) ] }
  | INFIXOP2
-      { if ($1 = "+!") || ($1 = "-!") then [ mkvarinj $1 $sloc ]
+      { if $1 = "+!" ||  $1 = "-!" || $1 = "+-"|| $1 = "-+"


Do you think we could accept anything starting with +, - or ! here? The standard parser will catch errors there anyway.

Octachron · 2025-09-01

From the parser's point of view, we could accept any INFIXOP2. And I agree that it is probably better to accept this when building the CST.
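To make the two options in this thread concrete, here is a minimal sketch contrasting the explicit check from the diff with the laxer rule suggested in review. The function names strict_ok and lax_ok are hypothetical and are not part of ocamlformat; the rule actually committed may differ.

```ocaml
(* Hypothetical helpers, for illustration only. *)

(* Explicit list, as in the diff above: only these four operator tokens
   are accepted as variance/injectivity annotations. *)
let strict_ok (tok : string) : bool =
  List.mem tok [ "+!"; "-!"; "+-"; "-+" ]

(* Laxer rule suggested in review: accept any token starting with '+',
   '-' or '!', and leave rejection of invalid combinations to the
   standard parser. *)
let lax_ok (tok : string) : bool =
  tok <> ""
  && (match tok.[0] with '+' | '-' | '!' -> true | _ -> false)

let () =
  List.iter
    (fun tok ->
      Printf.printf "%s strict=%b lax=%b\n" tok (strict_ok tok) (lax_ok tok))
    [ "+-"; "++"; "*+" ]
```

Under the lax rule the formatter builds a CST node for a token such as "++" even though it is not a valid annotation; reporting the error is then the standard parser's job.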

Julow · 2025-08-29T15:33:09Z

lib/Fmt_ast.ml

@@ -2822,13 +2863,36 @@ and fmt_expression c ?(box = true) ?(pro = noop) ?eol ?parens
      in
      let outer_wrap = has_attr && parens in
      let inner_wrap = has_attr || parens in
+      let with_label (lbl, exp) =


Instead of handling special cases for (~foo, ~(bar : t)), etc., do you think we could represent the concrete syntax in the parsetree?
I mean something like: type tuple_label = Tl_pun of string loc | Tl_constraint of string loc * core_type | Tl_normal of string loc

Octachron · 2025-09-01

Are there cases where people don't want to normalize ~x:x, ~y to ~x, ~y? If so, it is probably better to separate the two forms in the parsetree. Otherwise, I am not sure if this is that useful?

Julow · 2025-09-02

I think it's useful to make the formatting code simpler and less buggy. Decoding the AST was a major source of bugs in the past.
We generally do the normalization by modifying the AST: https://github.com/ocaml-ppx/ocamlformat/blob/main/lib/Extended_ast.ml#L68

I'd be happy to make the changes if you don't have the time?

Octachron · 2025-09-02

After experimenting a bit more, I agree that it feels better to not have a straight mapping of locations with punning. I will send my updated commits after some more tests.
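The idea discussed in this thread can be sketched as follows. The tuple_label type is the one proposed in the review comment; everything else (the simplified loc and core_type stand-ins, and the fmt_element and normalize functions) is a hypothetical illustration and not the code that ended up in the PR.

```ocaml
(* Stand-ins for the real parsetree types; assumptions for this sketch. *)
type 'a loc = { txt : 'a; pos : int }
type core_type = string (* a type is represented by its printed form *)

(* Concrete-syntax label, as proposed in the review comment. *)
type tuple_label =
  | Tl_pun of string loc                    (* ~x *)
  | Tl_constraint of string loc * core_type (* ~(x : t) *)
  | Tl_normal of string loc                 (* ~x:e *)

(* Printing needs no decoding: each constructor maps to one concrete
   form. [expr] is the printed payload and is only used for Tl_normal. *)
let fmt_element (label : tuple_label) (expr : string) : string =
  match label with
  | Tl_pun l -> "~" ^ l.txt
  | Tl_constraint (l, ty) -> Printf.sprintf "~(%s : %s)" l.txt ty
  | Tl_normal l -> Printf.sprintf "~%s:%s" l.txt expr

(* Normalization as an AST rewrite: ~x:x becomes the punned ~x. *)
let normalize (label : tuple_label) (expr : string) : tuple_label =
  match label with
  | Tl_normal l when String.equal l.txt expr -> Tl_pun l
  | other -> other

let () =
  let x = { txt = "x"; pos = 0 } in
  print_endline (fmt_element (normalize (Tl_normal x) "x") "x");
  print_endline (fmt_element (Tl_constraint (x, "int")) "x")
```

With this shape, the formatting function is a direct match on the constructor, and punning is decided once in a separate rewriting pass rather than rediscovered while printing.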

Octachron mentioned this pull request Jul 29, 2025

OCaml 5.4.0 release readiness ocaml/opam-repository#27916

Open

18 tasks

Octachron added 4 commits August 8, 2025 17:30

Basic OCaml 5.4 support

a07793a

- update vendored parsers to mirror upstream at 5.4:
  * introduce locations for Longident.t components
  * distinguish (module M:S) and ((module M):(module S)) for expressions
- support for new syntaxes:
  * bivariance
  * labelled tuples

Add test for labeled tuples

9d99992

Test bivariance

6deaae3

update CHANGES.md

201949e

Octachron force-pushed the OCaml_5.4_support branch from eabfcee to 201949e Compare August 8, 2025 15:49

Julow reviewed Aug 29, 2025


Octachron added 5 commits September 1, 2025 13:21

review: lax rule for variance annotation

8689441

review: test comments inside tuple types

8d418b0

review: track labels locations in tuple types

f438e4b

WIP: locations for labels

c8325a9

Concrete node for tuple element punning

7cc3534

Octachron force-pushed the OCaml_5.4_support branch from fced3c8 to 7cc3534 Compare September 3, 2025 12:24
