integrate mxfp8 dim1 cast kernel choice enum into MXLinear #2554
Conversation
🔗 Helpful Links: 🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/ao/2554
Note: Links to docs will display an error until the docs builds have been completed.
⏳ No Failures, 5 Pending as of commit b7c508c with merge base 95d13d5. (This comment was automatically generated by Dr. CI and updates every 15 minutes.)
Force-pushed d858130 to b01a184
stack-info: PR: #2554, branch: danielvegamyhre/stack/10
Force-pushed 97e9a7b to 380e887
cc @vkuzo for review of this stack. Changes are tested via the tests run in the "Test plan" section, which have been updated to cover all 3 dim1 cast kernel choices (none, triton, cuda).
Force-pushed 380e887 to 6311b94
```diff
@@ -33,6 +33,11 @@ class MXGemmKernelChoice(Enum):
     CUBLAS = "cublas"


+class MXFP8Dim1CastKernelChoice(Enum):
```
nit: name it to also support a dim0_dim1 cast if that is added in the future, e.g. MXFP8CastKernelChoice?
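For context, here is a minimal sketch of what the new enum might look like. The member names are assumptions inferred from the discussion of the three choices (none, triton, cuda); the "none" case appears to be represented by an `Optional[...]` config field set to `None` rather than by an enum member, and the actual definition in torchao may differ:

```python
from enum import Enum


class MXFP8Dim1CastKernelChoice(Enum):
    # Hypothetical members; actual values in torchao may differ.
    TRITON = "triton"  # hand-written Triton dim1 cast kernel
    CUDA = "cuda"      # hand-written CUDA dim1 cast kernel
```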
```diff
-# TODO(1945): remove this config option once torch.compile gives us
-# a fast kernel
-use_fp8_dim1_cast_triton_kernel: bool = False
+mxfp8_dim1_cast_kernel_choice: Optional[MXFP8Dim1CastKernelChoice] = (
```
How would someone use the torch.compile-generated kernel?
By setting mxfp8_dim1_cast_kernel_choice=None. Perhaps we should make it more explicit, by adding a 3rd enum option such as MXFP8CastKernelChoice.TORCH?
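The None-vs-explicit-member distinction discussed above can be sketched with a small dispatch helper. Everything below is a hypothetical stand-in rather than torchao's actual API; the stub bodies just tag which path was taken:

```python
from enum import Enum
from typing import Optional


class MXFP8Dim1CastKernelChoice(Enum):
    TRITON = "triton"
    CUDA = "cuda"


# Hypothetical stand-ins for the real cast implementations.
def _cast_dim1_torch(x):
    return ("torch.compile path", x)


def _cast_dim1_triton(x):
    return ("triton kernel", x)


def _cast_dim1_cuda(x):
    return ("cuda kernel", x)


def cast_dim1(x, choice: Optional[MXFP8Dim1CastKernelChoice]):
    # None selects the default torch.compile-generated kernel; an explicit
    # TORCH enum member would make this case self-documenting.
    if choice is None:
        return _cast_dim1_torch(x)
    if choice is MXFP8Dim1CastKernelChoice.TRITON:
        return _cast_dim1_triton(x)
    if choice is MXFP8Dim1CastKernelChoice.CUDA:
        return _cast_dim1_cuda(x)
    raise ValueError(f"unsupported kernel choice: {choice}")
```

Dispatching on an enum (rather than a boolean) keeps the `raise` branch as a safety net if a new kernel choice is added but not wired up.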
Force-pushed 6311b94 to 8a86dae
Force-pushed 8a86dae to ab8e821
Force-pushed ab8e821 to 7a48119
Force-pushed 7a48119 to 622b26d
Force-pushed 622b26d to 3efc354
Force-pushed 3efc354 to 80fc9d0
Force-pushed 583c8c5 to 2938614
Force-pushed 80fc9d0 to 5d5f087
Force-pushed 5d5f087 to b7c508c
Stacked PRs:
- integrate mxfp8 dim1 cast kernel choice enum into MXLinear

Summary
Add the MXFP8Dim1CastKernelChoice enum and replace all uses of the boolean flag use_fp8_dim1_cast_triton_kernel with it. (Default to Triton for now.)

Test plan
- pytest test/prototype/mx_formats/test_mx_linear.py -k eager_vs_hp
- pytest test/prototype/mx_formats/test_mx_linear.py -k compile

Next steps
Fix ./test/prototype/mx_formats/test_mx_dtensor.sh, which currently fails with:
RuntimeError: Attempting to use FunctionalTensor on its own. Instead, please use it with a corresponding FunctionalTensorMode()
assert res.ndim == 0, "output tensor should be scalar!"
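To illustrate the summary above, here is a self-contained sketch of the config change. The dataclass is a simplified stand-in for torchao's actual MXLinearConfig; the field name and the Triton default are taken from the PR, everything else is an assumption:

```python
from dataclasses import dataclass
from enum import Enum
from typing import Optional


class MXFP8Dim1CastKernelChoice(Enum):
    TRITON = "triton"
    CUDA = "cuda"


@dataclass
class MXLinearConfig:
    # Replaces the old boolean flag `use_fp8_dim1_cast_triton_kernel`.
    # None means "use the torch.compile-generated kernel".
    # Defaults to Triton for now, per the PR summary.
    mxfp8_dim1_cast_kernel_choice: Optional[MXFP8Dim1CastKernelChoice] = (
        MXFP8Dim1CastKernelChoice.TRITON
    )


# Example: opt in to the CUDA kernel instead of the Triton default.
cfg = MXLinearConfig(
    mxfp8_dim1_cast_kernel_choice=MXFP8Dim1CastKernelChoice.CUDA
)
```

Compared with the old boolean, the enum makes the kernel choice extensible (a third backend is a new member, not a second flag) and makes call sites self-describing.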