Add torch 2.9 in regression tests #3311

jainapurva · 2025-11-07T19:53:51Z

Add regression test to test torchao against the latest torch 2.9.1

Minor update: Change torch from 2.7.0 -> 2.7.1

pytorch-bot · 2025-11-07T19:53:54Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/ao/3311

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit d2192b1 with merge base e2aab90 ():
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

… 16 (#3309) Summary: The underlying fbgemm conv3d kernel for float8 only supports channels_out/channels_in are both multiples of 16 so we skip the shapes that doesn't satisfy the requirements for now, we can expand the support to do padding if needed in the future Test Plan: python test/quantization/quantize_/workflows/float8/test_float8_tensor.py -k test_fp8_conv_skip_quant

…f16 at scale (#3312)

* Update [ghstack-poisoned] * Update [ghstack-poisoned] * Update [ghstack-poisoned] * Update [ghstack-poisoned] * Update [ghstack-poisoned] * Update [ghstack-poisoned] * Update [ghstack-poisoned] * Update [ghstack-poisoned] * Update [ghstack-poisoned] * Update [ghstack-poisoned]

* Remove config functions like `int4_weight_only` **Summary:** As a follow-up to #2994, this commit removes all quantization functions that were used as configs. These functions were deprecated in 0.14.0 and will be removed in the next release, 0.15.0. **Test Plan:** CI [ghstack-poisoned] * Remove old TORCH_VERSION variables **Summary:** As a follow-up to #2719, which deprecated these variables in 0.13.0, we remove them now in the next release 0.15.0. **Test Plan:** CI [ghstack-poisoned] * Update base for Update on "Remove old TORCH_VERSION variables" **Summary:** As a follow-up to #2719, which deprecated these variables in 0.13.0, we remove them now in the next release 0.15.0. **Test Plan:** CI [ghstack-poisoned] * Update base for Update on "Remove old TORCH_VERSION variables" **Summary:** As a follow-up to #2719, which deprecated these variables in 0.13.0, we remove them now in the next release 0.15.0. **Test Plan:** CI [ghstack-poisoned]

Summary: Add fp8 conv2d support, using the same conv3d kernels, by setting the D dimension to 1. 1. unsqueeze both input and weight in dim 2 ( the D dimension) 2. call fp8 conv3d op from fbgemm `torch.ops.fbgemm.f8f8bf16_conv` 3. assert D dimension shape to be 1 and call sequeeze at dim 2: res.squeeze(2) to remove the D dimension Test Plan: python test/quantization/quantize_/workflows/float8/test_float8_tensor.py -k test_unsqueeze_conv2d_weight python test/quantization/quantize_/workflows/float8/test_float8_tensor.py -k test_fp8_conv_variants

Signed-off-by: Huy Do <[email protected]>

* build common used toy linear model Co-authored-by: Jerry Zhang <[email protected]> * update model to use direct input * revert unit test skip

* Remove devtoolset install * Update regression_test.yml * Update regression_test.yml

* Adds __str__ to FqnToConfig to make printing more readable Summary: att, adds `__str__` method to `FqnToConfig` so that printing is more legible. For some config: ```python config = FqnToConfig({ "model.layers.fig.1.1": Float8DynamicActivationFloat8WeightConfig( granularity=PerRow(), ), "model.layers.fig.1.3": Float8DynamicActivationFloat8WeightConfig( granularity=PerRow(), ), "model.layers.fig.8.3": Float8DynamicActivationFloat8WeightConfig( granularity=PerRow(), ), }) ``` the output will be: ``` FqnToConfig({ 'model.layers.fig.1.1': Float8DynamicActivationFloat8WeightConfig(activation_dtype=torch.float8_e4m3fn, weight_dtype=torch.float8_e4m3fn, granularity=[PerRow(dim=-1), PerRow(dim=-1)], mm_config=Float8MMConfig(emulate=False, use_fast_accum=True, pad_inner_dim=False), activation_value_lb=None, activation_value_ub=None, kernel_preference=<KernelPreference.AUTO: 'auto'>, set_inductor_config=True, version=2), 'model.layers.fig.1.3': Float8DynamicActivationFloat8WeightConfig(activation_dtype=torch.float8_e4m3fn, weight_dtype=torch.float8_e4m3fn, granularity=[PerRow(dim=-1), PerRow(dim=-1)], mm_config=Float8MMConfig(emulate=False, use_fast_accum=True, pad_inner_dim=False), activation_value_lb=None, activation_value_ub=None, kernel_preference=<KernelPreference.AUTO: 'auto'>, set_inductor_config=True, version=2), 'model.layers.fig.8.3': Float8DynamicActivationFloat8WeightConfig(activation_dtype=torch.float8_e4m3fn, weight_dtype=torch.float8_e4m3fn, granularity=[PerRow(dim=-1), PerRow(dim=-1)], mm_config=Float8MMConfig(emulate=False, use_fast_accum=True, pad_inner_dim=False), activation_value_lb=None, activation_value_ub=None, kernel_preference=<KernelPreference.AUTO: 'auto'>, set_inductor_config=True, version=2), }) ``` also adds in a test so that you cannot specify both fqn_to_config and module_fqn_to_config unless they are both equal. Test Plan: ``` pytest test/quantization/test_quant_api.py -k test_fqn_config_module_config_and_fqn_config_both_specified ``` Reviewers: Subscribers: Tasks: Tags: * fix ruff check

Summary: att, we added this to float8_inference_roofline to reuse code but we haven't enabled the roofline feature. For now we just need the e2e speedup time for single conv2d/conv3d against bf16 to understand the speedup expecatation Also added B200 hardware spec. Test Plan: python $SCRIPT_PATH $OUTPUT_FILE \ --recipe_name $RECIPE_NAME \ --shape_gen_name $SHAPE_GEN_NAME \ --M $M --K $K --N $N \ --D $D --H $H --W $W \ --kernel_size $kernel_size \ --op_name conv3d This doesn't run yet because OSS fbgemm can't be installed in the B200 machine Reviewers: Subscribers: Tasks: Tags: Co-authored-by: jerryzh <[email protected]>

andrewor14 · 2025-11-14T15:31:52Z

torchao/quantization/pt2e/utils.py

+            aten_pattern.graph.erase_node(node)  # type: ignore[operator, union-attr]
+            # Also remove the _guards_fn module from the graph module if it exists
+            if hasattr(aten_pattern, "_guards_fn"):
+                delattr(aten_pattern, "_guards_fn")


I ran into this myself yesterday. This was resolved if I upgrade to the latest main (torch-2.10.0dev), do we still need to do this? Will CI fail without this?

Add torch2.9 in regression tests

c7ecb1e

meta-cla bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Nov 7, 2025

jainapurva added 2 commits November 12, 2025 14:03

Update torch version to 2.9.1 in regression tests

e9f94ba

Update torch version from 2.7.0 to 2.7.1

886f0a6

jainapurva added the topic: not user facing Use this tag if you don't want this PR to show up in release notes label Nov 12, 2025

jainapurva and others added 18 commits November 13, 2025 18:15

Move dyn_int8_act_int4_wei_cpu_layout to prototype/dtypes (#3299)

1a9a13f

[mxfp8 moe training][BE] add docs showing equivalent convergence to b…

02ecbb7

…f16 at scale (#3312)

Move marlin_qqq_tensor to prototype/dtypes (#3307)

e4ecec0

Pin pytest==8.4.2 (#3321)

bab6ce5

Signed-off-by: Huy Do <[email protected]>

Update common used toy linear model (#3275)

8bce9b1

* build common used toy linear model Co-authored-by: Jerry Zhang <[email protected]> * update model to use direct input * revert unit test skip

Use conda libgcc-ng 11.2 (#3327)

4a102c2

* Remove devtoolset install * Update regression_test.yml * Update regression_test.yml

Move gemlite layout to prototype/dtypes (#3313)

5c3e652

Move uintx_layout to prototype/dtypes (#3316)

7213f81

Move floatx_tensor_core_layout to prototype/dtypes (#3317)

8c37568

Use conda libgcc-ng 11.2 for nightly tests (#3326)

d7b537b

Fix tests

9ba0a3f

Merge origin/main into add_torch2.9_tests

d2192b1

jainapurva requested review from andrewor14 and jerryzh168 November 13, 2025 20:22

jainapurva marked this pull request as ready for review November 13, 2025 22:19

andrewor14 reviewed Nov 14, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add torch 2.9 in regression tests #3311

Add torch 2.9 in regression tests #3311

Uh oh!

jainapurva commented Nov 7, 2025 •

edited

Loading

Uh oh!

pytorch-bot bot commented Nov 7, 2025 •

edited

Loading

Uh oh!

andrewor14 Nov 14, 2025 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

10 participants

Add torch 2.9 in regression tests #3311

Are you sure you want to change the base?

Add torch 2.9 in regression tests #3311

Uh oh!

Conversation

jainapurva commented Nov 7, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

pytorch-bot bot commented Nov 7, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/ao/3311

✅ No Failures

Uh oh!

andrewor14 Nov 14, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

10 participants

jainapurva commented Nov 7, 2025 •

edited

Loading

pytorch-bot bot commented Nov 7, 2025 •

edited

Loading

andrewor14 Nov 14, 2025 •

edited

Loading