
Add vLLM x TorchAO integration workflow #2610


Open · wants to merge 18 commits into main

Conversation

@huydhn (Contributor) commented Jul 26, 2025

This adds a workflow to run vLLM x TorchAO tests with the latest vLLM main. The setup is vLLM main x PyTorch stable x FBGEMM stable x TorchAO PR/main commits.

If there is not enough H100 capacity and you observe queueing, please let me know. We might need to tweak the workflow to run on H100 only when pushing to main.
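
For reference, a minimal sketch of what such a workflow could look like; the runner label, test target, and install commands below are illustrative assumptions, not the actual workflow added in this PR:

```yaml
name: vLLM x TorchAO integration

on:
  push:
    branches:
      - main
  pull_request:

jobs:
  test:
    # Hypothetical H100 runner label
    runs-on: linux.aws.h100
    steps:
      - uses: actions/checkout@v4
      - name: Run vLLM x TorchAO tests
        run: |
          set -eux
          # vLLM main x PyTorch stable x FBGEMM stable x TorchAO at this commit
          pip install fbgemm-gpu-genai   # FBGEMM stable
          pip install .                  # build TorchAO from the checked-out commit
          pytest tests/                  # illustrative test target
```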

Testing

https://github.com/pytorch/ao/actions/runs/16535494171/job/46769051375


pytorch-bot bot commented Jul 26, 2025

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/ao/2610

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit a6b2aaa with merge base 5fe4ebd:
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@facebook-github-bot added the `CLA Signed` label (managed by the Facebook bot; authors need to sign the CLA before a PR can be reviewed) Jul 26, 2025
@huydhn added the `topic: not user facing` label (use this tag if you don't want this PR to show up in release notes) Jul 26, 2025
huydhn added 11 commits July 25, 2025 17:38
@huydhn huydhn requested a review from jerryzh168 July 26, 2025 02:38
@huydhn huydhn marked this pull request as ready for review July 26, 2025 02:40
@huydhn huydhn changed the title Add vLLM x ao integration workflow Add vLLM x TorchAO integration workflow Jul 26, 2025
@huydhn (Contributor, Author) commented Jul 26, 2025

@jerryzh168 I ran the same logic to install fbgemm-gpu-genai from nightly and build TorchAO, but running pytest currently ends up with this error loading fbgemm: https://github.com/pytorch/ao/actions/runs/16535051236/job/46767869902?pr=2610#step:7:400. Do you know what I did wrong here?

For now, I need to install fbgemm-gpu-genai from stable to avoid rebuilding it. I think we could stay with that, but let me know if that's ok. With this setup we have: vLLM main x PyTorch stable x FBGEMM stable x TorchAO PR/main commits.
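
For illustration, the stable-for-everything install amounts to a step like this (a minimal sketch; the nightly index URL follows the standard PyTorch wheel layout, and availability of the stable wheel on the default index is an assumption):

```yaml
- name: Install FBGEMM GenAI (stable)
  run: |
    set -eux
    # Stable wheel is built against PyTorch stable, so nothing needs rebuilding
    pip install fbgemm-gpu-genai
    # The nightly route (not used here) would pull in a matching PyTorch nightly:
    # pip install --pre fbgemm-gpu-genai --index-url https://download.pytorch.org/whl/nightly/cu128
```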

@huydhn (Contributor, Author) commented Jul 26, 2025

@pytorchbot drci

huydhn added 3 commits July 25, 2025 20:46
@jerryzh168 (Contributor) commented:

Current torchao main doesn't work with fbgemm-gpu-genai stable, I think. Maybe just use stable for everything, and we'll make sure the next stable torchao release works with fbgemm-gpu-genai stable.

@huydhn (Contributor, Author) commented Jul 28, 2025

> Current torchao main doesn't work with fbgemm-gpu-genai stable, I think. Maybe just use stable for everything, and we'll make sure the next stable torchao release works with fbgemm-gpu-genai stable.

Interesting, the tests pass https://github.com/pytorch/ao/actions/runs/16537610493?pr=2610, which means that torchao main is working with vLLM main. Is that what we are looking for here? I guess if we need to test torchao against fbgemm-gpu-genai, it makes more sense to have another workflow that tests only these two?

@jerryzh168 (Contributor) commented:
> Current torchao main doesn't work with fbgemm-gpu-genai stable, I think. Maybe just use stable for everything, and we'll make sure the next stable torchao release works with fbgemm-gpu-genai stable.

> Interesting, the tests pass pytorch/ao/actions/runs/16537610493?pr=2610, which means that torchao main is working with vLLM main. Is that what we are looking for here? I guess if we need to test torchao against fbgemm-gpu-genai, it makes more sense to have another workflow that tests only these two?

OK, it could be because we haven't landed everything we want in main yet, and some of our new changes would require fbgemm main.

We do have tests covering torchao + fbgemm-gpu-genai (nightly) in our CI.

```yaml
on:
  push:
    branches:
      - main
```
A reviewer (Contributor) commented:

nit: nightly might be enough I think
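
If the trigger were narrowed as suggested, a sketch of a nightly schedule (the cron time is an arbitrary choice):

```yaml
on:
  schedule:
    # Daily run; exact time is arbitrary
    - cron: "0 7 * * *"
  # Keep a manual trigger for debugging
  workflow_dispatch:
```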

```bash
set -eux

# vLLM docker image is using CUDA 12.8 and python 3.12
pip install --pre fbgemm-gpu-genai --index-url https://download.pytorch.org/whl/cu128
```
A reviewer (Contributor) commented:

fbgemm-gpu-genai nightly would require pytorch nightly, I think, and with that we'd probably need vLLM to depend on torch nightly. Is that doable with docker, or is the docker image built with stable pytorch?
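
For context, pairing the two nightlies would look roughly like the step below; whether the vLLM docker image tolerates replacing its torch install is exactly the open question raised here:

```yaml
- name: Install matching nightlies
  run: |
    set -eux
    # fbgemm-gpu-genai nightly is built against torch nightly, so install both
    # from the nightly cu128 index to keep the pair consistent
    pip install --pre torch fbgemm-gpu-genai \
      --index-url https://download.pytorch.org/whl/nightly/cu128
```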
