Conversation

@danielealbano commented Oct 6, 2025

What does this PR do?

This PR adds support for the Blackwell architecture, related to issue #652.

As I wanted to run TEI on my 5090 I went through a few iterations and got it working, tested with Qwen3-Embedding-0.6B.

Before submitting

Yes

  • Was this discussed/approved via a GitHub issue or the forum?

Not discussed or approved, but it's a known issue and a related issue is already open at #652.

Documentation updated to mention the new compute cap.

  • Did you write any new necessary tests? If applicable, did you include or update the insta snapshots?

I have updated the only test already in place to validate the compute cap.

Who can review?

Anyone in the community is free to review the PR once the tests have passed. Feel free to tag
members/contributors who may be interested in your PR.

Member

@alvarobartt alvarobartt left a comment


Thanks for the PR, I'm just a bit concerned about bumping CUDA from 12.2 to 12.9 just to support Blackwell 🤔

@@ -1,4 +1,4 @@
-FROM nvidia/cuda:12.2.0-devel-ubuntu22.04 AS base-builder
+FROM nvidia/cuda:12.9.0-devel-ubuntu22.04 AS base-builder
Member

@alvarobartt Oct 7, 2025


Is bumping CUDA required? It might end up being a breaking change for instances running older NVIDIA CUDA versions such as 12.2, 12.4, and 12.6; besides that, everything LGTM.

Author

@danielealbano Oct 7, 2025


@alvarobartt CUDA 12.8 is required to support GPUs like the 5080 and 5090. We could potentially downgrade to 12.8 and it should still work (I can test), but I don't think it would help much.

I understand that it might be a problem, but CUDA 12.2 is two years old (July 2023) and would need to be upgraded at some point anyway.

What if CUDA 12.9 were published under a :129-1.x Docker image tag? It doesn't feel like the right solution, but it wouldn't break backward compatibility.

Member

@alvarobartt Oct 7, 2025


Hmm, fair enough. Then I think we could just create a Dockerfile-cuda-blackwell with CUDA 12.8 in the meantime, keeping the rest of the changes, adding it to the CI, and making sure we build with a different CUDA version for Blackwell; eventually, for TEI v1.9.0, we can think about bumping CUDA from 12.2 to 12.6.

In any case, given how recent Blackwell is, I guess it makes sense to keep it isolated for the moment so nothing breaks, but ideally all of those should live under the same Dockerfile in the future.
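A minimal sketch of what such a Dockerfile-cuda-blackwell could look like, pinned to CUDA 12.8 as proposed. The base-builder stage name comes from the diff above; the CUDA_COMPUTE_CAP variable and the 12.8.0 image tag are assumptions based on TEI's other CUDA Dockerfiles and the nvidia/cuda tags on Docker Hub, not the actual file:

```
# Hypothetical Dockerfile-cuda-blackwell (illustrative sketch only):
# same layout as the existing Dockerfile-cuda, but pinned to CUDA 12.8,
# the first toolkit release that knows about Blackwell GPUs.
FROM nvidia/cuda:12.8.0-devel-ubuntu22.04 AS base-builder

# Compute capability 120 = consumer Blackwell (e.g. RTX 5080/5090);
# kernels are compiled only for this architecture, so the existing
# CUDA 12.2 images stay untouched.
ENV CUDA_COMPUTE_CAP=120

# ... remaining build stages unchanged from Dockerfile-cuda ...
```

Keeping this as a separate file means CI can publish it as an experimental image while the main Dockerfile-cuda keeps its current base.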

Author

@danielealbano Oct 7, 2025


I will try to test with CUDA 12.8 to be certain there are no odd surprises; I'll need to figure out which packages to swap to downgrade the CUDA version on my test hardware.

Member


Awesome, thanks for the contribution @danielealbano! I'll try to test on my end too, and add it to the CI to make sure the Dockerfile-cuda-blackwell image is built as experimental; later on we can consider bumping CUDA in the Dockerfile-cuda and Dockerfile-cuda-all images to make sure they support all of today's architectures.
