👩‍🦯 Fix usage of VLM using text only #4080

SamuelBarryCS · 2025-09-14T05:59:52Z

What

Fixes #3957 (sft_gemma3 example doesn't work) by checking whether images are present and adapting the processor call accordingly.
Fixes the VLM config tests, which were failing because of a mismatch between config.text_config.num_hidden_layers and layer_types.
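
To illustrate the first point, here is a minimal sketch of the "check for images, then adapt the processor call" idea. It is not the code from this PR: `collate_vlm_batch` and `fake_processor` are hypothetical names, and the `text` / `images` keys and keyword arguments are assumptions made for the example.

```python
from typing import Any, Callable


def collate_vlm_batch(
    examples: list[dict[str, Any]],
    processor: Callable[..., dict[str, Any]],
) -> dict[str, Any]:
    """Call the processor with images only when the batch actually has some."""
    texts = [ex["text"] for ex in examples]
    # Use .get() so a text-only example does not raise KeyError: 'images'.
    has_images = any(ex.get("images") for ex in examples)
    if not has_images:
        return processor(text=texts)
    images = [ex.get("images") or [] for ex in examples]
    return processor(text=texts, images=images)


def fake_processor(text: list[str], images: Any = None) -> dict[str, Any]:
    """Hypothetical stand-in for a real VLM processor, for illustration only."""
    out: dict[str, Any] = {"input_ids": [[len(t)] for t in text]}
    if images is not None:
        out["pixel_values"] = images
    return out


text_only = collate_vlm_batch([{"text": "hi"}, {"text": "hello"}], fake_processor)
with_images = collate_vlm_batch([{"text": "a", "images": ["img0"]}], fake_processor)
```

With this shape, a text-only batch never touches the `images` key, which is the failure mode reported in #3957.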

How to review

Read diff

Test performed

Reproduced the issue with a small script (pushed for now; it will be deleted before merging). Before the fix, the script failed with [rank0]: KeyError: 'images', as reported in #3957 (sft_gemma3 example doesn't work). After the fix, it runs without error.
No major logic change, so the existing tests still pass.

trl/trainer/sft_trainer.py

SamuelBarryCS · 2025-09-14T06:22:32Z

test_vlm_text_only_issue.py

@@ -0,0 +1,31 @@
+"""Test for issue #3957 - VLM KeyError fix"""


Will be deleted before merging

SamuelBarryCS · 2025-09-14T06:28:02Z

cc @qgallouedec for review when you get the time.
Merci!

sergiopaniego

Thanks for the fix!
We could add a modified version of the script as a test in test_sft_trainer.py and remove the script

SamuelBarryCS · 2025-09-17T15:50:31Z

Thanks for the fix! We could add a modified version of the script as a test in test_sft_trainer.py and remove the script

Thanks for your quick reply @sergiopaniego !
Done in 6465594 :), and the test is passing.

Good to merge on my side (but it looks like we still need approval for the workflows to run)

HuggingFaceDocBuilderDev · 2025-09-17T16:21:09Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

SamuelBarryCS · 2025-09-22T17:16:02Z

@sergiopaniego do you understand why the tests are failing? It doesn't look related to my changes... :x
Otherwise, ready to merge on my side (once this test issue is solved).

qgallouedec · 2025-09-23T15:56:19Z

Hi, thanks for your contribution. I opted for a different approach: when the dataset does not contain images, it can be pre-processed, and there is no point in doing it on the fly.
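
A rough sketch of the idea described above, under stated assumptions: the merged implementation is not shown in this thread, so `prepare_rows`, `toy_tokenize`, and the `image` / `images` column names are hypothetical and only illustrate deciding once, up front, whether a dataset can be pre-processed.

```python
from typing import Any, Callable

# Assumed column names that mark a dataset as containing images.
IMAGE_COLUMNS = {"image", "images"}


def prepare_rows(
    rows: list[dict[str, Any]],
    tokenize: Callable[[str], list[int]],
) -> tuple[list[dict[str, Any]], bool]:
    """Tokenize up front when no image column exists.

    Returns the rows and a flag telling whether they were pre-processed.
    """
    columns = set().union(*(row.keys() for row in rows))
    if columns & IMAGE_COLUMNS:
        # Images present: leave rows untouched for on-the-fly processing.
        return rows, False
    processed = [{**row, "input_ids": tokenize(row["text"])} for row in rows]
    return processed, True


def toy_tokenize(text: str) -> list[int]:
    """Hypothetical tokenizer: one integer (the word length) per word."""
    return [len(word) for word in text.split()]


text_rows, text_done = prepare_rows([{"text": "hello world"}], toy_tokenize)
image_rows, image_done = prepare_rows([{"text": "a", "images": ["x"]}], toy_tokenize)
```

The point of the design is that the image check happens once per dataset rather than once per batch, so text-only data follows the ordinary pre-tokenized path.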

SamuelBarryCS · 2025-09-23T18:08:53Z

Hi, thanks for your contribution. I opted for a different approach: when the dataset does not contain images, it can be pre-processed, and there is no point in doing it on the fly.

This works as well! Thanks for the edit @qgallouedec.

SamuelBarryCS added 3 commits September 14, 2025 05:57

Fix issue

5ac5bbe

push test

a2a43c3

Push test

9bef39d

SamuelBarryCS mentioned this pull request Sep 14, 2025

sft_gemma3 example doesn't work #3957

Closed

5 tasks

SamuelBarryCS changed the title ~~[WIP] Fix usage of VLM using text only~~ Fix usage of VLM using text only Sep 14, 2025

Merge branch 'main' into fix-vlm-for-text-only-data

20f74d1

SamuelBarryCS marked this pull request as ready for review September 14, 2025 06:22

Merge branch 'main' into fix-vlm-for-text-only-data

ff6a276

sergiopaniego approved these changes Sep 17, 2025

Add test in test_sft_trainer.py

6465594

SamuelBarryCS and others added 9 commits September 23, 2025 02:04

Fix tests ?

caa8046

doc

c54f1a2

Merge branch 'main' into fix-vlm-for-text-only-data

98d998f

revert to main

9487ead

remote min

ca6770b

Merge branch 'main' into fix-vlm-for-text-only-data

df7edee

Merge branch 'main' into fix-vlm-for-text-only-data

7e041f6

revert

5e78dd2

different approach

8d52bae

Merge branch 'main' into fix-vlm-for-text-only-data

28a0560

qgallouedec changed the title ~~Fix usage of VLM using text only~~ 👩‍🦯 Fix usage of VLM using text only Sep 23, 2025

qgallouedec merged commit 9e5e60c into huggingface:main Sep 23, 2025
4 of 10 checks passed
