Feat #12: Add support for dynamic model loading with --model flag to test other VLMs #26

aditi-dsi · 2025-05-28T20:44:56Z

This PR enables dynamic loading of different VLMs based on the --model argument.

Gemma models use the native Gemma3ForConditionalGeneration class.
Non-Gemma VLMs fall back to the appropriate Hugging Face model classes dynamically.
More models and model classes can be added to model_class_map as required to extend support for testing other VLMs.
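For readers who have not opened the diff, the lookup described above could be sketched roughly as follows. This is an illustration, not the code in this PR: the prefix-based keys, the `AutoModelForImageTextToText` fallback, the default model id, and the helper names `resolve_model_class_name`, `load_model`, and `parse_args` are all assumptions.

```python
import argparse

# Hypothetical mapping from a model-id prefix to a transformers class name.
# The real model_class_map in the PR may be keyed differently.
model_class_map = {
    "google/gemma-3": "Gemma3ForConditionalGeneration",
}

# Assumed generic fallback for VLMs without an explicit entry.
DEFAULT_MODEL_CLASS = "AutoModelForImageTextToText"


def resolve_model_class_name(model_id: str) -> str:
    """Return the class name registered for model_id, or the fallback."""
    lowered = model_id.lower()
    for prefix, class_name in model_class_map.items():
        if lowered.startswith(prefix):
            return class_name
    return DEFAULT_MODEL_CLASS


def load_model(model_id: str, **kwargs):
    """Instantiate the resolved class; transformers is imported lazily."""
    import transformers  # requires a release that ships Gemma 3 support

    model_cls = getattr(transformers, resolve_model_class_name(model_id))
    return model_cls.from_pretrained(model_id, **kwargs)


def parse_args(argv=None):
    parser = argparse.ArgumentParser()
    # The default model id here is an assumption for illustration.
    parser.add_argument("--model", default="google/gemma-3-4b-pt")
    return parser.parse_args(argv)
```

Under these assumptions, `load_model(parse_args().model)` would pick the class from the flag. Matching on a prefix rather than the bare substring "gemma" keeps ids such as `google/paligemma-3b-pt-224` on the fallback path.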

Additionally, this PR includes a minor fix in train.py:

project_name now falls back to None when no project name is given; previously, a missing project name interrupted training with an error.
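A minimal sketch of that fallback, assuming the configuration is a simple object with an optional project_name attribute. The `Configuration` dataclass and the `wandb_init_kwargs` helper are hypothetical names for illustration, not the actual code in train.py.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Configuration:
    """Hypothetical stand-in for the training config object."""
    learning_rate: float = 2e-5
    project_name: Optional[str] = None


def wandb_init_kwargs(cfg) -> dict:
    """Build keyword arguments for wandb.init, tolerating a missing name."""
    # A missing attribute or an empty string both become None.
    project = getattr(cfg, "project_name", None) or None
    return {"project": project, "config": vars(cfg)}
```

The result can be passed as `wandb.init(**wandb_init_kwargs(cfg))`, which accepts `project=None`.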

aditi-dsi · 2025-05-28T20:46:05Z

@sergiopaniego this PR is ready for review.

ariG23498 · 2025-05-30T08:29:59Z

I think AutoModelForImageTextToText should cover everything. Could you give it a try?

aditi-dsi · 2025-05-31T11:09:09Z

Sure, taking a look, will let you know.

aditi-dsi added 2 commits May 29, 2025 01:55

feat(train, predict): generalize model loading to test other VLMs

ab3da50

fix(wandb.init): assign project_name only when available in config

15eb2da

aditi-dsi mentioned this pull request May 29, 2025

[Contributions Welcome] Improving Our Fine-Tuning Pipeline #12

Open

10 tasks
