Feature: Add knowledge distillation support #2595
Open
Knowledge distillation is one of the most effective techniques for achieving state-of-the-art results in model pre-training and fine-tuning. It has become a standard component of nearly every competitive training recipe and is effectively a prerequisite for top-tier performance.
Additionally, training with knowledge distillation significantly reduces sensitivity to hyperparameters and reliance on training tricks.
This pull request adds simple support for training with knowledge distillation on ImageNet.
The proposed implementation is straightforward, clean, and robust. It builds on the methodology from https://arxiv.org/abs/2204.03475 and the reference implementation at https://github.com/Alibaba-MIIL/Solving_ImageNet.
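
For context, a minimal sketch of the standard soft-label distillation objective that this line of work builds on is shown below (PyTorch). The names `kd_loss`, `temperature`, and `alpha` are illustrative placeholders, not the exact API introduced by this PR:

```python
import torch
import torch.nn.functional as F

def kd_loss(student_logits: torch.Tensor,
            teacher_logits: torch.Tensor,
            temperature: float = 1.0) -> torch.Tensor:
    """Soft-label distillation loss: KL divergence between the
    temperature-scaled teacher and student distributions."""
    # Teacher probabilities are fixed targets, so gradients are not propagated.
    soft_targets = F.softmax(teacher_logits.detach() / temperature, dim=-1)
    log_student = F.log_softmax(student_logits / temperature, dim=-1)
    # Scale by T^2 to keep gradient magnitudes comparable across temperatures.
    return F.kl_div(log_student, soft_targets,
                    reduction="batchmean") * temperature ** 2

# Illustrative training step: combine the usual cross-entropy loss with the
# distillation term, weighted by a hypothetical mixing coefficient `alpha`.
#
#   with torch.no_grad():
#       teacher_logits = teacher(images)   # frozen, eval-mode teacher
#   student_logits = student(images)
#   loss = F.cross_entropy(student_logits, labels) \
#          + alpha * kd_loss(student_logits, teacher_logits, temperature=3.0)
```

Keeping the teacher in eval mode under `torch.no_grad()` is what makes the approach robust and cheap: the teacher contributes soft targets only, and no additional parameters are optimized.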