[OpenVINO] Update InferRequestWrapper to collect samples taking into account state of stateful models #1505

nikita-savelyevv · 2025-10-31T12:58:59Z

What does this PR do?

Changes

Add an extra value to the input dict when wrapping the infer request of a stateful model. The value records whether the model state was reset before the corresponding sample. This feature is not part of the latest NNCF release (v2.19) and depends on the NNCF develop version.

Reason for changes

Without this change, quantization quality of stateful Whisper models is poor: during calibration, the state of a stateful model must be cleared on the same schedule as it was cleared during calibration input data collection.
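To illustrate why the reset schedule matters, here is a minimal sketch with a toy stateful model. `ToyStatefulModel` and `run` are hypothetical stand-ins and not part of OpenVINO, NNCF, or optimum-intel; the only point is that the same samples produce different outputs once the reset schedule differs from the one used during collection.

```python
class ToyStatefulModel:
    """Hypothetical stand-in for a stateful model: state accumulates across calls."""

    def __init__(self):
        self.state = 0.0

    def reset_state(self):
        self.state = 0.0

    def infer(self, x):
        self.state += x
        return self.state


def run(model, samples, reset_before):
    """Run samples in order, resetting state before sample i if reset_before[i] is True."""
    outputs = []
    for x, reset in zip(samples, reset_before):
        if reset:
            model.reset_state()
        outputs.append(model.infer(x))
    return outputs


samples = [1.0, 2.0, 3.0, 4.0]

# Schedule used while collecting data: two sequences of two steps each.
collected = run(ToyStatefulModel(), samples, [True, False, True, False])

# Replaying the same samples without any resets lets state leak across sequences.
no_resets = run(ToyStatefulModel(), samples, [False, False, False, False])

print(collected)  # [1.0, 3.0, 3.0, 7.0]
print(no_resets)  # [1.0, 3.0, 6.0, 10.0]
```

In this toy case the statistics seen in the second replay differ from the first from the third sample onward, which is the kind of mismatch that would skew calibration.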

Ticket CVS-172705

Before submitting

This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
Did you make sure to update the documentation with your changes?
Did you write any new necessary tests?

HuggingFaceDocBuilderDev · 2025-10-31T13:01:30Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

…ful OV models (#3714)

### Changes

Added `nncf.definitions.NNCF_DATASET_RESET_STATE_KEY` constant to specify when to reset model state. This constant is used by OpenVINO backend to control resetting of internal model state between model inferences. This key can be added to a dataset sample input dictionary with either `True` or `False` value. With `True` value, the model state will be reset before inference on the corresponding sample, and with `False` the state will not be reset. For an example of usage please see huggingface/optimum-intel#1505.

### Reason for changes

Without this logic static quantization quality of stateful Whisper models is poor because a state of a stateful model must be cleared with the same schedule as it is done during calibration input data collection.

### Related tickets

172705

### Tests

Added `tests/openvino/native/test_engine.py::test_stateful_model_inference_with_controlled_resetting`.
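The key semantics described above can be sketched without NNCF installed. This is an illustration only: `RESET_STATE_KEY` is a made-up placeholder for `nncf.definitions.NNCF_DATASET_RESET_STATE_KEY` (its real value is not shown in this thread), `ToyStatefulModel` and `replay` are hypothetical, and treating a missing key as "do not reset" is an assumption of the sketch rather than documented NNCF behavior.

```python
# Placeholder for nncf.definitions.NNCF_DATASET_RESET_STATE_KEY; the string is made up.
RESET_STATE_KEY = "__reset_state__"


class ToyStatefulModel:
    """Hypothetical stateful model: state accumulates the input across calls."""

    def __init__(self):
        self.state = 0.0

    def reset_state(self):
        self.state = 0.0

    def infer(self, x):
        self.state += x
        return self.state


def replay(model, dataset):
    """Run each sample, resetting state first when the sample carries a True flag."""
    outputs = []
    for sample in dataset:
        inputs = dict(sample)  # copy so the caller's sample is not mutated
        # Assumption of this sketch: a missing key means "do not reset".
        reset = inputs.pop(RESET_STATE_KEY, False)
        if reset:
            model.reset_state()
        outputs.append(model.infer(**inputs))
    return outputs


dataset = [
    {"x": 1.0, RESET_STATE_KEY: True},   # start of the first sequence
    {"x": 2.0, RESET_STATE_KEY: False},  # continues the first sequence
    {"x": 5.0, RESET_STATE_KEY: True},   # start of a second sequence
]

outputs = replay(ToyStatefulModel(), dataset)
print(outputs)  # [1.0, 3.0, 5.0]
```

The flag is removed from the sample before inference so that only real model inputs reach the model.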

Copilot

Pull request overview

This PR enhances the InferRequestWrapper class to properly handle state management for stateful OpenVINO models during quantization calibration. The changes ensure that state reset operations are tracked and communicated to NNCF's calibration process, which is critical for accurate quantization of stateful models like Whisper.

Key changes:

Added state tracking mechanism to detect and record when model state is reset
Modified input collection to include state reset information for NNCF calibration (version 2.19+)
Implemented reset_state() method wrapper to track state reset calls
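A rough sketch of the tracking idea in the list above, assuming a wrapper that remembers whether `reset_state()` was called since the last inference and stores that flag with each collected sample. This is not the actual `InferRequestWrapper` code: `TrackingWrapper`, `ToyInferRequest`, and the `RESET_STATE_KEY` placeholder value are all hypothetical names used for illustration.

```python
# Placeholder for nncf.definitions.NNCF_DATASET_RESET_STATE_KEY; the string is made up.
RESET_STATE_KEY = "__reset_state__"


class ToyInferRequest:
    """Hypothetical stand-in for an infer request of a stateful model."""

    def __init__(self):
        self.state = 0.0

    def reset_state(self):
        self.state = 0.0

    def infer(self, inputs):
        self.state += inputs["x"]
        return self.state


class TrackingWrapper:
    """Collects inputs and records whether state was reset before each sample."""

    def __init__(self, request, collected, stateful=True):
        self.request = request
        self.collected = collected
        self.stateful = stateful
        self._reset_pending = False

    def reset_state(self):
        # Remember the reset so the next collected sample can carry the flag.
        self._reset_pending = True
        self.request.reset_state()

    def infer(self, inputs):
        sample = dict(inputs)  # copy before annotating
        if self.stateful:
            sample[RESET_STATE_KEY] = self._reset_pending
            self._reset_pending = False
        self.collected.append(sample)
        return self.request.infer(inputs)


collected = []
wrapper = TrackingWrapper(ToyInferRequest(), collected)

wrapper.reset_state()
wrapper.infer({"x": 1.0})
wrapper.infer({"x": 2.0})
wrapper.reset_state()
wrapper.infer({"x": 3.0})

flags = [sample[RESET_STATE_KEY] for sample in collected]
print(flags)  # [True, False, True]
```

Clearing the pending flag after each inference keeps a single reset from being attributed to more than one sample.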

optimum/intel/openvino/quantization.py

echarlaix

LGTM!

optimum/intel/openvino/quantization.py

tests/openvino/test_quantization.py

optimum/intel/openvino/quantization.py

echarlaix

Thanks for iterating, feel free to merge!

nikita-savelyevv mentioned this pull request Oct 31, 2025

[OpenVINO] Introduce a way to control resetting of the state of stateful OV models openvinotoolkit/nncf#3714

Merged

nikita-savelyevv changed the title ~~[OpenVINO] Update InferRequestWrapper to control stateful model state resetting~~ [OpenVINO] Update InferRequestWrapper to collect samples taking into account state of stateful models Oct 31, 2025

nikita-savelyevv requested a review from Copilot December 2, 2025 14:56

Copilot AI reviewed Dec 2, 2025

optimum/intel/openvino/quantization.py (resolved)

optimum/intel/openvino/quantization.py (outdated, resolved)

optimum/intel/openvino/quantization.py (resolved)

nikita-savelyevv added the openvino-nightly Runs OpenVINO nightly and NNCF develop tests label Dec 2, 2025

nikita-savelyevv force-pushed the ns/stateful-models-quantization branch from 76a11e5 to 03ce84c Compare December 2, 2025 17:56

nikita-savelyevv added 2 commits December 2, 2025 18:58

Initial commit

03ce84c

Fix test

a9f152b

nikita-savelyevv marked this pull request as ready for review December 3, 2025 08:24

nikita-savelyevv requested review from IlyasMoutawwakil, echarlaix and rkazants December 3, 2025 08:24

echarlaix approved these changes Dec 3, 2025

optimum/intel/openvino/quantization.py (outdated, resolved)

optimum/intel/openvino/quantization.py (outdated, resolved)

tests/openvino/test_quantization.py (resolved)

optimum/intel/openvino/quantization.py (resolved)

nikita-savelyevv added 2 commits December 3, 2025 17:30

Merge branch 'main' into ns/stateful-models-quantization_

3eba817

Address suggested changes

07ee0ac

nikita-savelyevv requested a review from echarlaix December 4, 2025 16:19

echarlaix approved these changes Dec 5, 2025

Merge branch 'main' into ns/stateful-models-quantization

b839d9a

nikita-savelyevv merged commit f2fa597 into main Dec 9, 2025
38 of 39 checks passed

nikita-savelyevv deleted the ns/stateful-models-quantization branch December 9, 2025 08:36


4 participants
