[Accelerate] Support get_offloaded_device for models #364


Open · wants to merge 4 commits into base: main

Conversation

kylesayrs (Contributor) commented Jun 20, 2025

Purpose

  • Enable a util that may be useful when dealing with offloading of modules that are not leaf modules. For example, if we want to attach parameters to an attention module for attention quantization, we need to know the offload device of the attention module (which is not a leaf module)

Changes

  • Generalize get_offloaded_device to support nested modules
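The generalization can be pictured as a depth-first search over the module tree: a non-leaf module's offload device is taken from the first parameter found among its descendants, with a fallback when no parameters exist. The sketch below is illustrative only; `Module`, `Param`, and this `get_offloaded_device` are simplified stand-ins for the real torch/Accelerate objects and are not the actual implementation.

```python
# Simplified stand-ins (assumptions): the real code operates on
# torch.nn.Module and Accelerate's offloading machinery.

class Param:
    def __init__(self, device):
        self.device = device  # e.g. "cpu" when offloaded, "cuda:0" otherwise

class Module:
    def __init__(self, params=None, children=None):
        self._params = params or []
        self._children = children or []

    def parameters(self):
        # Yield this module's own params, then recurse into children
        # (depth-first), mirroring torch's Module.parameters().
        yield from self._params
        for child in self._children:
            yield from child.parameters()

def get_offloaded_device(module, default="cpu"):
    """Return the device of the first parameter anywhere in the module tree.

    Works for leaf modules (own params) and for non-leaf modules such as an
    attention block whose params live on submodules (e.g. q/k/v projections).
    """
    for param in module.parameters():
        return param.device
    return default  # no params anywhere in the tree: fall back

# A non-leaf "attention" module whose params sit only on child projections:
attn = Module(children=[
    Module(params=[Param("cpu")]),  # e.g. q_proj, offloaded to CPU
    Module(params=[Param("cpu")]),  # e.g. k_proj, offloaded to CPU
])
print(get_offloaded_device(attn))  # → cpu
```

With a helper like this, a new parameter attached to the attention module (say, a quantization scale) can be placed on the same offload device as the module's existing weights.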

Testing

  • Added additional tests; previous tests pass and previous behavior is preserved

Signed-off-by: Kyle Sayers <[email protected]>
@kylesayrs changed the title from "[Accelerate] Support inference of offload device for models" to "[Accelerate] Support get_offloaded_device for models" on Jun 20, 2025
@kylesayrs kylesayrs marked this pull request as ready for review July 31, 2025 14:52