
Conversation


@shanjiaz shanjiaz commented Sep 29, 2025

Changes:

  • Added support for the optional arguments `eagle_aux_hidden_state_layer_ids` and `inference_type`.
  • Added more robust logic for `target_vocab_size`. We default to using the length of `t2d`; if that is not available, we load the verifier model's config file and recursively search the dict for `vocab_size`. The recursion is needed for nested configs, e.g. `target_config_dict["text_config"]["vocab_size"]` (see the sketch below).
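For reference, a minimal sketch of the recursive `vocab_size` lookup described above. The helper name `find_vocab_size` and the example values are illustrative assumptions, not the exact implementation in this PR:

```python
from typing import Any, Optional


def find_vocab_size(config: dict[str, Any]) -> Optional[int]:
    """Recursively search a (possibly nested) config dict for a 'vocab_size' key."""
    if "vocab_size" in config:
        return config["vocab_size"]
    for value in config.values():
        if isinstance(value, dict):
            found = find_vocab_size(value)
            if found is not None:
                return found
    return None


# Example: some verifier configs nest the value under "text_config".
target_config_dict = {"text_config": {"vocab_size": 202048}}  # illustrative value
assert find_vocab_size(target_config_dict) == 202048
```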

Command used:

speculators convert nvidia/Llama-4-Maverick-17B-128E-Eagle3 \
  --algorithm eagle3 \
  --verifier RedHatAI/Llama-4-Maverick-17B-128E-Instruct-quantized.w4a16 \
  --output-path Llama4-Maverick-Eagle3-Speculators \
  --validate-device cuda:0 \
  --algorithm-kwargs '{"eagle_aux_hidden_state_layer_ids": [1,23,44], "inference_type": "text"}'

Converted checkpoint:

shanjiaz/Llama4-Maverick-Eagle3-Speculators-converted


github-actions bot commented Sep 29, 2025

📦 Build Artifacts Available
The build artifacts (`.whl` and `.tar.gz`) have been successfully generated and are available for download: https://github.com/vllm-project/speculators/actions/runs/18228590055/artifacts/4177299103.
They will be retained for up to 30 days.
Commit: c4cfbb6

@shanjiaz shanjiaz marked this pull request as ready for review October 1, 2025 01:28
Signed-off-by: shanjiaz <[email protected]>
@shanjiaz shanjiaz requested a review from rahul-tuli October 3, 2025 17:01