
Conversation

@markurtz (Collaborator) commented Aug 5, 2025

Description

  • Share code between the Eagle and Eagle3 implementations through a base class to minimize redundancy (a minimal sketch of the idea follows this list)
  • Remove redundant params and source as much information as possible from the transformers config
  • Remove the current implementation for token mappings/head pruning; more work is needed to solidify the direction and implementation there
  • Minor cleanup of the eagle3 module
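For readers unfamiliar with the base-class idea above, the rough shape is a small mixin that both the Eagle and Eagle3 configs can inherit, pulling sizing information from the wrapped transformers config instead of duplicating it as constructor parameters. This is only a hedged sketch: apart from the TransformerLayerConfigMixin name, which appears in the review below, the class, attribute, and property names here are assumptions, not the actual API.

```python
# Hypothetical sketch of the shared base-class idea; everything except the
# TransformerLayerConfigMixin name is illustrative, not the repository's API.
from transformers import PretrainedConfig


class TransformerLayerConfigMixin:
    """Shared logic for deriving layer/vocab sizes from a transformers config,
    intended to be reused by both the Eagle and Eagle3 config classes."""

    transformer_layer_config: PretrainedConfig

    @property
    def hidden_size(self) -> int:
        # Source sizing info from the transformers config rather than
        # carrying it as a separate, redundant parameter.
        return self.transformer_layer_config.hidden_size

    @property
    def vocab_size(self) -> int:
        return self.transformer_layer_config.vocab_size


class Eagle3SpeculatorConfig(TransformerLayerConfigMixin):  # assumed name
    def __init__(self, transformer_layer_config: PretrainedConfig):
        self.transformer_layer_config = transformer_layer_config


# Minimal usage example with made-up sizes.
cfg = Eagle3SpeculatorConfig(PretrainedConfig(hidden_size=2048, vocab_size=32000))
assert cfg.hidden_size == 2048
```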

@Copilot Copilot AI (Contributor) left a comment

Pull Request Overview

This PR refactors the Eagle3 model implementation to reduce code duplication and streamline configuration. It introduces a shared base class for transformer layer configuration and removes the current vocabulary mapping implementation.

  • Extract shared transformer layer configuration into TransformerLayerConfigMixin base class
  • Remove redundant configuration parameters and vocabulary mapping logic from Eagle3
  • Clean up Eagle3 model initialization and forward pass implementation

Reviewed Changes

Copilot reviewed 3 out of 3 changed files in this pull request and generated 3 comments.

| File | Description |
| --- | --- |
| src/speculators/models/eagle3.py | Simplified Eagle3 configuration and model by removing vocab mapping, using mixin for shared config |
| src/speculators/models/eagle.py | Extracted transformer layer config logic into reusable TransformerLayerConfigMixin class |
| src/speculators/convert/eagle/eagle3_converter.py | Updated Eagle3 converter to remove deprecated config parameters |


github-actions bot commented Aug 5, 2025

📦 Build Artifacts Available
The build artifacts (`.whl` and `.tar.gz`) have been successfully generated and are available for download: https://github.com/neuralmagic/speculators/actions/runs/16760456544/artifacts/3694860958.
They will be retained for up to 30 days.
Commit: 653f164

@rahul-tuli rahul-tuli (Collaborator) left a comment

We should test conversion both with and without a reduced vocab, plus a dummy forward pass with known shapes, before making these changes. Because our diffs have already merged into vLLM, I'm afraid we will break `vllm serve <speculators-model>` without that testing.
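As a rough illustration of the kind of check being asked for, a dummy forward pass with known shapes might look like the sketch below. The model's forward signature, the shapes, and the helper name are assumptions here, not the repository's actual test API.

```python
# Hypothetical smoke test; the forward signature and shapes are assumptions.
import torch


def dummy_forward_smoke_test(model, hidden_size: int, vocab_size: int) -> None:
    """Run a tiny forward pass and check the output shapes are sane."""
    batch, seq_len = 2, 8
    input_ids = torch.randint(0, vocab_size, (batch, seq_len))
    # Eagle3 consumes hidden states from three verifier layers, concatenated
    # along the feature dimension (hence 3 * hidden_size here).
    verifier_hidden = torch.randn(batch, seq_len, 3 * hidden_size)

    with torch.no_grad():
        logits = model(input_ids=input_ids, hidden_states=verifier_hidden)

    # With a reduced vocab the last dimension would be the draft vocab size
    # instead; both paths should be exercised per the comment above.
    assert logits.shape == (batch, seq_len, vocab_size)
```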


self.fc = nn.Linear(
3 * self.target_hidden_size, # Use target model's hidden size
3 * self.hidden_size, # Use target model's hidden size
A Collaborator left a comment

@markurtz since the hidden states are coming from the verifier, shouldn't this be 3 * self.target_hidden_size?
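For context on the dimension question, here is a minimal shape sketch with made-up sizes. The input width is the point the comment above is making (the concatenated states come from the verifier); the output width simply mirrors the quoted snippet and is part of the same open question, so treat both as assumptions rather than the settled implementation.

```python
# Illustrative shape check only; sizes are assumed values.
import torch
import torch.nn as nn

target_hidden_size = 4096  # verifier (target model) width, assumed
hidden_size = 2048         # draft model width, assumed

# Hidden states from three verifier layers are concatenated on the feature
# dimension, so the projection's input width is 3 * target_hidden_size.
fc = nn.Linear(3 * target_hidden_size, 3 * hidden_size, bias=False)

batch, seq_len = 2, 8
verifier_states = torch.randn(batch, seq_len, 3 * target_hidden_size)
assert fc(verifier_states).shape == (batch, seq_len, 3 * hidden_size)
```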

Comment on lines -405 to -412
self.register_buffer( # type: ignore[attr-defined]
"d2t",
torch.zeros(self.draft_vocab_size, dtype=torch.long),
)
self.register_buffer( # type: ignore[attr-defined]
"t2d",
torch.zeros(self.target_vocab_size, dtype=torch.bool),
)
A Collaborator left a comment

Why remove these? Wouldn't this break the conversion of existing checkpoints? I think it would be nice to have their presence reflected in the config and to initialize these buffers based on that arg, something like `reduced_vocab: bool`. I would also argue that if this arg is True we should keep `target_vocab_size` along with `vocab_size`.
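As a rough illustration of that suggestion, conditional buffer registration might look like the following. The `reduced_vocab` flag and the constructor signature are taken from the comment above as a proposal, not from existing code.

```python
# Hedged sketch of the reviewer's suggestion; `reduced_vocab` and the
# surrounding wiring are hypothetical, not the actual implementation.
import torch
from torch import nn


class Eagle3DraftHead(nn.Module):  # assumed class name
    def __init__(
        self,
        vocab_size: int,
        target_vocab_size: int,
        reduced_vocab: bool = False,  # proposed config flag
    ):
        super().__init__()
        if reduced_vocab:
            # d2t maps draft-vocab token ids back to target-vocab ids;
            # t2d marks which target-vocab tokens exist in the draft vocab.
            self.register_buffer("d2t", torch.zeros(vocab_size, dtype=torch.long))
            self.register_buffer(
                "t2d", torch.zeros(target_vocab_size, dtype=torch.bool)
            )
```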
