[WIP] Eagle3 Training Implementation #143

fynnsu · 2025-10-03T16:16:57Z

WORK IN PROGRESS

Todos/Missing:

Signed-off-by: Fynn Schmitt-Ulms <[email protected]>

github-actions · 2025-10-03T16:19:38Z

📦 Build Artifacts Available
The build artifacts (`.whl` and `.tar.gz`) have been successfully generated and are available for download: https://github.com/vllm-project/speculators/actions/runs/18233863608/artifacts/4179292557.
They will be retained for up to 30 days.
Commit: 76aa8a1

Signed-off-by: Fynn Schmitt-Ulms <[email protected]>

Add Eagle3DraftModel implementation

0e0d02e

Signed-off-by: Fynn Schmitt-Ulms <[email protected]>

Initial training loop script + batched data loading and collation

ae94a42

Signed-off-by: Fynn Schmitt-Ulms <[email protected]>

fynnsu force-pushed the eagle3_training branch from 0e33c78 to 6f172d4 Compare October 3, 2025 20:19

fynnsu added 3 commits October 3, 2025 16:36

Add loss masking

20cd35f

Signed-off-by: Fynn Schmitt-Ulms <[email protected]>

Disable training on verifier_lm_head

08c8848

Signed-off-by: Fynn Schmitt-Ulms <[email protected]>

Add Distributed Data Parallel (DDP)

76aa8a1

Signed-off-by: Fynn Schmitt-Ulms <[email protected]>

fynnsu force-pushed the eagle3_training branch from 6f172d4 to 76aa8a1 Compare October 3, 2025 21:09

Provide feedback