Skip to content

Conversation

@ngc92
Copy link
Collaborator

@ngc92 ngc92 commented Dec 15, 2025

a few small updates:

  • error message in case of GPU oversubscription
  • better logging messages
  • new command line options for seed + better cli validation
  • copying config files (e.g., tokenizer.json) to exported model

update benchmarks in readme. add additional files under /benchmarks that contain the actual commands used for easier replication. also, add baseline configs for llama-factory on the 4090.

@ngc92 ngc92 merged commit 229ba2f into dev Dec 15, 2025
29 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants