Change the repository type filter
All
Repositories list
72 repositories
llmq
PublicQuantized LLM training in pure CUDA/C++.Quartet
Publiclocal_platinum_bench
PublicMoE-Quant
PublicQuEST
PublicEvoPress
PublicFP-Quant
PublicGridSearcher
Publicnanochat
PublicCAGE
Publicqutlass
PublicQuTLASS: CUTLASS-Powered Quantized BLAS for Deep Learningtorchtitan
Publicnanochat-qat
PublicCAGE-ao
Publicunified-sc-laws
PublicISTA-DASLab-Optimizers
Publicgptq-gguf-toolkit
Publicinfluence_distillation
PublicOfficial implementation of Influence Distillation: https://www.arxiv.org/abs/2505.19051PanzaMail
PublicHALO-anon
Publictorch_cgx
Publicgemm-int8
PublicDarwinLM
PublicScalableMNN
PublicSPADE
PublicHALO
PublicHALO: Hadamard-Assisted Low-Precision Optimization and Training method for finetuning LLMs. 🚀 The official implementation of https://arxiv.org/abs/2501.02625gemm-fp8
PublicMicroAdam
Publicllm-foundry
Public