-
Notifications
You must be signed in to change notification settings - Fork 59
Pull requests: quic/efficient-transformers
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
added MXFP4 quantizer support to directly load GPT-OSS models via QEFFAutoModelForCausalLM
enhancement
New feature or request
quantization
#577
opened Sep 26, 2025 by
ochougul
Loading…
Fix llama model o_proj lora_ids passing for finite lorax
#575
opened Sep 25, 2025 by
quic-jouachen
Loading…
Example walk through on how to onboard a Causal LM on Qefficient Transformers.
#574
opened Sep 24, 2025 by
quic-dhirajku
Loading…
Fix llama model o_proj lora_ids passing for finite lorax in release/v1.20.0
#573
opened Sep 23, 2025 by
quic-jouachen
Loading…
Updated KV_pytorch and ORT inference for VLMs to incorporate image_idx
#557
opened Sep 10, 2025 by
quic-dhirajku
Loading…
Logger Module For Efficient Transformers
1.21.0
wip
Work in progress
#555
opened Sep 10, 2025 by
quic-hemagnih
•
Draft
Extend On-Device Sampling Support to more Causal Language Models
#553
opened Sep 4, 2025 by
quic-sanising
Loading…
TF ver 4.55.0, pytorch 2.7.1, hf hub 0.34.0 and diffusers 0.31.0
#551
opened Sep 3, 2025 by
quic-hemagnih
•
Draft
Optimized ONNX Transform via Class Merging and Thread Pooling
#546
opened Aug 23, 2025 by
abhishek-singh591
Loading…
Transformers version 4.55 upgrade, Update PyTorch to 2.7.0+cpu, Torchvision to 0.22.0+cpu, and Python Requirement to >=3.9
#542
opened Aug 19, 2025 by
quic-mamta
Loading…
Previous Next
ProTip!
Find all pull requests that aren't related to any open issues with -linked:issue.