quic / efficient-transformers Public

Notifications You must be signed in to change notification settings
Fork 59
Star 80

Code
Issues 2
Pull requests 42
Actions
Projects
Security
Insights

Additional navigation options

Code
Issues
Pull requests
Actions
Projects
Security
Insights

Pull requests: quic/efficient-transformers

Labels 22 Milestones 0

New pull request New

42 Open 528 Closed

Author

Filter by author

Uh oh!

There was an error while loading. Please reload this page.

Label

Filter by label

Uh oh!

There was an error while loading. Please reload this page.

Use alt + click/return to exclude labels

or ⇧ + click/return for logical OR

Projects

Filter by project

Uh oh!

There was an error while loading. Please reload this page.

Milestones

Filter by milestone

Uh oh!

There was an error while loading. Please reload this page.

Reviews

Filter by reviews

No reviews Review required Approved review Changes requested

Assignee

Filter by who’s assigned

Assigned to nobody

Uh oh!

There was an error while loading. Please reload this page.

Sort

Sort by

Newest Oldest Most commented Least commented Recently updated Least recently updated Best match

Most reactions

Pull requests list

Sd3_custom_H_W_support

#580 opened Sep 29, 2025 by tv-karthikeya • Draft

[QEff Finetune] Correction in data type of loss

#579 opened Sep 29, 2025 by quic-swatia • Draft

Onboarding OmniSVG

#578 opened Sep 27, 2025 by qcdipankar • Draft

added MXFP4 quantizer support to directly load GPT-OSS models via QEFFAutoModelForCausalLM enhancement

New feature or request

quantization

#577 opened Sep 26, 2025 by ochougul

Loading…

Adding Compute-Context-Length (CCL)

#576 opened Sep 26, 2025 by vjanfaza

Loading…

Fix llama model o_proj lora_ids passing for finite lorax

#575 opened Sep 25, 2025 by quic-jouachen

Loading…

Example walk through on how to onboard a Causal LM on Qefficient Transformers.

#574 opened Sep 24, 2025 by quic-dhirajku

Loading…

Fix llama model o_proj lora_ids passing for finite lorax in release/v1.20.0

#573 opened Sep 23, 2025 by quic-jouachen

Loading…

Wav2vec2 onboarding

#571 opened Sep 23, 2025 by tchawada

Loading…

Added support for InternVL_3_5 series of VLMs.

#566 opened Sep 19, 2025 by quic-dhirajku

Loading…

[Qwen2_5_vl] - Onboarding Qwen2_5_vl model in QEfficient

#560 opened Sep 12, 2025 by mohiso22 • Draft

Llama4 VLM Continuous Batching Support

#559 opened Sep 11, 2025 by asmigosw • Draft

Updated KV_pytorch and ORT inference for VLMs to incorporate image_idx

#557 opened Sep 10, 2025 by quic-dhirajku

Loading…

Resolved Custom IO file not found via CLI

#556 opened Sep 10, 2025 by abhishek-singh591

Loading…

Logger Module For Efficient Transformers 1.21.0 wip

Work in progress

#555 opened Sep 10, 2025 by quic-hemagnih • Draft

Onboarding Molmo Model

#554 opened Sep 8, 2025 by mohiso22

Loading…

Extend On-Device Sampling Support to more Causal Language Models

#553 opened Sep 4, 2025 by quic-sanising

Loading…

Onnx function for MMDit block

#552 opened Sep 3, 2025 by quic-akuruvil • Draft

TF ver 4.55.0, pytorch 2.7.1, hf hub 0.34.0 and diffusers 0.31.0

#551 opened Sep 3, 2025 by quic-hemagnih • Draft

Embedding Model fix wip

Work in progress

#548 opened Aug 28, 2025 by quic-amitraj • Draft

Added Multiframe Inference for llama4+internvl

#547 opened Aug 27, 2025 by aditjadh

Loading…

Optimized ONNX Transform via Class Merging and Thread Pooling

#546 opened Aug 23, 2025 by abhishek-singh591

Loading…

updated notebooks

#543 opened Aug 20, 2025 by smedhe

Loading…

Transformers version 4.55 upgrade, Update PyTorch to 2.7.0+cpu, Torchvision to 0.22.0+cpu, and Python Requirement to >=3.9

#542 opened Aug 19, 2025 by quic-mamta

Loading…

removed platform sdk dependency

#540 opened Aug 19, 2025 by smedhe

Loading…

Previous 1 2 Next

Previous Next

ProTip! Find all pull requests that aren't related to any open issues with -linked:issue.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Uh oh!