Skip to content

Pull requests: quic/efficient-transformers

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

First Block Caching Infra for diffusers Diffusers Use for PR related to diffusers in efficient-transformers.
#941 opened Apr 24, 2026 by quic-amitraj Contributor Loading…
Logging support added for HF Trainer stack
#938 opened Apr 22, 2026 by quic-abhamidi Loading…
feat(moe): NSP-blocked expert dispatch for Qwen3MOE and GPT-OSS prefill enhancement New feature or request
#935 opened Apr 21, 2026 by vbaddi Contributor Loading…
Added MDP generation to QEff Compile
#930 opened Apr 21, 2026 by quic-mohmeh Loading…
Enabled Qwen3-VL embedding model
#923 opened Apr 20, 2026 by quic-amitraj Contributor Loading…
[Qwen3_Omni]_Onboarding
#922 opened Apr 20, 2026 by mohiso22 Contributor Draft
Enabling support of rerankers models 2B and 8B of qwen3vl
#921 opened Apr 18, 2026 by quic-amitraj Contributor Loading…
MLA perf
#910 opened Apr 8, 2026 by quic-mamta Contributor Loading…
feat: Enable benchmark-mode module inventory/export across all CausalLM architectures enhancement New feature or request
#906 opened Apr 3, 2026 by vbaddi Contributor Loading…
qwen3_5_linear_attn
#901 opened Apr 1, 2026 by mohiso22 Contributor Draft
[Nightly CI]: Creating CI Pipeline for Nightly Build
#828 opened Mar 5, 2026 by abukhoy Contributor Draft
FirstCache for Diffusers
#803 opened Feb 23, 2026 by quic-amitraj Contributor Draft
Add support for num_crops and valid_size from vLLM
#796 opened Feb 17, 2026 by quic-vargupt Contributor Loading…
MLA
#789 opened Feb 10, 2026 by quic-mamta Contributor Loading…
ProTip! Find all pull requests that aren't related to any open issues with -linked:issue.