quic / efficient-transformers Public

Pull requests: quic/efficient-transformers

55 Open 1,143 Closed

Pull requests list

Moe gemma4 changes

#1217 opened Jul 27, 2026 by tchawada Contributor

Adding the tmp CI changes

#1216 opened Jul 25, 2026 by quic-rishinr Contributor

feat: Replace num_replicate_kv_heads with boolean replicate_kv_heads flag

#1212 opened Jul 24, 2026 by quic-dhirajku Contributor

Adding PagedAttention support for CausalLM models

#1209 opened Jul 23, 2026 by vaibverm Contributor

Adding mdp fix for PP + Blocking + Subfunction

#1203 opened Jul 22, 2026 by mohiso22 Contributor • Draft

Adding support for Flux2 klein

#1201 opened Jul 22, 2026 by quic-swatia Contributor

[release/v1.22.0]: updating docs for release/v1.22.0

#1196 opened Jul 21, 2026 by abukhoy Contributor

Introducing OptimizedMoeTransform

#1190 opened Jul 17, 2026 by ochougul Contributor

Add CCL support to Gemma4, Qwen3.5 (moe), and qwen3_vl models

#1188 opened Jul 17, 2026 by vjanfaza Contributor

Qwen3.6_Tiling

#1185 opened Jul 15, 2026 by mohiso22 Contributor • Draft

QEff library utility to capture environment/runtime/library metadata right after loading a model before any transforms are applied.

#1183 opened Jul 15, 2026 by quic-dhirajku Contributor

Move qeff-bug-fix agent into skills_studio/agents

#1182 opened Jul 14, 2026 by quic-rishinr Contributor

3 tasks done

feat(weight-free): weight-free ONNX export for causal LMs (dynamo path)

Labels: 1.23 (Release 1.23 Features), enhancement (New feature or request), qeff.weightfree (WeightFree ONNX export based on Dynamo and Meta state)

#1181 opened Jul 13, 2026 by amarquic

3 tasks

[Causal-LM Test]: Adding causal lm test cases for diff configs

#1180 opened Jul 13, 2026 by abukhoy Contributor

fix: eliminate non-constant Slice ends in sliding-window CB subfunction

#1179 opened Jul 13, 2026 by quic-amitraj Contributor

4 tasks done

Feature/add glm ocr

#1177 opened Jul 12, 2026 by shagsood • Draft

Custom dtype - bf16/fp16 support

#1175 opened Jul 11, 2026 by asmigosw Contributor • Draft

test-dynamo(0711): add dynamo causal-lm test infra + fix gemma3 and qwen3_moe subfunction export

Labels: 1.23 (Release 1.23 Features), onnx.dynamo, wip (Work in progress)

#1174 opened Jul 11, 2026 by vbaddi Contributor • Draft

Add reported reproducer config test suite

Labels: enhancement (New feature or request)

#1173 opened Jul 10, 2026 by anujgupt-github Contributor

[nightly] Fix garbage output in VLM nightly tests

#1172 opened Jul 10, 2026 by quic-vishali Contributor • Draft

Updating blocking dummy tests

Labels: qeff.blocking

#1171 opened Jul 10, 2026 by kdulla Contributor • Draft

fix: eliminate non-constant Slice ends in Gemma4 sliding-window CB subfunction

Labels: 1.22 (Release 1.22 candidate), bugfix

#1170 opened Jul 9, 2026 by quic-amitraj Contributor

3 tasks done

Gpt_oss prefill: avoid packed expert-axis slicing in chunked MoE to restore MXFP6 constant compression

#1165 opened Jul 8, 2026 by abhishek-singh591 Contributor

Enabling model Qwen3-Embedding-0.6B and Qwen3-Embedding-8B

#1151 opened Jul 6, 2026 by quic-amitraj Contributor • Draft

Disagg serving with efficient KV handoff ready for review

#1150 opened Jul 6, 2026 by quic-akuruvil Contributor
