Skip to content

Pull requests: vllm-project/llm-compressor

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

[WIP] Add an awq_asym lm-eval test
#1702 opened Aug 1, 2025 by aladerran Loading…
[MoE] Add conditional expert calibration
#1701 opened Aug 1, 2025 by dichn Loading…
add fp8 block example test
#1697 opened Jul 31, 2025 by derekk-nm Draft
[Example] [VLM] Gemma3n
#1696 opened Jul 31, 2025 by kylesayrs Draft
1686 refactor
#1687 opened Jul 28, 2025 by ved1beta Loading…
add quantization_w4a4_fp4 qwen3 example
#1681 opened Jul 24, 2025 by wangwenmingaa Loading…
[KV Cache] support kv cache int8 per channel quantization ready When a PR is ready for review
#1663 opened Jul 19, 2025 by Eviannn Loading…
[Docs] INFERENG-1089 - Clean up front page and add a Makefile ready When a PR is ready for review
#1660 opened Jul 18, 2025 by aireilly Loading…
[Examples] Fix ignore layers for Qwen2.5-VL
#1658 opened Jul 18, 2025 by SorenDreano Loading…
[Transform] Online Rotations
#1651 opened Jul 16, 2025 by kylesayrs Draft
[Transform] QuIP Modifier
#1648 opened Jul 15, 2025 by kylesayrs Draft
[Pipelines] Add propagate_error argument ready When a PR is ready for review
#1575 opened Jun 20, 2025 by kylesayrs Draft
[GPTQ] Use torch.compile to speed up gptq algo ready When a PR is ready for review
#1561 opened Jun 17, 2025 by aladerran Loading…
Disable sequential_targets from modifiers ready When a PR is ready for review
#1559 opened Jun 16, 2025 by kylesayrs Draft
AWQ minor performance improvements to smoothing ready When a PR is ready for review
#1557 opened Jun 16, 2025 by brian-dellabetta Loading…
[GPTQ] Change actorder default to "static"
#1425 opened May 12, 2025 by kylesayrs Loading…
ProTip! Mix and match filters to narrow down what you’re looking for.