vllm-project / llm-compressor Public

Notifications You must be signed in to change notification settings
Fork 271
Star 2.2k

Code
Issues 56
Pull requests 42
Discussions
Actions
Projects
Wiki
Security
Insights

Additional navigation options

Code
Issues
Pull requests
Discussions
Actions
Projects
Wiki
Security
Insights

Pull requests: vllm-project/llm-compressor

Labels 20 Milestones 0

New pull request New

42 Open 886 Closed

Author

Filter by author

Uh oh!

There was an error while loading. Please reload this page.

Label

Filter by label

Uh oh!

There was an error while loading. Please reload this page.

Use alt + click/return to exclude labels

or ⇧ + click/return for logical OR

Projects

Filter by project

Uh oh!

There was an error while loading. Please reload this page.

Milestones

Filter by milestone

Uh oh!

There was an error while loading. Please reload this page.

Reviews

Filter by reviews

No reviews Review required Approved review Changes requested

Assignee

Filter by who’s assigned

Assigned to nobody

Uh oh!

There was an error while loading. Please reload this page.

Sort

Sort by

Newest Oldest Most commented Least commented Recently updated Least recently updated Best match

Most reactions

Pull requests list

[AWQ] allow for use of model-wide kwargs cache

#1985 opened Oct 30, 2025 by brian-dellabetta • Draft

[Qwen3 Next] Update qwen3_next to use moe_calibration_context nvfp4

For any PR / issue related to NVFP4 support

qwen

For any PR / issue related to Qwen support

ready

When a PR is ready for review

#1984 opened Oct 30, 2025 by dsikka • Draft

[MoE] Clean up imports, add qwen3_moe_vl, change logger level ready

When a PR is ready for review

#1981 opened Oct 29, 2025 by kylesayrs

Loading…

[InternVL3]Add internvl3 quantizing example

#1977 opened Oct 29, 2025 by BigFaceBoy

Loading…

[AWQ] Allow users to disable quantization during AWQ

#1973 opened Oct 28, 2025 by brian-dellabetta • Draft

[PTQ] weights_ptq pathway for day-zero weight quantization support

#1971 opened Oct 28, 2025 by kylesayrs • Draft

Modernize entrypoints module with type hints and use generic types ready

When a PR is ready for review

#1965 opened Oct 25, 2025 by sugatmahanti

Loading…

Fixing untie to be used only as needed and automatic ready

When a PR is ready for review

#1963 opened Oct 24, 2025 by HDCharles

Loading…

[WIP] Generalize AWQ quantization

#1961 opened Oct 22, 2025 by kylesayrs • Draft

Adding new MoE e2e tests [wip]

#1960 opened Oct 22, 2025 by HDCharles • Draft

[Oneshot] Add validation for empty dataset and enhance oneshot function parameters

#1957 opened Oct 21, 2025 by ArkaSanka

Loading…

[Autowrapper] Trace vision tower for better offloading

#1948 opened Oct 18, 2025 by kylesayrs • Draft

[MXFP4] Support

#1938 opened Oct 15, 2025 by dsikka • Draft

[Observers] Change MSE global scale objective function

#1935 opened Oct 14, 2025 by kylesayrs • Draft

AI Fix for: Create AWQ guide for llm-docs

#1932 opened Oct 14, 2025 by shanaya-Gupta

Loading…

[Attention] Support FP4 attention quantization nvfp4

For any PR / issue related to NVFP4 support

#1924 opened Oct 14, 2025 by kylesayrs

Loading…

Add: File Based Caching for lm_eval tests

#1900 opened Oct 6, 2025 by rahul-tuli • Draft

[Training] Fix tokenizer attribute of SessionMixin ready

When a PR is ready for review

#1895 opened Oct 1, 2025 by kylesayrs

Loading…

add gpt oss nvfp4 example

#1885 opened Sep 30, 2025 by shanjiaz • Draft

Add awq activation fp8 support in loss compute

#1873 opened Sep 27, 2025 by Bluedyson

Loading…

[Dependencies] update lm_eval version pin ready

When a PR is ready for review

#1862 opened Sep 24, 2025 by brian-dellabetta

Loading…

[Logging] clean up CompressionLogger verbosity ready

When a PR is ready for review

#1861 opened Sep 23, 2025 by brian-dellabetta

Loading…

MSE observer for NVFP4

#1840 opened Sep 17, 2025 by shubhra

Loading…

Updating base.py (parallel calibration and model #1809)

#1837 opened Sep 17, 2025 by aashvgit

Loading…

Add file to linearize and quantize the gpt-oss models

#1831 opened Sep 17, 2025 by shubhra

Loading…

Previous 1 2 Next

Previous Next

ProTip! Mix and match filters to narrow down what you’re looking for.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Uh oh!