
Draft: Merge LoRA Adapters with AWQ BaseModels #2418

Closed
Whadup wants to merge 3 commits into huggingface:main from Whadup:awq-lora-merge


Conversation


@Whadup Whadup commented Mar 10, 2025

This PR extends the AwqLoraLinear class to allow merging LoRA adapters into the quantized base weights.
Instead of re-quantizing the whole model, we reuse the original quantization scales and zero points.
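The approach described above can be sketched as follows. This is a minimal NumPy illustration of the idea, not the actual AwqLoraLinear implementation: the function names, the per-row layout of scales and zero points, and the asymmetric round-to-nearest quantization scheme are all assumptions for the sake of the sketch.

```python
import numpy as np

def quantize(w, scales, zeros, n_bits=4):
    # Asymmetric round-to-nearest quantization with FIXED scales/zeros
    # (no recalibration), as assumed in this sketch.
    q = np.clip(np.round(w / scales + zeros), 0, 2 ** n_bits - 1)
    return q.astype(np.int32)

def dequantize(q, scales, zeros):
    # Inverse mapping back to floating point.
    return (q - zeros) * scales

def merge_lora(q_weight, scales, zeros, lora_A, lora_B, scaling, n_bits=4):
    # Dequantize with the original parameters, add the LoRA delta
    # (scaling * B @ A), then requantize with the SAME scales/zeros,
    # so the whole model never needs to be re-calibrated.
    w = dequantize(q_weight, scales, zeros)
    w_merged = w + scaling * (lora_B @ lora_A)
    return quantize(w_merged, scales, zeros, n_bits)
```

Because the scales and zero points are kept fixed, the merged delta is subject to the same rounding as the base weights, so the merge is only exact up to quantization error.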

@Whadup Whadup changed the title Merge LoRA Adapters with AWQ BaseModels [Experimental] Draft: Merge LoRA Adapters with AWQ BaseModels Mar 10, 2025
Member

@BenjaminBossan BenjaminBossan left a comment


Thanks for adding merging capabilities to AWQ. I only skimmed the PR so far, but could you please:

  • Also implement the unmerge method? It should be very similar to the merge method, but remove the delta weight
  • There should be a unit test to ensure that merging works, e.g. similar to this test (without DoRA).
  • Let's run make style on your changes.
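The requested unmerge step would be the inverse of merging: subtract the LoRA delta instead of adding it, then requantize with the stored parameters. A minimal NumPy sketch under the same hypothetical quantization scheme (fixed per-row scales and zero points, asymmetric round-to-nearest; names and layout are assumptions, not the actual peft API):

```python
import numpy as np

def unmerge_lora(q_weight, scales, zeros, lora_A, lora_B, scaling, n_bits=4):
    # Dequantize with the stored scales/zero points, subtract the LoRA
    # delta, and requantize with the same parameters. Note that because
    # merging already rounded the weights, merge followed by unmerge only
    # recovers the original quantized weights up to quantization error.
    w = (q_weight - zeros) * scales
    w = w - scaling * (lora_B @ lora_A)
    q = np.clip(np.round(w / scales + zeros), 0, 2 ** n_bits - 1)
    return q.astype(np.int32)
```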

@Whadup
Author

Whadup commented Mar 11, 2025

@BenjaminBossan Thanks for looking into it already! Your three points are on my agenda; I will ping you when I commit the changes.

@BenjaminBossan
Member

Great, thanks a lot.

@github-actions

This issue has been automatically marked as stale because it has not had recent activity. If you think this still needs to be addressed please comment on this thread.

@BenjaminBossan
Member

@Whadup Do you still plan on working on this?

@github-actions

github-actions bot commented May 5, 2025

This issue has been automatically marked as stale because it has not had recent activity. If you think this still needs to be addressed please comment on this thread.

@BenjaminBossan
Member

It's not quite clear to me, but it appears that AutoAWQ will be integrated into llm-compressor:

AutoAWQ Integration: Perform low-bit weight-only quantization efficiently using AutoAWQ, now part of LLM Compressor. Note: This integration should be considered experimental for now.

@github-actions

This issue has been automatically marked as stale because it has not had recent activity. If you think this still needs to be addressed please comment on this thread.

@github-actions github-actions bot closed this Jun 8, 2025