-
Notifications
You must be signed in to change notification settings - Fork 135
feat: add adamw 8bit #1075
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: main
Are you sure you want to change the base?
feat: add adamw 8bit #1075
Conversation
|
I think this has some issues with SFT checkpointing unfortunately but I lack bandwidth to dig into why. Leaving this draft pr up in case I get more time or someone else wants to take a stab. |
|
Hey, this is a really usefull PR, thank you for the work. Really weird that the dcp check-pointing is failing, I will look into it when I have more bandwidth |
|
Glad to hear it! I found it very useful to reduce memory requirements for this project: https://x.com/neev_parikh/status/1967767438243876924 |


Adds AdamW 8Bit from TorchAO.