v5.7.2rc1
Pre-releaseThis release adds a setting to reduce peak VRAM usage and improve performance, plus a few other fixes and enhancements.
Memory Management Improvements
By default, Invoke uses pytorch
's own memory allocator to load and manage models in VRAM. CUDA also provides a memory allocator, and on many systems, the CUDA allocator outperforms the pytorch
allocator, reducing peak VRAM usage and improving performance overall.
You can use the new pytorch_cuda_alloc_conf
setting in invokeai.yaml
to opt-in to CUDA's memory allocator:
pytorch_cuda_alloc_conf: "backend:cudaMallocAsync"
If you do not add this setting, Invoke will continue to use the pytorch
allocator (same as it always has).
There are other possible configurations you can use for this setting, dictated by pytorch
. Refer to the new section in the Low-VRAM mode docs for more information.
Other Changes
- You may now upload WEBP images to Invoke. They will be converted to PNGs for use within the application. Thanks @keturn!
- More conservative estimates for VAE VRAM usage. This aims to reduce the slowdowns and OOMs on the VAE decode step.
- Fixed "single or collection" field type rendering in the Workflow Editor. This was causing fields like IP Adapter's images and ControlNet's control weights from displaying a widget.
- Fixed the download button in the Workflow Library list, which was downloading the active workflow instead of the workflow for which the button was clicked.
Installing and Updating
The new Invoke Launcher is the recommended way to install, update and run Invoke. It takes care of a lot of details for you - like installing the right version of python - and runs Invoke as a desktop application.
Follow the Quick Start guide to get started with the launcher.
If you don't want to use the launcher, or need a headless install, you can follow the manual install guide.
What's Changed
- Tidy app entrypoint by @RyanJDick in #7668
- Do not cache image layers in CI docker build by @ebr in #7712
- Add
pytorch_cuda_alloc_conf
config to tune VRAM memory allocation by @RyanJDick in #7673 - Increase VAE decode memory estimates by @RyanJDick in #7674
- fix(ui): download button in workflow library downloads wrong workflow by @psychedelicious in #7715
- docs: update RELEASE.md by @psychedelicious in #7707
- fix(ui): single or collection field rendering by @psychedelicious in #7714
- feat: accept WebP uploads for assets by @keturn in #7718
- chore: bump version to v5.7.2rc1 by @psychedelicious in #7721
Full Changelog: v5.7.1...v5.7.2rc1