v0.7.7
This release mainly targets autodiff. It fixes the SPIR-V backends (Metal, Vulkan), significantly improves runtime speed (by up to 30%), and adds full support for debug mode.
What's Changed
AutoDiff
- [Perf] Adstack max-reducer: launch cache + zero-copy result map; content-stable registry_id by @duburcqa in #671
- [SPIR-V] dispatch_max_reducers: register each task with the real kernel name by @duburcqa in #675
- [AutoDiff] Debug-mode field/grad/dual: dtype, layout, and access-time invariants by @duburcqa in #677
Full Changelog: v0.7.6...v0.7.7