We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
1 parent fd6cfe1 commit e51efbfCopy full SHA for e51efbf
CHANGELOG.md
@@ -36,6 +36,8 @@
36
* Fix some profiler issues.
37
- Complete the reference for Blackwell blockwise gemm kernels.
38
- Fix incorrect regex logic for L1 test.
39
+* Various improvements and fixes from the community and CUTLASS team. Thanks to everyone who submitted PRs!
40
+* Optimal code generation with CUDA toolkit versions 12.9.
41
42
## [4.0.0](https://github.com/NVIDIA/cutlass/releases/tag/v4.0.0) (2025-06-03)
43
0 commit comments