v0.6.0

@wenhuach21 released this 24 Jul 02:33
dd95bdb

Highlights

  • provide experimental support for the GGUF q*_k formats and customized mixed-bit settings
  • support xpu in triton backend by @wenhuach21 in #563
  • add torch backend by @WeiweiZhang1 in #555
  • provide initial support for the llmcompressor format (only INT8 W8A8 dynamic quantization is supported) by @xin3he in #646
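
For readers unfamiliar with the term, "INT8 W8A8 dynamic quantization" generally means that weights are stored as 8-bit integers ahead of time, while activations are quantized to 8 bits on the fly from each live input. The following is a minimal pure-Python sketch of that idea, assuming symmetric per-tensor scales. The helper names `quantize_int8` and `w8a8_dynamic_dot` are hypothetical and are not part of auto-round or llmcompressor; the actual exported format may use different granularity (for example per-channel or per-token scales).

```python
# Illustrative sketch only: NOT the auto-round or llmcompressor implementation.
# Assumes symmetric per-tensor scaling; all names here are hypothetical.

def quantize_int8(values):
    """Symmetric INT8 quantization: return (integer codes, scale)."""
    max_abs = max(abs(v) for v in values)
    scale = max_abs / 127.0 if max_abs > 0 else 1.0
    # Round to the nearest integer code and clamp to the signed 8-bit range.
    codes = [max(-127, min(127, round(v / scale))) for v in values]
    return codes, scale


def w8a8_dynamic_dot(weight_codes, weight_scale, activations):
    """Dot product with pre-quantized weights and dynamically quantized activations."""
    # "Dynamic": the activation scale is computed from the input at call time.
    act_codes, act_scale = quantize_int8(activations)
    # Accumulate in integers, then rescale once back to floating point.
    acc = sum(w * a for w, a in zip(weight_codes, act_codes))
    return acc * weight_scale * act_scale


weights = [0.5, -1.0, 0.25, 0.75]
activations = [2.0, 1.0, -4.0, 0.5]

# Weights are quantized once, offline.
weight_codes, weight_scale = quantize_int8(weights)

approx = w8a8_dynamic_dot(weight_codes, weight_scale, activations)
exact = sum(w * a for w, a in zip(weights, activations))
print(f"exact={exact:.4f} approx={approx:.4f}")
```

The quantized result should track the floating-point dot product closely, with a small error that comes from rounding both operands to 8 bits.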

Full Changelog: v0.5.1...v0.6.0