Skip to content

Releases: gpustack/llama-box

v0.0.171

16 Aug 03:52

Choose a tag to compare

refactor: adjust context size

Signed-off-by: thxCode <[email protected]>

v0.0.170

13 Aug 03:34

Choose a tag to compare

  1. Rebase upstream.
  2. Fix tool calling with --jinja in Qwen3 Coder.

v0.0.169

07 Aug 10:54

Choose a tag to compare

  1. Rebase upstream.
  2. Fix MUSA release suffix.

v0.0.168

04 Aug 04:54

Choose a tag to compare

  1. Rebase upstream.

v0.0.167

28 Jul 02:23

Choose a tag to compare

  1. Rebase upstream;
  2. Bump MUSA to rc4.2.0, cc @yeahdongcn .

v0.0.166

21 Jul 07:56

Choose a tag to compare

  1. Rebase upstream.

v0.0.165

17 Jul 11:03

Choose a tag to compare

  1. Rebase upstream;
  2. Support Kimi-K2.

v0.0.164

11 Jul 15:50

Choose a tag to compare

  1. Rebase upstream;
  2. Fix zero offloading VRAM occupied in DL packages.

v0.0.163

10 Jul 01:34

Choose a tag to compare

  1. Fix failed while chatting service deployed inside a Docker container.

v0.0.162

08 Jul 19:59

Choose a tag to compare

  1. Simplify samplers;
  2. Refactor managing embeddings KV cache;
  3. Fix invalid capacity in Darwin RPC server.