v212

@github-actions released this 14 May 05:14
· 17 commits to main since this release
aac7b87

This release packs in a lot. It introduces a new experimental performance monitor, available on Linux machines first. The UI has a new tab that shows up to the last hour of statistics:

[Image: the new performance tab in the UI]

Additionally, there is a new /metrics endpoint for the common Prometheus and Grafana combo. An example Grafana dashboard is provided to get you started. It looks like this:

[Image: the example Grafana dashboard]
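
If you want to inspect the /metrics output without setting up Prometheus, a small script is enough. The sketch below assumes the endpoint serves the standard Prometheus text exposition format. The metric names in SAMPLE and the address in the usage note are made up for illustration and are not taken from llama-swap.

```python
import urllib.request


def parse_metrics(text: str) -> dict[str, float]:
    """Parse Prometheus text-format samples into {series: value}.

    Comment lines (# HELP / # TYPE) and blank lines are skipped.
    An optional trailing timestamp is ignored.
    """
    samples: dict[str, float] = {}
    for raw in text.splitlines():
        line = raw.strip()
        if not line or line.startswith("#"):
            continue
        if "{" in line:
            # Label values may contain spaces, so split after the closing brace.
            end = line.rindex("}")
            series, rest = line[: end + 1], line[end + 1 :].split()
        else:
            parts = line.split()
            series, rest = parts[0], parts[1:]
        samples[series] = float(rest[0])
    return samples


def fetch_metrics(base_url: str) -> dict[str, float]:
    """Fetch and parse /metrics from a running instance (base_url is an assumption)."""
    with urllib.request.urlopen(base_url.rstrip("/") + "/metrics", timeout=5) as resp:
        return parse_metrics(resp.read().decode("utf-8"))


# Hypothetical sample; the real metric names may differ.
SAMPLE = """\
# HELP example_gpu_vram_used_bytes Hypothetical gauge for illustration only.
# TYPE example_gpu_vram_used_bytes gauge
example_gpu_vram_used_bytes{gpu="0"} 1024
example_cpu_usage_percent 12.5
"""

if __name__ == "__main__":
    for series, value in parse_metrics(SAMPLE).items():
        print(series, value)
```

Against a live instance, something like `fetch_metrics("http://localhost:8080")` should work, with the host and port adjusted to your setup.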

Other small changes

  • Versionless API endpoints were added that do not require the v1/ prefix. These help with upstream peers, such as z.ai, that do not follow the v1 versioning convention.
  • The -watch-config system has been refactored. It now supports mounting the config file directly into a Docker container, which removes the requirement to mount a directory containing the config.
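
To illustrate the versionless endpoints, here is a minimal sketch of a client-side helper that builds a request URL with or without the v1/ prefix. The base address and the chat/completions path are assumptions for the example. These notes do not list which routes are exposed without the prefix.

```python
def endpoint_url(base_url: str, path: str, versioned: bool = True) -> str:
    """Build an API URL, optionally omitting the v1/ prefix.

    `path` may be given with or without a leading "v1/"; it is normalized first.
    """
    base = base_url.rstrip("/")
    path = path.lstrip("/")
    if path.startswith("v1/"):
        path = path[len("v1/"):]
    prefix = "v1/" if versioned else ""
    return f"{base}/{prefix}{path}"


if __name__ == "__main__":
    # Hypothetical local address; adjust to your deployment.
    print(endpoint_url("http://localhost:8080", "chat/completions"))
    print(endpoint_url("http://localhost:8080", "v1/chat/completions", versioned=False))
```

A client that talks to a peer without v1 versioning would pass `versioned=False` and otherwise keep its request logic unchanged.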

Contributions from the community

Many thanks to @bankjaneo (#741), @rhtenhove (#746), and @sousekd (#753).

Changelog

  • aac7b87 ci: set go-version-file in release workflow
  • 4e606fe ci: fix workflow bugs in release and go-ci
  • a4b91e0 Changes and fixes before the release (docs/small tweaks) (#750)
  • 3e3646f perf: ignore LACT devices reporting zero VRAM (#753)
  • a01afe2 ci: use manifest-aware cleanup action for multi-arch :cpu (#751)
  • 174e856 Multi arch cpu (#746)
  • 085b54b proxy: fix data race in /running endpoint and typo in error message (#748)
  • 2be3416 ui: add auto theme switch mode based on system theme (#741)
  • 7e3e94a proxy,ui: add performance monitoring with Prometheus metrics (#743)
  • e261745 proxy: add versionless API endpoint (#733)
  • 11b7913 llama-swap.go: remove debounce, replace fmt.Printlns (#731)