v212
This release packs a lot into it. It introduces a new experimental performance monitor for linux machines first. In the UI there is a new tab that will show up to the last hour of statistics:
Additionally a /metrics for the common prometheus and grafana combo. A grafana dashboard example is provided to get you started. It looks like this:
Other small changes
- versionless API endpoints were added that do not require the v1/ prefix. These help with upstream peers like z.ai that do not follow the v1 versioning convention
- the
-watch-configsystem has been refactored. It supports a mounting the config file into a docker container now. This removes the requirement to mount a directory with the config in it.
Contributions from the community
Much thanks to @bankjaneo (#741), @rhtenhove (#746), @sousekd (#753).
Changelog
- aac7b87 ci: set go-version-file in release workflow
- 4e606fe ci: fix workflow bugs in release and go-ci
- a4b91e0 Changes and fixes before the release (docs/small tweaks) (#750)
- 3e3646f perf: ignore LACT devices reporting zero VRAM (#753)
- a01afe2 ci: use manifest-aware cleanup action for multi-arch :cpu (#751)
- 174e856 Multi arch cpu (#746)
- 085b54b proxy: fix data race in /running endpoint and typo in error message (#748)
- 2be3416 ui: add auto theme switch mode based on system theme (#741)
- 7e3e94a proxy,ui: add performance monitoring with Prometheus metrics (#743)
- e261745 proxy: add versionless API endpoint (#733)
- 11b7913 llama-swap.go: remove debounce, replace fmt.Printlns (#731)