localai 2.26.0 #207814

Merged

merged 2 commits into master from bump-localai-2.26.0 on Feb 15, 2025

Conversation

BrewTestBot (Member) commented Feb 15, 2025

Created by `brew bump`

Created with `brew bump-formula-pr`.

  • resource blocks have been checked for updates.
Release notes

🦙 LocalAI v2.26.0!

Hey everyone - very excited about this release!

It contains several cleanups, performance improvements, and a few breaking changes: old backends that are now superseded have been removed (for example, vall-e-x), while new backends have been added to expand the range of model architectures LocalAI can support. Most of the changes are tested, but if you encounter issues with the new or migrated backends, please file a new issue.

We also now have support for Nvidia L4T devices (for example, Nvidia AGX Orin) with specific container images. See the documentation for more details.

⚠️ Breaking Changes ⚠️

  • Several backends have been dropped and replaced for improved performance and compatibility.
  • Vall-e-x and Openvoice were deprecated and dropped.
  • The stablediffusion-NCN backend was replaced with the stablediffusion-ggml implementation.
  • Deprecated llama-ggml backend has been dropped in favor of GGUF support.
Check all details!

Backends that were dropped:

  • Vall-e-x and Openvoice: These projects went silent, and there are better alternatives now. They have been completely superseded by the CoquiTTS community fork, Kokoro, and OuteTTS.
  • Stablediffusion-NCN: This was the first variant shipped with LocalAI based on the ONNX runtime. It has now been superseded by the stablediffusion-ggml backend, which offers similar capabilities and wider support across more architectures.
  • Llama-ggml backend: This was the pre-GGUF backend, which is now deprecated. Moving forward, LocalAI will support only GGUF models.

Notable Backend Changes:

  • Mamba has moved to the transformers backend.
  • Transformers-Musicgen has moved to the transformers backend.
  • Sentencetransformers has moved to the transformers backend.

While LocalAI will try to alias these to the transformers backend automatically, there might be incompatibilities with your configuration files; updating them manually is sketched below. Please open an issue if you face any problems!
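
If the automatic alias does not cover your setup, pointing the model's YAML config at the transformers backend directly should look roughly like this (a minimal sketch; the model name and file are placeholders):

  # models/my-model.yaml -- illustrative names
  name: my-model
  # was: backend: sentencetransformers
  backend: transformers
  parameters:
    model: my-model-file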

New Backends:

  • Kokoro (TTS): A new backend for text-to-speech (a sample request follows this list).
  • OuteTTS: A TTS backend with voice-cloning capabilities.
  • Fast-Whisper: A backend designed for faster Whisper model inference.
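
To try one of the new TTS backends, a request along these lines should work against LocalAI's existing /tts endpoint (the model name is a placeholder for whatever you have installed):

  curl http://localhost:8080/tts \
    -H "Content-Type: application/json" \
    -d '{"model": "kokoro", "input": "Hello from LocalAI!"}'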

New Features 🎉

  • Lazy grammars (llama.cpp): Added grammar triggers for llama.cpp. Models trained with specific tokens can now enable grammar generation only when those tokens are seen: this allows precise JSON generation for tool calls, but also consistent free-form output when the model does not need to answer with a tool. For example, triggers can be specified in the model's config file as follows:
  function:
    grammar:
      triggers:
        - word: "<tool_call>"
          at_start: true
  • Function Argument Parsing Using Named Regex: A new feature that allows parsing function arguments with named regular expressions, simplifying function calls (see the sketch after this list).
  • Support for New Backends: Added Kokoro, OuteTTS, and Fast-Whisper backends.
  • Diffusers Update: Added support for Sana pipelines and image generation option overrides.
  • Machine Tag and Inference Timing: Allows tracking machine performance during inference.
  • Tokenization: Introduced tokenization support for llama.cpp to improve text processing (see the example after this list).
  • AVX512: There is now bundled support for CPUs supporting the AVX512 instruction set.
  • Nvidia L4T: Support for Nvidia arm64 devices, for example the Nvidia AGX Orin and the like. See the documentation. TL;DR: you can start a ready-to-go container image with:
  docker run -e DEBUG=true \
    -p 8080:8080 \
    -v $PWD/models:/build/models \
    -ti --restart=always --name local-ai \
    --runtime nvidia --gpus all quay.io/go-skynet/local-ai:master-nvidia-l4t-arm64-core
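
The named-regex sketch mentioned above: the exact configuration key may differ (check the LocalAI function-calling docs), but the idea is that named capture groups pull the function name and its arguments out of raw model output. The key name below is an assumption:

  function:
    # hypothetical key name; named groups extract the parts of the call
    response_regex:
      - '(?P<function>\w+)\s*\((?P<arguments>.*)\)'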
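
And the tokenization example mentioned above: the request shape here is an assumption based on LocalAI's usual OpenAI-style endpoint conventions, so verify the exact path and fields against the API documentation:

  curl http://localhost:8080/v1/tokenize \
    -H "Content-Type: application/json" \
    -d '{"model": "my-model", "content": "Hello, world!"}'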

Bug Fixes 🐛

  • Multiple fixes to improve stability, including enabling SYCL support for stablediffusion-ggml and consistent OpenAI stop reason returns.
  • Improved context shift handling for llama.cpp and fixed gallery store overrides.

🧠 Models:

I've fine-tuned a family of models based on o1-cot and function-call datasets to work closely with all of LocalAI's function-calling features. The models are tailored to be conversational and to execute function calls:

Enjoy! All the models are available in the LocalAI gallery:

local-ai run LocalAI-functioncall-phi-4-v0.3
local-ai run LocalAI-functioncall-llama3.2-1b-v0.4
local-ai run LocalAI-functioncall-llama3.2-3b-v0.5
local-ai run localai-functioncall-qwen2.5-7b-v0.5
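
Once one of these models is running, it can be exercised through LocalAI's OpenAI-compatible chat endpoint with a tool definition; the get_weather tool below is purely illustrative:

  curl http://localhost:8080/v1/chat/completions \
    -H "Content-Type: application/json" \
    -d '{
      "model": "LocalAI-functioncall-phi-4-v0.3",
      "messages": [{"role": "user", "content": "What is the weather in Rome?"}],
      "tools": [{
        "type": "function",
        "function": {
          "name": "get_weather",
          "description": "Get the current weather for a city",
          "parameters": {
            "type": "object",
            "properties": {"city": {"type": "string"}},
            "required": ["city"]
          }
        }
      }]
    }'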

Other models

Numerous model updates and additions:

  • New models like nightwing3-10b, rombos-qwen2.5-writer, and negative_llama_70b.
  • Updated checksum for model galleries.
  • Added icons and improved prompt templates for various models.
  • Expanded model gallery with new additions like DeepSeek-R1, Mistral-small-24b, and more.


Full Changelog: mudler/LocalAI@v2.25.0...v2.26.0

@github-actions github-actions bot added the `go` (Go use is a significant feature of the PR or issue) and `bump-formula-pr` (PR was created using `brew bump-formula-pr`) labels Feb 15, 2025
Updated `grpcio-tools` resource.

Co-authored-by: Nanda H Krishna <[email protected]>

🤖 An automated task has requested bottles to be published to this PR.

@github-actions github-actions bot added the CI-published-bottle-commits The commits for the built bottles have been pushed to the PR branch. label Feb 15, 2025
@BrewTestBot BrewTestBot added this pull request to the merge queue Feb 15, 2025
Merged via the queue into master with commit df8b5ef Feb 15, 2025
15 checks passed
@BrewTestBot BrewTestBot deleted the bump-localai-2.26.0 branch February 15, 2025 21:05