Skip to content

[Solutions] llama.cpp integration #335

@cerb

Description

@cerb

Run the llama.cpp server for any model:

llama-server --hf-repo ggml-org/bge-small-en-v1.5-Q8_0-GGUF \
  --hf-file bge-small-en-v1.5-q8_0.gguf -c 2048 --embeddings --port 9997

Cerb connects using OpenAI standard API endpoints.

Metadata

Metadata

Assignees

No one assigned

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions