Quantized versions are here:
https://huggingface.co/TheBloke/Mistral-7B-Instruct-v0.1-GGUF

For example, to load the Q4_K_M variant with ctransformers:
```python
from ctransformers import AutoModelForCausalLM

# Set gpu_layers to the number of layers to offload to GPU.
# Set to 0 if no GPU acceleration is available on your system.
llm = AutoModelForCausalLM.from_pretrained(
    "TheBloke/Mistral-7B-Instruct-v0.1-GGUF",
    model_file="mistral-7b-instruct-v0.1.Q4_K_M.gguf",
    model_type="mistral",
    gpu_layers=50,
)
print(llm("AI is going to"))
```
See https://github.com/mrseanryan/gpt-workflow/tree/master/local-llm-q for a fuller example.
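As a side note, the snippet above sends a raw completion prompt; Mistral-7B-Instruct is trained on the `[INST] ... [/INST]` chat template, so instructions are followed more reliably when wrapped accordingly. A minimal sketch of a prompt-building helper (the `format_instruction` name is hypothetical, not part of ctransformers):

```python
def format_instruction(user_message: str) -> str:
    """Wrap a user message in Mistral's [INST] instruction template.

    The leading <s> is the BOS token; drop it if your loader already
    prepends one automatically.
    """
    return f"<s>[INST] {user_message} [/INST]"

prompt = format_instruction("Explain GGUF quantization in one sentence.")
print(prompt)
```

The formatted string can then be passed straight to `llm(prompt)` in place of the raw text above.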