Commit 3d12d20

models(gallery): add llama-3.2 3B and 1B (#3671)
Signed-off-by: Ettore Di Giacinto <[email protected]>
1 parent d6522e6 commit 3d12d20

1 file changed: +60 -0 lines changed

gallery/index.yaml

Lines changed: 60 additions & 0 deletions
@@ -1,4 +1,64 @@
 ---
+## llama3.2
+- &llama32
+  url: "github:mudler/LocalAI/gallery/llama3.1-instruct.yaml@master"
+  icon: https://cdn-uploads.huggingface.co/production/uploads/642cc1c253e76b4c2286c58e/aJJxKus1wP5N-euvHEUq7.png
+  license: llama3.2
+  description: |
+    The Meta Llama 3.2 collection of multilingual large language models (LLMs) is a collection of pretrained and instruction-tuned generative models in 1B and 3B sizes (text in/text out). The Llama 3.2 instruction-tuned text only models are optimized for multilingual dialogue use cases, including agentic retrieval and summarization tasks. They outperform many of the available open source and closed chat models on common industry benchmarks.
+
+    Model Developer: Meta
+
+    Model Architecture: Llama 3.2 is an auto-regressive language model that uses an optimized transformer architecture. The tuned versions use supervised fine-tuning (SFT) and reinforcement learning with human feedback (RLHF) to align with human preferences for helpfulness and safety.
+  tags:
+    - llm
+    - gguf
+    - gpu
+    - cpu
+    - llama3.2
+  name: "llama-3.2-1b-instruct:q4_k_m"
+  urls:
+    - https://huggingface.co/hugging-quants/Llama-3.2-1B-Instruct-Q4_K_M-GGUF
+  overrides:
+    parameters:
+      model: llama-3.2-1b-instruct-q4_k_m.gguf
+  files:
+    - filename: llama-3.2-1b-instruct-q4_k_m.gguf
+      sha256: 1d0e9419ec4e12aef73ccf4ffd122703e94c48344a96bc7c5f0f2772c2152ce3
+      uri: huggingface://hugging-quants/Llama-3.2-1B-Instruct-Q4_K_M-GGUF/llama-3.2-1b-instruct-q4_k_m.gguf
+- !!merge <<: *llama32
+  name: "llama-3.2-3b-instruct:q4_k_m"
+  urls:
+    - https://huggingface.co/hugging-quants/Llama-3.2-3B-Instruct-Q4_K_M-GGUF
+  overrides:
+    parameters:
+      model: llama-3.2-3b-instruct-q4_k_m.gguf
+  files:
+    - filename: llama-3.2-3b-instruct-q4_k_m.gguf
+      sha256: c55a83bfb6396799337853ca69918a0b9bbb2917621078c34570bc17d20fd7a1
+      uri: huggingface://hugging-quants/Llama-3.2-3B-Instruct-Q4_K_M-GGUF/llama-3.2-3b-instruct-q4_k_m.gguf
+- !!merge <<: *llama32
+  name: "llama-3.2-3b-instruct:q8_0"
+  urls:
+    - https://huggingface.co/hugging-quants/Llama-3.2-3B-Instruct-Q8_0-GGUF
+  overrides:
+    parameters:
+      model: llama-3.2-3b-instruct-q8_0.gguf
+  files:
+    - filename: llama-3.2-3b-instruct-q8_0.gguf
+      sha256: 51725f77f997a5080c3d8dd66e073da22ddf48ab5264f21f05ded9b202c3680e
+      uri: huggingface://hugging-quants/Llama-3.2-3B-Instruct-Q8_0-GGUF/llama-3.2-3b-instruct-q8_0.gguf
+- !!merge <<: *llama32
+  name: "llama-3.2-1b-instruct:q8_0"
+  urls:
+    - https://huggingface.co/hugging-quants/Llama-3.2-1B-Instruct-Q8_0-GGUF
+  overrides:
+    parameters:
+      model: llama-3.2-1b-instruct-q8_0.gguf
+  files:
+    - filename: llama-3.2-1b-instruct-q8_0.gguf
+      sha256: ba345c83bf5cc679c653b853c46517eea5a34f03ed2205449db77184d9ae62a9
+      uri: huggingface://hugging-quants/Llama-3.2-1B-Instruct-Q8_0-GGUF/llama-3.2-1b-instruct-q8_0.gguf
 ## Qwen2.5
 - &qwen25
   name: "qwen2.5-14b-instruct"
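
Reviewer note on the `- !!merge <<: *llama32` entries: a YAML merge key copies every key from the anchored mapping and lets keys written in the entry itself take precedence. The merge is shallow, so `name`, `urls`, `overrides`, and `files` are replaced wholesale rather than combined with the 1B q4_k_m values stored on the anchor. The sketch below imitates that behaviour with plain Python dicts instead of a YAML parser. The `base` and `explicit` dicts are abbreviated stand-ins for the gallery entries, and `apply_merge_key` is a hypothetical helper, not part of LocalAI.

```python
# Abbreviated stand-in for the anchored `&llama32` entry (assumption: only a
# few of its keys are reproduced here for illustration).
base = {
    "url": "github:mudler/LocalAI/gallery/llama3.1-instruct.yaml@master",
    "license": "llama3.2",
    "tags": ["llm", "gguf", "gpu", "cpu", "llama3.2"],
    "name": "llama-3.2-1b-instruct:q4_k_m",
    "overrides": {"parameters": {"model": "llama-3.2-1b-instruct-q4_k_m.gguf"}},
}

# Keys written explicitly under one `- !!merge <<: *llama32` entry.
explicit = {
    "name": "llama-3.2-3b-instruct:q8_0",
    "overrides": {"parameters": {"model": "llama-3.2-3b-instruct-q8_0.gguf"}},
}


def apply_merge_key(anchor: dict, entry: dict) -> dict:
    """Shallow merge: keys in `entry` win, all other keys come from `anchor`."""
    return {**anchor, **entry}


merged = apply_merge_key(base, explicit)
print(merged["name"])     # taken from the explicit entry
print(merged["license"])  # inherited from the anchor
```

Because the merge is shallow, an entry that omitted `files` or `overrides` would silently inherit the 1B q4_k_m values from the anchor, which is why each merged entry in this diff restates them.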

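
Each `files` entry in the diff pairs a `filename` with a `sha256` digest. The commit does not show how LocalAI consumes that field, so the following is only a minimal sketch of how a reviewer might check a locally downloaded GGUF file against the listed digest. The function names `sha256_of` and `matches_gallery_checksum` are hypothetical, and the file is assumed to already exist on disk.

```python
import hashlib
from pathlib import Path


def sha256_of(path: Path, chunk_size: int = 1 << 20) -> str:
    """Return the hex SHA-256 digest of a file, read in chunks to bound memory use."""
    digest = hashlib.sha256()
    with path.open("rb") as fh:
        # iter() with a sentinel stops once read() returns an empty bytes object.
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_gallery_checksum(path: Path, expected: str) -> bool:
    """Compare a file's digest with the `sha256` value from a gallery entry."""
    return sha256_of(path) == expected.strip().lower()
```

For example, assuming `llama-3.2-1b-instruct-q4_k_m.gguf` has been downloaded to the working directory, `matches_gallery_checksum(Path("llama-3.2-1b-instruct-q4_k_m.gguf"), "1d0e9419ec4e12aef73ccf4ffd122703e94c48344a96bc7c5f0f2772c2152ce3")` would report whether the local copy matches the digest recorded in this commit.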