Skip to content

Commit af426d3

Browse files
mudlergithub-actions[bot]
authored andcommitted
chore(model gallery): 🤖 add new models via gallery agent
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
1 parent 41de7d3 commit af426d3

File tree

1 file changed

+32
-0
lines changed

1 file changed

+32
-0
lines changed

gallery/index.yaml

Lines changed: 32 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -1,4 +1,36 @@
11
---
2+
- name: "meta-llama-3.1-70b-instruct-malaysian"
3+
url: "github:mudler/LocalAI/gallery/virtual.yaml@master"
4+
urls:
5+
- https://huggingface.co/mradermacher/Meta-Llama-3.1-70B-Instruct-Malaysian-GGUF
6+
description: |
7+
The model is a quantized version of **Meta-Llama-3.1-70B-Instruct** (base model: `meta-llama/Llama-3.1-70B-Instruct`), optimized for efficiency with SFT LoRA training. Key features include:
8+
- **Rank 256** linear layers with α × 2.0
9+
- **16384 context length** (multipacked with 32-bit batch size)
10+
- **Liger fused cross-entropy**
11+
- **1e-4 learning rate** (50 warmup, 3 epochs)
12+
- Quantized for deployment (e.g., Q4_K_S, Q8_0)
13+
14+
This version is derived from SFT LoRA training on the `Scicom-intl/Malaysian-Instructions` dataset, with source code available at [this link](https://github.com/Scicom-AI-Enterprise-Organization/small-ablation).
15+
overrides:
16+
parameters:
17+
model: llama-cpp/models/Meta-Llama-3.1-70B-Instruct-Malaysian.Q4_K_M.gguf
18+
name: Meta-Llama-3.1-70B-Instruct-Malaysian-GGUF
19+
backend: llama-cpp
20+
template:
21+
use_tokenizer_template: true
22+
known_usecases:
23+
- chat
24+
function:
25+
grammar:
26+
disable: true
27+
description: Imported from https://huggingface.co/mradermacher/Meta-Llama-3.1-70B-Instruct-Malaysian-GGUF
28+
options:
29+
- use_jinja:true
30+
files:
31+
- filename: llama-cpp/models/Meta-Llama-3.1-70B-Instruct-Malaysian.Q4_K_M.gguf
32+
sha256: 314c0ffd4381049259c88e95f5ce693581d2808471fc195761bd59b96635004c
33+
uri: https://huggingface.co/mradermacher/Meta-Llama-3.1-70B-Instruct-Malaysian-GGUF/resolve/main/Meta-Llama-3.1-70B-Instruct-Malaysian.Q4_K_M.gguf
234
- name: "rwkv7-g1c-13.3b"
335
url: "github:mudler/LocalAI/gallery/virtual.yaml@master"
436
urls:

0 commit comments

Comments
 (0)