-
Notifications
You must be signed in to change notification settings - Fork 14.2k
Pull requests: ggml-org/llama.cpp
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
ggml-cuda : refactor repetitive switch case statements in mmf
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
#18260
opened Dec 21, 2025 by
Aadeshveer
Loading…
Add Gemma3n multimodal support with MobileNetV5 vision encoder
examples
model
Model specific
python
python script changes
#18256
opened Dec 21, 2025 by
simrnsingh
Loading…
New quantization type: Q3_HIFI
Apple Metal
https://en.wikipedia.org/wiki/Metal_(API)
documentation
Improvements or additions to documentation
examples
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
python
python script changes
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
testing
Everything test related
Vulkan
Issues specific to the Vulkan backend
#18246
opened Dec 21, 2025 by
geoffmunn
Loading…
ggml rpc : Add missing check for rpc buffer type
ggml
changes relating to the ggml tensor library for machine learning
#18242
opened Dec 21, 2025 by
struct
Loading…
ggml-cpu: parallelize tensor repacking with OpenMP
ggml
changes relating to the ggml tensor library for machine learning
#18239
opened Dec 21, 2025 by
pestopoppa
Loading…
cli: buffering info log, only show if model load failed
examples
#18236
opened Dec 20, 2025 by
ngxson
Loading…
llama: fix RPC for -fit on
ggml
changes relating to the ggml tensor library for machine learning
#18233
opened Dec 20, 2025 by
JohannesGaessler
Loading…
server : implement extra_args support for /models/load endpoint
devops
improvements to build systems and github actions
examples
server
#18232
opened Dec 20, 2025 by
Chrisischris
•
Draft
webui: Fix the header backdrop blur
examples
server
#18230
opened Dec 20, 2025 by
ImadSaddik
Loading…
server: /v1/responses (text generation only)
examples
python
python script changes
server
#18227
opened Dec 20, 2025 by
openingnow
•
Draft
webui: use server presets as parameter placeholders
examples
server
#18226
opened Dec 20, 2025 by
ServeurpersoCom
Loading…
ggml-metal: guard buffer map slicing
Apple Metal
https://en.wikipedia.org/wiki/Metal_(API)
ggml
changes relating to the ggml tensor library for machine learning
#18225
opened Dec 20, 2025 by
SzymonPrajs
Loading…
webui: apply webui_settings on first load
examples
server
#18223
opened Dec 20, 2025 by
ServeurpersoCom
Loading…
ggml-metal: fix memset range and temp buffer leaks
Apple Metal
https://en.wikipedia.org/wiki/Metal_(API)
ggml
changes relating to the ggml tensor library for machine learning
#18221
opened Dec 20, 2025 by
SzymonPrajs
Loading…
convert: rework ftype heuristics
python
python script changes
#18214
opened Dec 20, 2025 by
taronaeo
Loading…
ggml-metal: fix bf16/f16 matmul kernels
Apple Metal
https://en.wikipedia.org/wiki/Metal_(API)
ggml
changes relating to the ggml tensor library for machine learning
testing
Everything test related
#18210
opened Dec 20, 2025 by
SzymonPrajs
Loading…
Fix BLAS Compile Definitions
ggml
changes relating to the ggml tensor library for machine learning
#18205
opened Dec 19, 2025 by
DaAwesomeP
Loading…
HIP: Use mmq on MFMA devices for MUL_MAT_ID in cases where a lot of splits would be generated
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
#18202
opened Dec 19, 2025 by
IMbackK
Loading…
llamafile: add rvv support for sgemm kernels
ggml
changes relating to the ggml tensor library for machine learning
#18199
opened Dec 19, 2025 by
taimur-10x
Loading…
cmake: Added more x86_64 CPU backends when building with changes relating to the ggml tensor library for machine learning
GGML_CPU_ALL_VARIANTS=On
ggml
vulkan: Warptile tuning for Intel Xe2/Xe3
ggml
changes relating to the ggml tensor library for machine learning
Vulkan
Issues specific to the Vulkan backend
#18178
opened Dec 18, 2025 by
virajwad
Loading…
tool/ex/tests: consistently free ctx, then model
examples
testing
Everything test related
#18168
opened Dec 18, 2025 by
JohannesGaessler
Loading…
Previous Next
ProTip!
no:milestone will show everything without a milestone.