-
Notifications
You must be signed in to change notification settings - Fork 13.2k
Pull requests: ggml-org/llama.cpp
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Fix missing messages on sibling navigation
examples
server
#16408
opened Oct 3, 2025 by
allozaur
Loading…
server / ranking : add sorting and management of top_n
examples
server
#16403
opened Oct 3, 2025 by
YannFollet
Loading…
ggml webgpu: actually add softmax, fix rms_norm offset
ggml
changes relating to the ggml tensor library for machine learning
#16400
opened Oct 3, 2025 by
reeselevine
Loading…
ggml : fix graph reallocation with multiple chunks
ggml
changes relating to the ggml tensor library for machine learning
testing
Everything test related
#16396
opened Oct 3, 2025 by
Acly
Loading…
refactor: centralize CoT parsing in backend for streaming mode
examples
server
testing
Everything test related
#16394
opened Oct 2, 2025 by
ServeurpersoCom
Loading…
tests : add -INF blocks to the KQ mask in the FA tests
testing
Everything test related
#16380
opened Oct 2, 2025 by
ggerganov
Loading…
metal : index FA blocks
Apple Metal
https://en.wikipedia.org/wiki/Metal_(API)
ggml
changes relating to the ggml tensor library for machine learning
testing
Everything test related
#16372
opened Oct 1, 2025 by
ggerganov
Loading…
model: EmbeddingGemma Adding Support for SentenceTransformers Dense Modules
python
python script changes
#16367
opened Oct 1, 2025 by
sfallah
Loading…
Add support to New feature or request
examples
server/webui
server
◁think▷...◁/think▷
format and DRY the thinking processing logic
enhancement
#16364
opened Sep 30, 2025 by
allozaur
Loading…
Add ARANGE Operator to SYCL Backend (Small & Focused Changes)
ggml
changes relating to the ggml tensor library for machine learning
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
#16362
opened Sep 30, 2025 by
GittyBurstein
Loading…
feat: render user content as markdown option
examples
server
#16358
opened Sep 30, 2025 by
ServeurpersoCom
Loading…
vulkan: Replace uses of maxMemoryAllocationSize and VK_WHOLE_SIZE
ggml
changes relating to the ggml tensor library for machine learning
Vulkan
Issues specific to the Vulkan backend
#16354
opened Sep 30, 2025 by
jeffbolznv
Loading…
SYCL SET operator optimized for F32 tensors
ggml
changes relating to the ggml tensor library for machine learning
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
#16350
opened Sep 30, 2025 by
GittyBurstein
Loading…
Update build.md
documentation
Improvements or additions to documentation
#16346
opened Sep 30, 2025 by
refine360-debug
Loading…
vulkan : incremental shader builds
ggml
changes relating to the ggml tensor library for machine learning
Vulkan
Issues specific to the Vulkan backend
#16341
opened Sep 29, 2025 by
Acly
Loading…
ggml-cpu : inspect -march and -mcpu to found the CPU
ggml
changes relating to the ggml tensor library for machine learning
#16333
opened Sep 29, 2025 by
angt
Loading…
ggml : fix unaligned access in AMX code
ggml
changes relating to the ggml tensor library for machine learning
#16315
opened Sep 28, 2025 by
ggerganov
Loading…
ggml : remove SVE paths
ggml
changes relating to the ggml tensor library for machine learning
#16314
opened Sep 28, 2025 by
ggerganov
Loading…
Enable Intel AMX acceleration while in CPU/GPU hybrid with new "--amx" toggle.
examples
#16310
opened Sep 28, 2025 by
Gadflyii
Loading…
Previous Next
ProTip!
Mix and match filters to narrow down what you’re looking for.