Pull requests: ggml-org/llama.cpp
kleidiai: generalize compute_forward_kv_cache to compute_forward_fp16
#15817
opened Sep 5, 2025 by
chaxu01
CANN: implement LRU cache for ACL graphs in CANN backend
Ascend NPU
issues specific to Ascend NPUs
ggml
changes relating to the ggml tensor library for machine learning
#15814
opened Sep 5, 2025 by
noemotiovon
CUDA: Conv2d Tensor Core
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
#15813
opened Sep 5, 2025 by
mnehete32
CANN: Switch to stream synchronization
Ascend NPU
issues specific to Ascend NPUs
ggml
changes relating to the ggml tensor library for machine learning
#15809
opened Sep 5, 2025 by
noemotiovon
Add conv2d Implicit GEMM
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
testing
Everything test related
CUDA: fastdiv, launch bounds for mmvq + q8_1 quant
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
#15802
opened Sep 4, 2025 by
JohannesGaessler
ggml-backend : add GGML_BACKEND_DEVICE_TYPE_IGPU device type
examples
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
Vulkan
Issues specific to the Vulkan backend
#15797
opened Sep 4, 2025 by
slaren
vulkan: support im2col_3d
ggml
changes relating to the ggml tensor library for machine learning
testing
Everything test related
Vulkan
Issues specific to the Vulkan backend
#15795
opened Sep 4, 2025 by
jeffbolznv
vulkan: Support pad_ext
ggml
changes relating to the ggml tensor library for machine learning
testing
Everything test related
Vulkan
Issues specific to the Vulkan backend
#15794
opened Sep 4, 2025 by
jeffbolznv
Add docker:// protocol support for llama-server model pulling
#15790
opened Sep 4, 2025 by
ericcurtin
lama
devops
improvements to build systems and github actions
#15788
opened Sep 4, 2025 by
Karen86Tonoyan
Add model to header title with mouseover
examples
server
#15787
opened Sep 4, 2025 by
pudepiedj
docker : Fix AMDGPU_TARGETS deprecated warnning message
devops
improvements to build systems and github actions
#15786
opened Sep 4, 2025 by
haiyuewa
ggml: allow casting between f32 and i32
Apple Metal
https://en.wikipedia.org/wiki/Metal_(API)
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
testing
Everything test related
Vulkan
Issues specific to the Vulkan backend
#15783
opened Sep 4, 2025 by
ngxson
4 tasks done
example: add prediction-next-token command line argument handling Example for show probability of next token
examples
#15774
opened Sep 3, 2025 by
no4ni
CUDA: faster tile FA (Pascal/AMD), headsize 256
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
#15769
opened Sep 3, 2025 by
JohannesGaessler
CUDA: Add mul_mat_id support for the mmf kernel
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
testing
Everything test related
#15767
opened Sep 3, 2025 by
am17an
Consolidate multiple tensor copies to reduce API overhead
ggml
changes relating to the ggml tensor library for machine learning
#15750
opened Sep 2, 2025 by
agray3
nix: Added missing packages and options for ROCm build
devops
improvements to build systems and github actions
ggml
changes relating to the ggml tensor library for machine learning
nix
Issues specific to consuming flake.nix, or generally concerned with ❄ Nix-based llama.cpp deployment
#15747
opened Sep 2, 2025 by
SteelPh0enix
ggml-cpu: fixes instability in NNPA Vector Intrinsics
documentation
Improvements or additions to documentation
ggml
changes relating to the ggml tensor library for machine learning
opencl: initial q8_0 mv support
ggml
changes relating to the ggml tensor library for machine learning
OpenCL
Issues specific to the OpenCL backend
#15732
opened Sep 2, 2025 by
lhez