Skip to content

Conversation

@bannazz
Copy link

@bannazz bannazz commented Oct 26, 2025

Hello @slaren,
I hope this finds you well.

This release script alteration adds a job that results in two artifacts:

  1. llama-bin-ubuntu-cuda-12.4-x64.zip (the llama.cpp library compiled with cuda support)

  2. cudart-llama-bin-ubuntu-cuda-12.4-x64.zip (stand-alone version that saves the trouble of having to download any third part dependencies saving about 7 GB of dependency downloads and the compilation time)

Tested on Ubuntu 22.04 x64.

I wrote the job with the same conventions as the windows CUDA versions and the Ubuntu release jobs in the release file.

The result is a package exactly like the cpu version with 1 extra file (libggml-cuda.so) in case of the first package or 4 in case of the standalone one (libggml-cuda.so, libcublas.so.12, libcublasLt.so.12, libcudart.so.12)

I only used the Ubuntu and Nvidia officia packages for the build no other third-parties. In other words, it will work for all Ubuntu 22.04 x64 systems.

Best regards.

Adds a job to package a release for ubuntu 22.04 CUDA.
Removed white spaces.
adds a standalone release for ubuntu-22-cuda (works without having to download about 7 Gigs for nvidia's cuda toolkit and deps)
adds support for building from cached builds
sticking to naming conventions to avoid upload script alterations
fixes uploading after build for the cudart zip
@bannazz bannazz requested a review from slaren as a code owner October 26, 2025 05:23
@github-actions github-actions bot added the devops improvements to build systems and github actions label Oct 26, 2025
@bannazz bannazz changed the title Adding CUDA support for Ubuntu Adding CUDA release for Ubuntu Oct 26, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

devops improvements to build systems and github actions

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant