rtx-5060ti

Here are 2 public repositories matching this topic...

Andgihat / llama-cpp-mtp-turboquant-sm120-blackwell-windows

Windows prebuilt of llama.cpp combining Multi-Token Prediction (MTP) + TurboQuant KV cache compression + native sm_120 (Blackwell consumer GPU, FP4 tensor cores). For RTX 5060 Ti / 5070 / 5080 / 5090.

windows prebuilt mtp blackwell llama-cpp rtx-5090 cuda-12-8 sm-120 turboquant rtx-50 rtx-5060ti

Updated Jun 5, 2026

Yisau7070 / llama-cpp-mtp-turboquant-sm120-blackwell-windows


Run llama.cpp with Multi-Token Prediction and TurboQuant on Windows using native sm_120 Blackwell support for RTX 50-series GPUs.

windows prebuilt mtp blackwell llama-cpp rtx-5090 sm-120 turboquant rtx-50 rtx-5060ti

Updated Jun 23, 2026
