nvidia-blackwell

Here are 12 public repositories matching this topic...

ChiefNakor / comfyui-blackwell-docker

A production-ready Docker setup for ComfyUI that unlocks the full potential of NVIDIA Blackwell GPUs (RTX 50 series) through 4-bit quantization with NVFP4.

docker pytorch nvidia image-generation nvidia-cuda ai-art stable-diffusion comfyui flux-ai nvidia-blackwell nvfp4

Updated Jan 28, 2026
Dockerfile

dconsorte / pytorch-tensorflow-gpu

RTX 5090 & RTX 5060 Docker container with PyTorch + TensorFlow. First fully tested Blackwell GPU support for ML/AI. CUDA 12.8, Python 3.11, Ubuntu 24.04. Works with RTX 50-series (5090/5080/5070/5060) and RTX 40-series.

docker machine-learning deep-learning tensorflow cuda pytorch gpu-computing blackwell rtx-5090 rtx-5060 blackwell-gpu nvidia-blackwell cuda-12-8 rtx-50-series rtx-5080

Updated Jul 8, 2025
Shell

mmontes11 / opencode-test

Sample application generated using Opencode and Ollama

ai opencode nvidia gpt ai-agent llm ollama nvidia-blackwell gpt-oss gpt-oss-20b

Updated Feb 15, 2026
Go

NvMayMay / nvfp4-lora-spark

NVFP4 LoRA fine-tuning and serving on a single NVIDIA DGX Spark (GB10, 128 GB UMA). Fused Triton dequant; multi-family (Nemotron-3, Mistral-Small-4, Qwen3.x).

Updated Jun 23, 2026
Python

nikhilj202 / comfyui-blackwell-docker

🚀 Accelerate image generation with ComfyUI's Docker for NVIDIA Blackwell GPUs, optimizing speed and memory usage through NVFP4 support.

docker pytorch nvidia image-generation nvidia-cuda ai-art stable-diffusion comfyui flux-ai nvidia-blackwell nvfp4

Updated Jun 23, 2026
HTML

pscamillo / icicle-blackwell-ntt

Empirical characterization of ICICLE NTT on consumer NVIDIA Blackwell (RTX 5070, sm_120), with a prototype for the digit-reversal bottleneck.

performance cryptography gpu cuda gpu-computing profiling icicle zero-knowledge ntt nvidia-blackwell

Updated May 29, 2026
Cuda

gittensor-ai-lab / sparkinfer-moe

Sync-free MoE dispatch engine with CUDA-graph-safe routing for Qwen3.5-35B and Gemma4 on RTX Spark and RTX 5090

cuda moe mixture-of-experts edge-ai nvidia-blackwell cuda-graphs inference-runtime gittensor sn74 rtx-spark

Updated Jun 22, 2026
C++

gittensor-ai-lab / sparkinfer-runtime

Edge AI inference runtime: scheduler, memory manager, CUDA graph engine, KV cache, MoE dispatch

cuda moe edge-ai unified-memory llm-inference nvidia-blackwell inference-runtime gittensor sn74 rtx-spark

Updated Jun 22, 2026
C++

gittensor-ai-lab / sparkinfer-agent

NCU-driven autonomous kernel optimization agent: profile → identify bottleneck → propose variant → compile → benchmark

cuda autotuning edge-ai ai-agent nsight-compute kernel-optimization nvidia-blackwell gittensor sn74 rtx-spark

Updated Jun 22, 2026
Python

gittensor-ai-lab / sparkinfer-bench

Reproducible MoE inference benchmarks for RTX Spark and RTX 5090: flash decode, grouped GEMM, end-to-end generation

benchmarking cuda moe edge-ai llm-inference nvidia-blackwell gittensor sn74 rtx-spark

Updated Jun 22, 2026
Python

gittensor-ai-lab / sparkinfer-kernels

Native C++/CUDA and CuTe DSL kernel library for edge MoE inference: flash decode, sync-free GroupGEMM+SwiGLU, head_dim=512 attention

cuda moe cutlass edge-ai flash-attention nvidia-blackwell cute-dsl gittensor grouped-gemm rtx-spark

Updated Jun 22, 2026
Cuda

cghart / ds4

Fork of antirez/ds4 with CUDA decode optimizations for NVIDIA GB10 / DGX Spark. 15.16 tok/s on DS4-Flash at ctx 7047. See GB10.md.

cuda inference llm gguf deepseek gb10 nvidia-blackwell dgx-spark

Updated May 24, 2026
C
