    Repositories list

    • Ongoing research training transformer models at scale
      Python
      3.6k15k312275 Updated Feb 4, 2026
    • NVIDIA GPU Operator creates, configures, and manages GPUs in Kubernetes
      Go
      4472.5k7559 Updated Feb 4, 2026
    • NVIDIA Resiliency Extension is a python package for framework developers and users to implement fault-tolerant features. It improves the effective training time…
      Python
      42257219 Updated Feb 4, 2026
    • Examples for Recommenders - easy to train and deploy on accelerated infrastructure.
      Python
      43214398 Updated Feb 4, 2026
    • C++ and Python support for the CUDA Quantum programming model for heterogeneous quantum-classical workflows
      C++
      32991342392 Updated Feb 4, 2026
    • NVSentinel

      Public
      NVSentinel is a cross-platform fault remediation service designed to rapidly remediate runtime node-level issues in GPU-accelerated computing environments
      Go
      411754120 Updated Feb 4, 2026
    • BioNeMo Framework: For building and adapting AI models in drug discovery at scale
      Jupyter Notebook
      11865161118 Updated Feb 4, 2026
    • A unified library of SOTA model optimization techniques like quantization, pruning, distillation, speculative decoding, etc. It compresses deep learning models …
      Python
      2541.9k6479 Updated Feb 4, 2026
    • OSMO

      Public
      The developer-first platform for scaling complex Physical AI workloads across heterogeneous compute—unifying training GPUs, simulation clusters, and edge device…
      Python
      8866416 Updated Feb 4, 2026
    • pants

      Public
      The Pants Build System
      Python
      685300 Updated Feb 4, 2026
    • Fuser

      Public
      A Fusion Code Generator for NVIDIA GPUs (commonly known as "nvFuser")
      C++
      77379213195 Updated Feb 4, 2026
    • NeMo-speech-data-processor

      Public
      A toolkit for processing speech data and creating speech datasets
      Python
      42197617 Updated Feb 4, 2026
    • cccl

      Public
      CUDA Core Compute Libraries
      C++
      3322.2k1.2k203 Updated Feb 4, 2026
    • nsmd

      Public
      MCTP VDM-based Nvidia System Management API
      C++
      1710 Updated Feb 4, 2026
    • Ubuntu kernels which are optimized for NVIDIA server systems
      5490018 Updated Feb 4, 2026
    • Kubernetes Device Plugin to help cold plug vfio/iommufd GPUs in Kata VMs for Confidential Containers
      Go
      7237 Updated Feb 4, 2026
    • topograph

      Public
      A toolkit for discovering cluster network topology.
      Go
      129623 Updated Feb 4, 2026
    • cuopt

      Public
      GPU accelerated decision optimization
      Cuda
      1216918221 Updated Feb 4, 2026
    • k8s-nim-operator

      Public
      An Operator for deployment and maintenance of NVIDIA NIMs and NeMo microservices in a Kubernetes environment.
      Go
      37146427 Updated Feb 4, 2026
    • RAPIDS Accelerator JNI For Apache Spark
      Cuda
      7753845 Updated Feb 4, 2026
    • MIG Partition Editor for NVIDIA GPUs
      Go
      562412220 Updated Feb 4, 2026
    • TensorRT-LLM

      Public
      TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inferen…
      Python
      2.1k13k530501 Updated Feb 4, 2026
    • The NVIDIA GPU driver container allows the provisioning of the NVIDIA driver through the use of containers.
      Shell
      711552936 Updated Feb 4, 2026
    • k8s-device-plugin

      Public
      NVIDIA device plugin for Kubernetes
      Go
      7863.7k7036 Updated Feb 4, 2026
    • A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit and 4-bit floating point (FP8 and FP4) precision on Hopper, Ada and Blackwel…
      Python
      6223.1k242120 Updated Feb 4, 2026
    • vgpu-device-manager

      Public
      NVIDIA vGPU Device Manager manages NVIDIA vGPU devices on top of Kubernetes
      Go
      2415508 Updated Feb 4, 2026
    • numbast

      Public
      Numbast is a tool to build an automated pipeline that converts CUDA APIs into Numba bindings.
      Python
      18572911 Updated Feb 4, 2026
    • cutlass

      Public
      CUDA Templates and Python DSLs for High-Performance Linear Algebra
      C++
      1.7k9.2k44497 Updated Feb 4, 2026
    • NVFlare

      Public
      NVIDIA Federated Learning Application Runtime Environment
      Python
      2338841520 Updated Feb 4, 2026
    • cudaqx

      Public
      Accelerated libraries for quantum-classical computing built on CUDA-Q.
      C++
      47782720 Updated Feb 4, 2026