    Repositories list

    • aiter

      Public
      AI Tensor Engine for ROCm
      Python
      MIT License
      1352810 Updated Mar 12, 2025
    • This is the AMD-maintained fork of the LLVM git repository. This repository accepts pull requests and issues related to AMD fork-specific topics (amd/*). For all other issues/PRs, please submit upstream at https://github.com/llvm/llvm-project.
      LLVM
      Other
      13k136187 Updated Mar 12, 2025
    • rocMLIR

      Public
      MLIR
      Other
      40137122 Updated Mar 12, 2025
    • MIOpen

      Public
      AMD's Machine Intelligence Library
      Assembly
      Other
      2421.1k24799 Updated Mar 12, 2025
    • hipBLASLt

      Public
      hipBLASLt is a library that provides general matrix-matrix operations with a flexible API and extends functionalities beyond a traditional BLAS library
      Assembly
      MIT License
      11181990 Updated Mar 12, 2025
    • xla

      Public
      A machine learning compiler for GPUs, CPUs, and ML accelerators
      C++
      Apache License 2.0
      5153029 Updated Mar 12, 2025
    • TheRock

      Public
      The HIP Environment and ROCm Kit - A lightweight open source build system for HIP and ROCm
      CMake
      Apache License 2.0
      1026439 Updated Mar 12, 2025
    • aotriton

      Public
      Ahead of Time (AOT) Triton Math Library
      Python
      MIT License
      1954123 Updated Mar 12, 2025
    • Composable Kernel: Performance Portable Programming Model for Machine Learning Tensor Operators
      C++
      Other
      1613623160 Updated Mar 12, 2025
    • TensorFlow ROCm port
      C++
      Apache License 2.0
      75k6902478 Updated Mar 12, 2025
    • rocSHMEM

      Public
      rocSHMEM intra-kernel networking runtime for AMD dGPUs on the ROCm platform.
      C++
      MIT License
      126183 Updated Mar 12, 2025
    • vllm

      Public
      A high-throughput and memory-efficient inference and serving engine for LLMs
      Python
      Apache License 2.0
      6.2k681025 Updated Mar 12, 2025
    • triton

      Public
      Development repository for the Triton language and compiler
      Python
      MIT License
      1.9k109651 Updated Mar 12, 2025
    • Python
      Other
      1222810 Updated Mar 12, 2025
    • pytorch

      Public
      Tensors and Dynamic neural networks in Python with strong GPU acceleration
      Python
      Other
      24k2225741 Updated Mar 12, 2025
    • A system validation and diagnostics tool for monitoring, stress testing, detecting, and troubleshooting issues impacting AMD GPUs in high-performance computing environments
      C++
      MIT License
      416739 Updated Mar 12, 2025
    • hipTensor

      Public
      AMD’s C++ library for accelerating tensor primitives
      C++
      MIT License
      223803 Updated Mar 12, 2025
    • 🤗Transformers: State-of-the-art Natural Language Processing for Pytorch and TensorFlow 2.0.
      Python
      Apache License 2.0
      28k503 Updated Mar 12, 2025
    • AMD's graph optimization engine.
      C++
      MIT License
      9621435547 Updated Mar 12, 2025
    • aomp

      Public
      AOMP is an open source Clang/LLVM based compiler with added support for the OpenMP® API on Radeon™ GPUs. Use this repository for releases, issues, documentation, packaging, and examples.
      Fortran
      Apache License 2.0
      50211246 Updated Mar 12, 2025
    • Tensile

      Public
      Stretching GPU performance for GEMMs and tensor contractions.
      Python
      MIT License
      15823225 Updated Mar 12, 2025
    • Advanced Profiling and Analytics for AMD Hardware
      Python
      Other
      511415215 Updated Mar 12, 2025
    • ROCdbgapi

      Public
      The AMD Debugger API is a library that provides all the support necessary for a debugger and other tools to perform low level control of the execution and inspection of execution state of AMD's commercially available GPU architectures.
      C++
      MIT License
      141920 Updated Mar 12, 2025
    • ROCgdb

      Public
      This is ROCgdb, the ROCm source-level debugger for Linux, based on GDB, the GNU source-level debugger.
      C
      GNU General Public License v2.0
      105441 Updated Mar 12, 2025
    • ROCm Platform Runtime: ROCr, an HPC-market-enhanced, HSA-based runtime
      C++
      Other
      1162372124 Updated Mar 12, 2025
    • rdc

      Public
      RDC
      C++
      MIT License
      112712 Updated Mar 12, 2025
    • amdsmi

      Public
      AMD SMI
      C++
      MIT License
      345469 Updated Mar 12, 2025
    • ray

      Public
      Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
      Python
      Apache License 2.0
      6.1k000 Updated Mar 11, 2025
    • Libraries integrating migraphx with pytorch
      Python
      BSD 3-Clause "New" or "Revised" License
      26157 Updated Mar 11, 2025
    • rccl

      Public
      ROCm Communication Collectives Library (RCCL)
      C++
      Other
      1423051228 Updated Mar 11, 2025