Skip to content
Change the repository type filter

All

    Repositories list

    • flashinfer

      Public
      FlashInfer: Kernel Library for LLM Serving
      Python
      Apache License 2.0
      7495.1k337127Updated Mar 1, 2026Mar 1, 2026
    • flashinfer-bench

      Public
      Building the Virtuous Cycle for AI-driven LLM Systems
      Python
      Apache License 2.0
      261911311Updated Feb 27, 2026Feb 27, 2026
    • ci-infra

      Public
      Shell
      Apache License 2.0
      1000Updated Feb 20, 2026Feb 20, 2026
    • whl

      Public
      Pre-built wheels for flashinfer python package.
      HTML
      5200Updated Feb 19, 2026Feb 19, 2026
    • mlsys26-agent-baseline

      Public
      Python
      Apache License 2.0
      41500Updated Feb 13, 2026Feb 13, 2026
    • flashinfer-bench-starter-kit

      Public template
      FlashInfer Bench @ MLSys 2026: Building AI agents to write high performance GPU kernels
      Python
      9214171Updated Feb 9, 2026Feb 9, 2026
    • cubloaty

      Public
      a size profiler for cuda binary
      Python
      Apache License 2.0
      07210Updated Jan 15, 2026Jan 15, 2026
    • flashinfer-ai.github.io

      Public
      Project website of FlashInfer project
      SCSS
      4020Updated Jan 3, 2026Jan 3, 2026
    • flashinfer-trace

      Public
      Python
      3202Updated Oct 29, 2025Oct 29, 2025
    • web-data

      Public
      Apache License 2.0
      0000Updated Jun 25, 2025Jun 25, 2025
    • Python
      Apache License 2.0
      36500Updated Apr 26, 2025Apr 26, 2025
    • Simple python library for generating your own perfetto traces for your application. Can be used for both app instrumentation and custom trace generation (for y…
      Python
      Apache License 2.0
      7100Updated Apr 16, 2025Apr 16, 2025
    • flashinfer-nightly

      Public archive
      FlashInfer Nightly
      MIT License
      1600Updated Apr 9, 2025Apr 9, 2025
    • Apache License 2.0
      0400Updated Apr 2, 2025Apr 2, 2025
    • Jupyter Notebook
      0200Updated Jan 10, 2025Jan 10, 2025
    • Debug print operator for cudagraph debugging
      Cuda
      21411Updated Aug 2, 2024Aug 2, 2024
    • The LLVM Project is a collection of modular and reusable compiler and toolchain technologies.
      Other
      16k000Updated Apr 21, 2024Apr 21, 2024
    • candle

      Public
      Minimalist ML framework for Rust
      Rust
      Apache License 2.0
      1.4k000Updated Mar 7, 2024Mar 7, 2024