    Repositories list

    • Dream-E

      Public
      TypeScript
      0200 Updated Apr 13, 2026
    • Building an agentic voice assistant for mobile & desktop devices with episodic, semantic & procedural memories
      Apache License 2.0
      0100 Updated Apr 12, 2026
    • Admin Bud-E is a lightweight, privacy-first control center for AI chat, speech-to-text, and text-to-speech. Manage providers, routing, and costs with a simple A…
      Python
      Apache License 2.0
      2100 Updated Apr 12, 2026
    • School Bud-E is an intelligent and empathetic learning assistant designed to revolutionize the educational experience.
      TypeScript
      Apache License 2.0
      3100 Updated Apr 12, 2026
    • Retrieval-augmented voice cloning and emotion conditioning data generation pipeline. Combines Echo TTS, ChatterboxVC, and Empathic Insight Voice+ to generate la…
      Python
      Other
      0300 Updated Apr 3, 2026
    • Multi-node scaling benchmarks for CLAP contrastive audio-language models on HPC clusters
      Python
      0000 Updated Mar 29, 2026
    • Scaled diffusion transformer for text-to-speech synthesis (DiT + T5Gemma2 conditioning, TorchTitan & Megatron backends, tested up to 1024 GPUs)
      Python
      0500 Updated Mar 29, 2026
    • JAX/TPU training code for EchoTTS with DACVAE latent codec
      Python
      0000 Updated Mar 29, 2026
    • laion.ai

      Public
      HTML
      MIT License
      4612253 Updated Mar 27, 2026
    • High-level Python library for zero-shot voice conversion using Resemble AI's Chatterbox S3Gen model
      Python
      Apache License 2.0
      0100 Updated Mar 23, 2026
    • Jupyter Notebook
      41900 Updated Mar 22, 2026
    • CLIP-like model evaluation
      Python
      MIT License
      104813296 Updated Mar 19, 2026
    • Python
      1010920 Updated Feb 28, 2026
    • Open-weights voice acting pipeline combining zero-shot voice cloning with natural-language direction. Provide a reference voice (or generate one) and describe h…
      HTML
      Apache License 2.0
      01400 Updated Feb 16, 2026
    • Apache License 2.0
      0800 Updated Feb 13, 2026
    • AIW

      Public
      Alice in Wonderland code base for experiments and raw experiments data
      Python
      Apache License 2.0
      1013121 Updated Feb 4, 2026
    • Audio Dataset for training CLAP and other models
      Python
      59734215 Updated Jan 8, 2026
    • vocolino

      Public
      Apache License 2.0
      0000 Updated Dec 10, 2025
    • OpenCLIP fork with MaMMUT support
      Python
      Other
      2511 Updated Nov 10, 2025
    • MegaTron open-sci fork
      Python
      Other
      3.8k700 Updated Oct 29, 2025
    • A frontend that is compatible with the school-bud-e-backend.
      TypeScript
      MIT License
      102201 Updated Oct 23, 2025
    • Official repository for the NeurIPS 2025 paper “EmoNet-Face: An Expert-Annotated Benchmark for Synthetic Emotion Recognition.” Includes a 40-category emotion ta…
      Jupyter Notebook
      MIT License
      0300 Updated Oct 20, 2025
    • TypeScript
      Apache License 2.0
      3000 Updated Sep 6, 2025
    • Python
      0000 Updated Sep 6, 2025
    • TypeScript
      Apache License 2.0
      3000 Updated Aug 30, 2025
    • Python
      0200 Updated Aug 18, 2025
    • Reproducible scaling laws for contrastive language-image learning (https://arxiv.org/abs/2212.07143)
      Jupyter Notebook
      1119010 Updated Jun 21, 2025
    • 🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference an…
      Python
      Apache License 2.0
      33k100 Updated Jun 12, 2025
    • 0000 Updated Jun 11, 2025
    • CLAP

      Public
      Contrastive Language-Audio Pretraining
      Python
      Creative Commons Zero v1.0 Universal
      2132.1k624 Updated May 15, 2025