The open-source voice synthesis studio powered by Qwen3-TTS.
-
Updated
Feb 23, 2026 - TypeScript
The open-source voice synthesis studio powered by Qwen3-TTS.
A ComfyUI custom node integration for multi-engine multi-language Text-to-Speech and Voice Conversion. Supports: RVC, Qwen3-TTS, Cozy Voice 3, Step Audio EditX, IndexTTS-2, Chatterbox (classic and multilingual 23-lang), F5-TTS, Higgs Audio 2 and VibeVoice with unlimited text length, SRT timing, Character support, and many audio tools
A multi-voice AI audiobook generator built on Qwen3-TTS — annotate scripts with an LLM, assign unique voices to each character, per-line style instructions for delivery, clone voices from reference audio, design new voices from text descriptions, train custom voices with LoRA fine-tuning, and export to MP3 or Audacity multi-track projects
Run Qwen3-TTS text-to-speech locally on Mac (M1/M2/M3/M4). Voice cloning, voice design, custom voices. 100% offline using MLX.
MimikaStudio - A local-first application for macOS (Apple Silicon) + Agentic MCP Support
Japanese GUI + Whisper auto-transcription for Qwen3-TTS. RTX 5090 tested.
Home Assistant integrates Alibaba Cloud's BaiLian Platform TTS
Enhanced Qwen3-TTS voice cloning GUI with multi-reference samples, variation generation, and audio preprocessing.
Free voice cloning and TTS for creators using Qwen3-TTS on Google Colab. Clone your voice with just a few seconds of audio. Complete guide to build your own notebook.
Qwen3-TTS Audiobook Studio: Ultimate local multi-role AI audiobook generator. Built-in 3s Voice Clone & Design. Portable one-click launch for Mac/Win. 极致本地 AI 有声书制作工坊。
🗣 Java Text to Speech (JSAPI2) engines (google cloud, cocoa, open jtalk, aquestalk(ゆっくり), voicevox(ずんだもん), coeiroink, aivisspeech, google genai, qwen3-tts)
🎙️ Qwen3-TTS-DubFlow: An open-source, human-in-the-loop AI dubbing workbench for novels, games, podcasts, and more. Features a "Design-then-Clone" workflow powered by Qwen3-TTS to achieve consistent identity and context-aware emotional performance.
A desktop interface for the powerful Qwen3-TTS model (1.7B CustomVoice). Run offline, ultra-low latency text-to-speech with emotive control directly on your GPU.
A Text to Speech App for Qwen3-TTS Family Models to create custom voices, voice cloning with minimal effort.
Easy fine-tuning for Qwen3-TTS: Fast voice cloning and high-quality multilingual speech synthesis.
Add a description, image, and links to the qwen3-tts topic page so that developers can more easily learn about it.
To associate your repository with the qwen3-tts topic, visit your repo's landing page and select "manage topics."