Transcribe and translate audio into LRC subtitle files using Whisper and LLMs (GPT, Claude, et al.).
Updated Mar 6, 2026 - Python
On-device speech-to-text engine powered by deep learning
Voice-to-text with push-to-talk for Wayland compositors
A dynamic, scalable AI chatbot built with Django REST framework, supporting custom training from PDFs, documents, websites, and YouTube videos. Leveraging OpenAI's GPT-3.5, Pinecone, FAISS, and Celery for seamless integration and performance.
VOXD is speech-to-text, voice-typing, and dictation software for Linux distributions. It is open-source, free of charge, and user-friendly, and aims to support as many Linux distros as possible.
📱 🏃 🍎 Fitness application that’s used to keep track of your physical fitness data, daily calorie count, invite friends to work out together and ultimately get healthy.
Voice to text, one key to input.
A bash script using OpenAI Whisper API for continuous audio transcription with automatic silence detection
Privacy-First Voice-to-Text with AI Enhancement for macOS
Chrome Web Speech API
Voice-to-text CLI for terminal users
🎬 KaKa Subtitle Assistant | VideoCaptioner - English Branch - An intelligent subtitle assistant based on LLMs and Faster Whisper, with one-click, high-speed muxing of video and subtitles. No discrete GPU needed. Subtitle generation, sentence breaking, and proofreading, all in one. Make subtitles with ease.
GUI for Faster‑Whisper‑XXL transcription tool: download YouTube audio, transcribe local files, manage models, and export multiple formats with themes and auto yt‑dlp updates.
Codo-File is a code editor that primarily supports JavaScript and Python, with partial Dart support. It also features a real-time website editor for creating your own website in the browser using HTML, CSS, and JavaScript. The project also includes image-to-text and voice-to-text features.
This package can be used to connect a Telegram bot to AI engines such as OpenAI ChatGPT, Dall-E, Midjourney, Stable Diffusion, etc.
A Chrome extension that enables seamless voice-to-text on any web page, using an advanced ASR API.
Kotlin Multiplatform Mobile Translator App
Free ChatGPT voice interaction and integration into Python workflows.
Android app to auto-generate subtitle files and translated subtitle files (using the unofficial online Google Translate API) for any audio/video file
macOS voice productivity app — built-in dictation, AI rewrite, and translation. Powered by local Whisper + LLM.