Skip to content
View mehdifanai's full-sized avatar

Highlights

  • Pro

Block or report mehdifanai

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results
Python 3,664 287 Updated Feb 27, 2025

Production-ready FastAPI wrapper for Zonos TTS models with GPU acceleration, voice cloning, and emotion control. Supports both Transformer and Hybrid variants. ⚠️ UNSTABLE API - INITIAL RELEASE

Python 30 5 Updated Feb 25, 2025

Zonos-v0.1 is a leading open-weight text-to-speech model trained on more than 200k hours of varied multilingual speech, delivering expressiveness and quality on par with—or even surpassing—top TTS …

Python 5,676 571 Updated Feb 18, 2025

A collection of MCP clients.

239 17 Updated Feb 27, 2025

Open source conversation framework and visual editor for structured Pipecat dialogues

Python 229 25 Updated Feb 28, 2025

A one-of-a-kind resume builder that keeps your privacy in mind. Completely secure, customizable, portable, open-source and free forever. Try it out today!

TypeScript 29,798 3,013 Updated Mar 1, 2025

an open source, extensible AI agent that goes beyond code suggestions - install, execute, edit, and test with any LLM

Rust 9,087 626 Updated Mar 1, 2025

Leverage the OpenAI Realtime API (12-17-2024) with this Next.js 15 starter template featuring shadcn/ui components, tool-calling & localization. Use starter to build Voice AI apps with WebRTC.

TypeScript 272 50 Updated Jan 25, 2025

Command Your World with Voice

Python 593 57 Updated Dec 8, 2024

Implementation of F5-TTS in MLX

Python 489 49 Updated Feb 2, 2025

🤱🏻 Turn any webpage into a desktop app with Rust. 🤱🏻 利用 Rust 轻松构建轻量级多端桌面应用

Rust 35,391 6,359 Updated Feb 23, 2025

Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audio codec.

Python 7,588 611 Updated Feb 28, 2025

Dockerized FastAPI wrapper for Kokoro-82M text-to-speech model w/CPU ONNX and NVIDIA GPU PyTorch support, handling, and auto-stitching

Python 1,770 219 Updated Mar 1, 2025
Python 1,461 191 Updated Feb 28, 2025

Witsy: desktop AI assistant

TypeScript 650 49 Updated Mar 1, 2025

Make websites accessible for AI agents

Python 34,567 3,537 Updated Mar 1, 2025

Run your own AI cluster at home with everyday devices 📱💻 🖥️⌚

Python 25,643 1,540 Updated Feb 28, 2025

Roo Code (prev. Roo Cline) gives you a whole dev team of AI agents in your code editor.

TypeScript 7,132 633 Updated Mar 1, 2025

Building a comprehensive and handy list of papers for GUI agents

Python 225 12 Updated Feb 20, 2025

An extension within PearAI. A fork of Continue: https://github.com/continuedev/continue

TypeScript 77 43 Updated Mar 1, 2025

PearAI: Open Source AI Code Editor (Fork of VSCode). The PearAI Submodule (https://github.com/trypear/pearai-submodule) is a fork of Continue.

TypeScript 469 140 Updated Feb 28, 2025

An Open Source Python alternative to NotebookLM's podcast feature: Transforming Multimodal Content into Captivating Multilingual Audio Conversations with GenAI

Python 3,277 377 Updated Feb 26, 2025

A fast multimodal LLM for real-time voice

Python 3,648 258 Updated Feb 14, 2025

The LLM Evaluation Framework

Python 5,283 442 Updated Feb 28, 2025

Local realtime voice AI

Python 2,233 125 Updated Feb 26, 2025

tiny vision language model

Python 7,499 581 Updated Feb 25, 2025

Tabula is a tool for liberating data tables trapped inside PDF files

CSS 6,921 657 Updated Sep 23, 2024

Official code for paper: Chain of Ideas: Revolutionizing Research via Novel Idea Development with LLM Agents

Python 434 26 Updated Jan 15, 2025
Python 280 25 Updated Dec 4, 2024
Next
Showing results