SoTA open-source TTS
☆155Dec 16, 2025Updated 4 months ago
Alternatives and similar repositories for chatterbox
Users that are interested in chatterbox are comparing it to the libraries listed below.
- VLLM Port of the Chatterbox TTS model☆375Oct 18, 2025Updated 6 months ago
- Orpheus TTS Server with streaming support (TTFB ~160ms)☆26Sep 21, 2025Updated 7 months ago
- Run Orpheus 3B Locally with Gradio UI, Standalone App☆24Apr 1, 2025Updated last year
- ☆19Feb 23, 2026Updated 2 months ago
- Streaming and Fine-tuning for Chatterbox TTS☆282Jun 15, 2025Updated 10 months ago
- Work with your business data using natural language☆19Nov 20, 2024Updated last year
- Adding a multi-text multi-speaker script (diffe) that is based on a script from asiff00 on issue 61 for Sesame: A Conversational Speech G…☆26Mar 28, 2025Updated last year
- The most feature-complete local AI workstation. Multi-GPU inference, integrated Stable Diffusion + ADetailer, voice cloning, research-gra…☆60Feb 24, 2026Updated 2 months ago
- A web application that converts speech to speech 100% private☆86Jun 3, 2025Updated 11 months ago
- A fully autonomous agent that accesses the browser and performs tasks.☆18Apr 25, 2025Updated last year
- Fast LLM swapping with sleep/wake support, compatible with vllm, llama.cpp, etc. llama-swap fork.☆40Apr 5, 2026Updated last month
- ☆18Jul 12, 2025Updated 9 months ago
- Implementation of Sesame's Conversational Speech Model for Hugging Face Transformers☆57May 17, 2025Updated 11 months ago
- Modified version of Chatterbox that accepts text files as input and no character restrictions. I use it to make audiobooks, especially fo…☆553Aug 23, 2025Updated 8 months ago
- SoTA open-source TTS for Audiobook and Podcast Generation☆200Jun 19, 2025Updated 10 months ago
- Speech-to-speech AI assistant with natural conversation flow, mid-speech interruption, vision capabilities and AI-initiated follow-ups. F…☆297Apr 14, 2025Updated last year
- Example repo showcasing model training and deployment with distil claude cli skill☆56Jan 19, 2026Updated 3 months ago
- A Conversational Speech Generation Model with Gradio UI and OpenAI compatible API. UI and API support CUDA, MLX and CPU devices.☆214May 9, 2025Updated 11 months ago
- ACE-Step: A Step Towards Music Generation Foundation Model☆50May 20, 2025Updated 11 months ago
- Yet another frontend for LLM, written using .NET and WinUI 3☆11Sep 14, 2025Updated 7 months ago
- ☆24May 22, 2024Updated last year
- SwiftLet is a lightweight Python framework for running open-source Large Language Models (LLMs) locally using safetensors☆29Aug 6, 2025Updated 9 months ago
- A professional-grade interface for Qwen3-TTS, designed to unlock the model's full potential with fine-grained control and intuitive workf…☆277Mar 30, 2026Updated last month
- Realtime demo, Streaming and Finetuning code for CSM☆455Sep 17, 2025Updated 7 months ago
- Automatic1111 port of my comfyUI geely remb tool☆17Oct 24, 2024Updated last year
- Sparse Inferencing for transformer based LLMs☆218Mar 25, 2026Updated last month
- OpenCode plugin for Anthropic Claude Pro/Max OAuth login — no Claude Code needed.☆109Apr 9, 2026Updated 3 weeks ago
- Autonomous, agentic, creative story writing system that incorporates stored embeddings and Knowledge Graphs.☆105Feb 16, 2026Updated 2 months ago
- ☆32Jan 28, 2025Updated last year
- A functioning Sesame CSM project with a desktop GUI - Real-time factor: 0.6x with 4070 Ti Super - Requires only 8GB VRAM☆80May 19, 2025Updated 11 months ago
- ☆19Feb 4, 2026Updated 3 months ago
- ☆22Sep 20, 2025Updated 7 months ago
- ☆16Dec 16, 2024Updated last year
- Service for testing out the new Qwen2.5 omni model☆63Apr 30, 2025Updated last year
- Automatically cleaning, enhancing, segmenting, filtering, and formatting a dataset to fine tune or train a voice model.☆50Sep 15, 2025Updated 7 months ago
- Repo for YouTube tutorial on how to self-host a MinIO instance and connect a Next.js 15 app to MinIO and upload files☆18Jan 15, 2025Updated last year
- Fork of "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"☆17Nov 27, 2024Updated last year
- ☆16Jan 24, 2026Updated 3 months ago
- Running Microsoft's BitNet inference framework via FastAPI, Uvicorn and Docker.☆38Jul 2, 2025Updated 10 months ago