devnen / Kitten-TTS-ServerLinks
Self-host the ultra-lightweight Kitten TTS model with this enhanced API server with an intuitive Web UI, large text processing for audiobooks, and GPU acceleration.
☆234Updated 6 months ago
Alternatives and similar repositories for Kitten-TTS-Server
Users that are interested in Kitten-TTS-Server are comparing it to the libraries listed below
Sorting:
- VLLM Port of the Chatterbox TTS model☆364Updated 3 months ago
- A Gradio-based web UI for voice cloning and voice design, powered by Qwen3-TTS & VibeVoice. Can use Whisper or VibeVoice-ASR for automat…☆276Updated last week
- A high-quality rapid TTS voice cloning model that reaches speeds of 150x realtime.☆632Updated last week
- ☆386Updated 3 months ago
- Cascading voice assistant combining real-time speech recognition, AI reasoning, and neural text-to-speech capabilities.☆128Updated 5 months ago
- A web application that converts speech to speech 100% private☆82Updated 8 months ago
- A professional-grade interface for Qwen3-TTS, designed to unlock the model's full potential with fine-grained control and intuitive workf…☆154Updated this week
- Clean, polished interface for Tencent’s SongGeneration. Create songs from text prompts or reference audio, with batch processing and smar…☆344Updated 2 weeks ago
- This project is a collection of Docker-based web user interfaces designed to easily run various state-of-the-art generative AI models loc…☆399Updated 3 weeks ago
- ☆205Updated 5 months ago
- ☆178Updated 5 months ago
- Chain apps and models to build robust AI workflows 🤗☆424Updated this week
- Extract any sound with text prompts. Memory-optimized SAM-Audio with modern UI.☆323Updated last month
- Speech-to-speech AI assistant with natural conversation flow, mid-speech interruption, vision capabilities and AI-initiated follow-ups. F…☆283Updated 9 months ago
- Fast local speech-to-text for any app using faster-whisper☆147Updated this week
- High-performance lightweight proxy and load balancer for LLM infrastructure. Intelligent routing, automatic failover and unified model di…☆133Updated last week
- VibeVoice: Expressive, longform conversational speech synthesis. (Community fork)☆964Updated 2 weeks ago
- fully local, temporally aware natural language file search on your pc! even without a GPU. find relevant files using natural language i…☆166Updated last month
- BUDDIE is the first full-stack open-source AI voice interaction solution, providing a complete end-to-end system from hardware design to …☆286Updated 5 months ago
- Open Source Locally Hosted Lovable with Full Stack Support☆353Updated last month
- Self-host the powerful Dia TTS model. This server offers a user-friendly Web UI, flexible API endpoints (incl. OpenAI compatible), suppor…☆344Updated 8 months ago
- ☆201Updated 10 months ago
- The GPT-4o image generation we have at home. A powerful, self-hosted AI photo stylizer built for performance and privacy.☆486Updated 7 months ago
- A highly optimized engine for neutts-air model to generate minutes of audio in seconds. Over 200x realtime on modern hardware!☆110Updated 2 months ago
- Kroko ASR - Speech-to-text☆130Updated 4 months ago
- Local, OpenAI-compatible text-to-speech (TTS) API using Chatterbox, enabling users to generate voice cloned speech anywhere the OpenAI AP…☆514Updated last month
- Local voice AI powered by Ollama, Kokoro, Whisper, and LiveKit.☆397Updated last month
- llmbasedos — Local-First OS Where Your AI Agents Wake Up and Work☆282Updated last month
- Liquid Audio - Speech-to-Speech audio models by Liquid AI☆388Updated 2 weeks ago
- A lightweight UI for chatting with Ollama models. Streaming responses, conversation history, and multi-model support.☆149Updated 10 months ago