devnen / Kitten-TTS-ServerLinks
Self-host the ultra-lightweight Kitten TTS model with this enhanced API server with an intuitive Web UI, large text processing for audiobooks, and GPU acceleration.
☆233Updated 5 months ago
Alternatives and similar repositories for Kitten-TTS-Server
Users that are interested in Kitten-TTS-Server are comparing it to the libraries listed below
Sorting:
- A high-quality rapid TTS voice cloning model that reaches speeds of 150x realtime.☆336Updated this week
- A Gradio-based web UI for voice cloning and voice design, powered by Qwen3-TTS & VibeVoice. Can use Whisper or VibeVoice-ASR for automat…☆142Updated this week
- VLLM Port of the Chatterbox TTS model☆364Updated 3 months ago
- ☆385Updated 2 months ago
- An open-source implementation of Whisper☆475Updated 3 months ago
- This project is a collection of Docker-based web user interfaces designed to easily run various state-of-the-art generative AI models loc…☆388Updated last week
- Local AI voice assistant stack for Home Assistant (GPU-accelerated) with persistent memory, follow-up conversation, and Ollama model reco…☆227Updated 6 months ago
- Human-taught Computer-use Agent Designed for Real Windows and MacOS Desktops.☆160Updated last week
- TTS model capable of streaming conversational audio in realtime.☆1,027Updated 2 months ago
- Self-host the powerful Dia TTS model. This server offers a user-friendly Web UI, flexible API endpoints (incl. OpenAI compatible), suppor…☆340Updated 8 months ago
- ☆502Updated this week
- Extract any sound with text prompts. Memory-optimized SAM-Audio with modern UI.☆269Updated last month
- BUDDIE is the first full-stack open-source AI voice interaction solution, providing a complete end-to-end system from hardware design to …☆246Updated 5 months ago
- ☆200Updated 10 months ago
- Dashboard v5 Coming Soon!!☆63Updated 3 weeks ago
- Fast local speech-to-text for any app using faster-whisper☆145Updated 4 months ago
- Liquid Audio - Speech-to-Speech audio models by Liquid AI☆382Updated last week
- Cascading voice assistant combining real-time speech recognition, AI reasoning, and neural text-to-speech capabilities.☆127Updated 4 months ago
- A high quality and fast TTS repository☆486Updated last month
- ComfyUI node for highly expressive speech and realistic zero-shot voice cloning☆377Updated last month
- A highly optimized engine for maya-1 tts model to generate minutes of audio in seconds.☆59Updated 2 months ago
- fully local, temporally aware natural language file search on your pc! even without a GPU. find relevant files using natural language i…☆165Updated last month
- ☆178Updated 5 months ago
- A web application that converts speech to speech 100% private☆82Updated 7 months ago
- Speech-to-speech AI assistant with natural conversation flow, mid-speech interruption, vision capabilities and AI-initiated follow-ups. F…☆281Updated 9 months ago
- Clean, polished interface for Tencent’s SongGeneration. Create songs from text prompts or reference audio, with batch processing and smar…☆340Updated this week
- Unlimited-length talking video generation that supports image-to-video and video-to-video generation☆129Updated 5 months ago
- Plug-and-play memory for LLMs in 3 lines of code. Add persistent, intelligent, human-like memory and recall to any model in minutes.☆249Updated last week
- A highly optimized engine for neutts-air model to generate minutes of audio in seconds. Over 200x realtime on modern hardware!☆108Updated 2 months ago
- Dia-JAX: A JAX port of Dia, the text-to-speech model for generating realistic dialogue from text with emotion and tone control.☆30Updated 8 months ago