rgcodeai / Kit-WhisperxLinks
This project allows local installation and use of WhisperX WebUI, an advanced audio transcription system based on OpenAI's Whisper but optimized to run on local hardware with or without GPU.
☆18Updated 3 months ago
Alternatives and similar repositories for Kit-Whisperx
Users that are interested in Kit-Whisperx are comparing it to the libraries listed below
Sorting:
- Modified version of Chatterbox that accepts text files as input and no character restrictions. I use it to make audiobooks, especially fo…☆394Updated 2 weeks ago
- just unzip and use it with gradio☆65Updated 7 months ago
- The GPT-4o image generation we have at home. A powerful, self-hosted AI photo stylizer built for performance and privacy.☆461Updated 2 months ago
- Automated speech dataset creator☆194Updated 2 months ago
- Easy to use interface for the Whisper model optimized for all GPUs!☆286Updated last month
- SoTA open-source TTS for Audiobook and Podcast Generation☆158Updated 2 months ago
- A comprehensive ComfyUI integration for Microsoft's VibeVoice text-to-speech model, enabling high-quality single and multi-speaker voice …☆511Updated this week
- An ComfyUI custom node integration for multi-language High-quality Text-to-Speech and Voice Conversion nodes using ResembleAI's Chatterbo…☆68Updated last week
- a repository for open webui things explanations☆34Updated 5 months ago
- EPUB, PDF, DOCX, MD, and TXT file text to speech document reader. Read documents in realtime with high-quality TTS; or extract audiobooks…☆193Updated last month
- ACE-Step: A Step Towards Music Generation Foundation Model☆43Updated 3 months ago
- Speech-to-speech AI assistant with natural conversation flow, mid-speech interruption, vision capabilities and AI-initiated follow-ups. F…☆233Updated 4 months ago
- Audiobook Creator is an app that converts books (EPUB, PDF, TXT etc.) into fully voiced audiobooks with intelligent character voice attri…☆329Updated last month
- ☆72Updated 3 months ago
- Self-host the powerful Chatterbox TTS model. This server offers a user-friendly Web UI, flexible API endpoints (incl. OpenAI compatible),…☆494Updated last month
- A local implementation of the Kokoro Text-to-Speech model, featuring dynamic module loading, automatic dependency management, and a web i…☆218Updated 2 weeks ago
- High-performance Text-to-Speech server with OpenAI-compatible API, 8 voices, emotion tags, and modern web UI. Optimized for RTX GPUs.☆533Updated 2 months ago
- A UI for the Piper TTS☆100Updated last year