rgcodeai / Kit-WhisperxLinks
This project allows local installation and use of WhisperX WebUI, an advanced audio transcription system based on OpenAI's Whisper but optimized to run on local hardware with or without GPU.
☆21Updated 8 months ago
Alternatives and similar repositories for Kit-Whisperx
Users that are interested in Kit-Whisperx are comparing it to the libraries listed below
Sorting:
- Easy to use interface for the Whisper model optimized for all GPUs!☆458Updated 3 weeks ago
- Modified version of Chatterbox that accepts text files as input and no character restrictions. I use it to make audiobooks, especially fo…☆516Updated 5 months ago
- Automated speech dataset creator☆215Updated 7 months ago
- Self-host the powerful Dia TTS model. This server offers a user-friendly Web UI, flexible API endpoints (incl. OpenAI compatible), suppor…☆343Updated 8 months ago
- Speech-to-speech AI assistant with natural conversation flow, mid-speech interruption, vision capabilities and AI-initiated follow-ups. F…☆283Updated 9 months ago
- The GPT-4o image generation we have at home. A powerful, self-hosted AI photo stylizer built for performance and privacy.☆486Updated 7 months ago
- ☆233Updated 5 months ago
- Run Orpheus 3B Locally With LM Studio☆32Updated 10 months ago
- EPUB, PDF, DOCX, TXT, and MD file text to speech document reader. Read documents in realtime with high-quality TTS; or extract audiobooks…☆274Updated last week
- SoTA open-source TTS for Audiobook and Podcast Generation☆185Updated 7 months ago
- Privacy-first agentic framework with powerful reasoning & task automation capabilities. Natively distributed and fully ISO 27XXX complian…☆68Updated 10 months ago
- just unzip and use it with gradio☆76Updated last year
- ☆83Updated 11 months ago
- A powerful and user-friendly tool that generates detailed captions for your images☆21Updated last year
- A functioning Sesame CSM project with a desktop GUI - Real-time factor: 0.6x with 4070 Ti Super - Requires only 8GB VRAM☆78Updated 8 months ago
- Lightweight Gradio based WebUI for orpheusTTS - WSL / Linux [CUDA]☆105Updated 2 months ago
- SmartGallery for ComfyUI is a fast, standalone, browser-based gallery that remembers how every image or video was generated. Workflow-awa…☆238Updated this week
- High-performance Text-to-Speech server with OpenAI-compatible API, 8 voices, emotion tags, and modern web UI. Optimized for RTX GPUs.☆24Updated 8 months ago
- ☆13Updated 11 months ago
- Completely local data-management platform with built in trainable recommendation engine☆273Updated this week
- ☆73Updated 10 months ago
- Win & Liunux Gradio WebUI for CSM-1B model by sesame☆52Updated 10 months ago
- API server for Instant voice cloning by MyShell.☆107Updated last year
- Face Swap Workflow for Comfy UI☆35Updated 10 months ago
- Fully automated installation scripts for ComfyUI optimized for Intel Arc GPUs (A-Series) and Intel Core Ultra iGPUs with XPU backend, Tri…☆115Updated 2 weeks ago
- ☆31Updated 10 months ago
- Cross-platform, extensible terminal/browser for AI management☆397Updated last week
- Self-host the powerful Chatterbox TTS model. This server offers a user-friendly Web UI, flexible API endpoints (incl. OpenAI compatible),…☆996Updated last month
- Local LLM Powered Recursive Search & Smart Knowledge Explorer☆259Updated 3 months ago
- Easily download and archive content from Civitai☆93Updated last week