nazdridoy / kokoro-tts
A CLI text-to-speech tool using the Kokoro model, supporting multiple languages, voices (with blending), and various input formats including EPUB books and PDF documents.
☆218Updated last week
Alternatives and similar repositories for kokoro-tts:
Users that are interested in kokoro-tts are comparing it to the libraries listed below
- A local implementation of the Kokoro Text-to-Speech model, featuring dynamic module loading, automatic dependency management, and a web i…☆117Updated 2 weeks ago
- Interface for OuteTTS models.☆940Updated 2 weeks ago
- A Fast TTS Engine☆459Updated last month
- A real-time speech-to-speech chatbot powered by Whisper Small, Llama 3.2, and Kokoro-82M.☆172Updated last month
- G2P☆155Updated this week
- Implementation of F5-TTS in MLX☆489Updated last month
- API server for Instant voice cloning by MyShell.☆86Updated 5 months ago
- ☆95Updated 10 months ago
- Slightly improved official version for finetune xtts☆318Updated 4 months ago
- 🔥🔥 Kokoro in Rust. https://huggingface.co/hexgrad/Kokoro-82M Insanely fast, realtime TTS with high quality you ever have.☆413Updated last week
- Open source inference code for Rev's model☆381Updated last month
- ☆58Updated 5 months ago
- https://hf.co/hexgrad/Kokoro-82M☆1,344Updated this week
- Running the F5-TTS by ONNX Runtime☆109Updated this week
- A simple FastAPI Server to run XTTSv2☆479Updated 7 months ago
- Local SRT/LLM/TTS Voicechat☆625Updated 4 months ago
- 🐍 🤖 Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning☆153Updated 7 months ago
- Dockerized FastAPI wrapper for Kokoro-82M text-to-speech model w/CPU ONNX and NVIDIA GPU PyTorch support, handling, and auto-stitching☆1,796Updated this week
- Record audio and save a transcription to your system's clipboard with ctranslate2 and faster-whisper.☆94Updated 2 weeks ago
- Verbatim Automatic Speech Recognition with improved word-level timestamps and filler detection☆597Updated 2 months ago
- LLaSA: Scaling Train-time and Inference-time Compute for LLaMA-based Speech Synthesis☆380Updated 2 weeks ago
- Whisper with Medusa heads☆823Updated this week
- Free, high-quality text-to-speech API endpoint to replace OpenAI, Azure, or ElevenLabs☆542Updated last month
- A User Interface for XTTS-2 Text-Based Voice Cloning using only 10 seconds of speech☆319Updated 2 months ago
- TTS with kokoro and onnx runtime☆1,671Updated this week
- Open dubbing is an AI dubbing system which uses machine learning models to automatically translate and synchronize audio dialogue into di…☆170Updated 2 weeks ago