remsky / Kokoro-FastAPI
Dockerized FastAPI wrapper for Kokoro-82M text-to-speech model w/CPU ONNX and NVIDIA GPU PyTorch support, handling, and auto-stitching
☆632 Updated this week
Alternatives and similar repositories for Kokoro-FastAPI:
Users who are interested in Kokoro-FastAPI are comparing it to the libraries listed below:
- Interface for OuteTTS models. ☆859 Updated this week
- Local SRT/LLM/TTS Voicechat ☆590 Updated 3 months ago
- A Fast TTS Engine ☆405 Updated last week
- first base model for full-duplex conversational audio ☆1,669 Updated last week
- Implementation of F5-TTS in MLX ☆429 Updated last week
- TTS with kokoro and onnx runtime ☆953 Updated this week
- Examples for Cerebrium Serverless GPUs ☆454 Updated this week
- Better WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization) ☆391 Updated last week
- Efficient visual programming for AI language models ☆337 Updated 4 months ago
- Local realtime voice AI ☆2,162 Updated this week
- A modular voice assistant application for experimenting with state-of-the-art transcription, response generation, and text-to-speech mode… ☆866 Updated 2 months ago
- Whisper with Medusa heads ☆818 Updated 2 weeks ago
- ☆1,110 Updated this week
- Inference code for the paper "Spirit-LM Interleaved Spoken and Written Language Model". ☆863 Updated 2 months ago
- Local voice chatbot for engaging conversations, powered by Ollama, Hugging Face Transformers, and Coqui TTS Toolkit ☆738 Updated 5 months ago
- Example UI implementing the RTVI web client ☆474 Updated last month
- a self-hosted webui for 30+ generative ai ☆517 Updated this week
- ScribeWizard: Generate organized notes from audio using Groq, Whisper, and Llama3 ☆471 Updated 4 months ago
- An experiment in meeting transcription and diarization with just an LLM. Maybe I went a little overboard though ☆368 Updated 2 months ago
- Verbatim Automatic Speech Recognition with improved word-level timestamps and filler detection ☆518 Updated 3 weeks ago
- podcastfy.ai gradio demo app ☆326 Updated last month
- A fast multimodal LLM for real-time voice ☆2,760 Updated this week
- ⚡ Insanely fast AI voice assistant with <500ms response times ☆357 Updated last month
- Effortlessly run LLM backends, APIs, frontends, and services with one command. ☆948 Updated this week
- Fast and accurate automatic speech recognition (ASR) for edge devices ☆2,492 Updated this week
- Local AI talk with a custom voice based on Zephyr 7B model. Uses RealtimeSTT with faster_whisper for transcription and RealtimeTTS with C… ☆555 Updated 5 months ago
- ☆502 Updated this week
- Excalidraw meets ComfyUI for LLMs ☆213 Updated 2 weeks ago
- Open source inference code for Rev's model ☆361 Updated this week