remsky / Kokoro-FastAPILinks

Dockerized FastAPI wrapper for Kokoro-82M text-to-speech model w/CPU ONNX and NVIDIA GPU PyTorch support, handling, and auto-stitching

☆3,382

Alternatives and similar repositories for Kokoro-FastAPI

Users that are interested in Kokoro-FastAPI are comparing it to the libraries listed below

Sorting:

hexgrad / kokoro
https://hf.co/hexgrad/Kokoro-82M
☆3,805Updated last week
thewh1teagle / kokoro-onnx
TTS with kokoro and onnx runtime
☆2,129Updated last month
nazdridoy / kokoro-tts
A CLI text-to-speech tool using the Kokoro model, supporting multiple languages, voices (with blending), and various input formats includ…
☆610Updated last week
canopyai / Orpheus-TTS
Towards Human-Sounding Speech
☆5,327Updated 3 months ago
speaches-ai / speaches
☆2,179Updated this week
matatonic / openedai-speech
An OpenAI API compatible text to speech server using Coqui AI's xtts_v2 and/or piper tts as the backend.
☆797Updated 6 months ago
idiap / coqui-ai-TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
☆1,659Updated 2 weeks ago
KoljaB / RealtimeTTS
Converts text to speech in realtime
☆3,338Updated 2 weeks ago
edwko / OuteTTS
Interface for OuteTTS models.
☆1,346Updated last month
Lex-au / Orpheus-FastAPI
High-performance Text-to-Speech server with OpenAI-compatible API, 8 voices, emotion tags, and modern web UI. Optimized for RTX GPUs.
☆490Updated last month
erew123 / alltalk_tts
AllTalk is based on the Coqui TTS engine, similar to the Coqui_tts extension for Text generation webUI, however supports a variety of adv…
☆1,960Updated 3 weeks ago
travisvn / openai-edge-tts
Free, high-quality text-to-speech API endpoint to replace OpenAI, Azure, or ElevenLabs
☆1,070Updated last month
kyutai-labs / delayed-streams-modeling
Kyutai's Speech-To-Text and Text-To-Speech models based on the Delayed Streams Modeling framework.
☆2,171Updated this week
menloresearch / ichigo
Local realtime voice AI
☆2,343Updated 5 months ago
Zyphra / Zonos
Zonos-v0.1 is a leading open-weight text-to-speech model trained on more than 200k hours of varied multilingual speech, delivering expres…
☆6,878Updated 5 months ago
KoljaB / RealtimeVoiceChat
Have a natural, spoken conversation with AI!
☆2,875Updated 3 weeks ago
collabora / WhisperLive
A nearly-live implementation of OpenAI's Whisper.
☆3,190Updated last week
resemble-ai / chatterbox
SoTA open-source TTS
☆9,802Updated this week
kyutai-labs / hibiki
Hibiki is a model for streaming speech translation (also known as simultaneous translation). Unlike offline translation—where one waits f…
☆1,252Updated 3 months ago
isaiahbjork / orpheus-tts-local
Run Orpheus 3B Locally With LM Studio
☆446Updated 4 months ago
lhl / voicechat2
Local SRT/LLM/TTS Voicechat
☆702Updated 9 months ago
KoljaB / RealtimeSTT
A robust, efficient, low-latency speech-to-text library with advanced voice activity detection, wake word activation and instant transcri…
☆8,278Updated 3 weeks ago
rsxdalv / TTS-WebUI
A single Gradio + React WebUI with extensions for ACE-Step, Kimi Audio, Piper TTS, GPT-SoVITS, CosyVoice, XTTSv2, DIA, Kokoro, OpenVoice,…
☆2,407Updated 3 weeks ago
astramind-ai / Auralis
A Fast TTS Engine
☆529Updated 6 months ago
Standard-Intelligence / hertz-dev
first base model for full-duplex conversational audio
☆1,748Updated 7 months ago
bytedance / MegaTTS3
☆5,694Updated 2 months ago
fixie-ai / ultravox
A fast multimodal LLM for real-time voice
☆4,124Updated last week
ace-step / ACE-Step
ACE-Step: A Step Towards Music Generation Foundation Model
☆2,782Updated last month
PierrunoYT / Kokoro-TTS-Local
A local implementation of the Kokoro Text-to-Speech model, featuring dynamic module loading, automatic dependency management, and a web i…
☆204Updated 2 weeks ago
KoljaB / Linguflex
Command Your World with Voice
☆737Updated last month