KoljaB/RealtimeTTS

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/KoljaB/RealtimeTTS)

KoljaB / RealtimeTTS

Converts text to speech in realtime

☆3,990

Alternatives and similar repositories for RealtimeTTS

Users that are interested in RealtimeTTS are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

KoljaB / RealtimeSTT
View on GitHub
A robust, efficient, low-latency speech-to-text library with advanced voice activity detection, wake word activation and instant transcri…
☆9,990Jun 12, 2026Updated last month
KoljaB / Linguflex
View on GitHub
Command Your World with Voice
☆811Jun 17, 2025Updated last year
yl4579 / StyleTTS2
View on GitHub
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models
☆6,312Aug 10, 2024Updated last year
coqui-ai / TTS
View on GitHub
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
☆45,779Aug 16, 2024Updated last year
KoljaB / LocalAIVoiceChat
View on GitHub
Local AI talk with a custom voice based on Zephyr 7B model. Uses RealtimeSTT with faster_whisper for transcription and RealtimeTTS with C…
☆726Jun 17, 2025Updated last year
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
SYSTRAN / faster-whisper
View on GitHub
Faster Whisper transcription with CTranslate2
☆24,364Nov 19, 2025Updated 8 months ago
huggingface / parler-tts
View on GitHub
Inference and training library for high-quality TTS models.
☆5,581Dec 10, 2024Updated last year
myshell-ai / MeloTTS
View on GitHub
High-quality multi-lingual text-to-speech library by MyShell.ai. Support English, Spanish, French, Chinese, Japanese and Korean.
☆7,541Dec 24, 2024Updated last year
ufal / whisper_streaming
View on GitHub
Whisper realtime streaming for long speech-to-text transcription and translation
☆3,651Nov 12, 2025Updated 8 months ago
rhasspy / piper
View on GitHub
A fast, local neural text to speech system
☆11,235Aug 26, 2025Updated 10 months ago
pipecat-ai / pipecat
View on GitHub
Open Source framework for voice and multimodal conversational AI
☆13,556Updated this week
canopyai / Orpheus-TTS
View on GitHub
Towards Human-Sounding Speech
☆6,243Dec 5, 2025Updated 7 months ago
KoljaB / AIVoiceChat
View on GitHub
Low latency ai companion voice talk in 60 lines of code using faster_whisper and elevenlabs input streaming
☆320Jun 17, 2025Updated last year
WhisperSpeech / WhisperSpeech
View on GitHub
An Open Source text-to-speech system built by inverting Whisper.
☆4,625Dec 14, 2025Updated 7 months ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
SWivid / F5-TTS
View on GitHub
Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
☆14,976Jul 5, 2026Updated 2 weeks ago
metavoiceio / metavoice-src
View on GitHub
Foundational model for human-like, expressive TTS
☆4,203Jul 30, 2024Updated last year
daswer123 / xtts-api-server
View on GitHub
A simple FastAPI Server to run XTTSv2
☆595Jul 21, 2024Updated last year
fishaudio / fish-speech
View on GitHub
SOTA Open Source TTS
☆31,306Jun 9, 2026Updated last month
myshell-ai / OpenVoice
View on GitHub
Instant voice cloning by MIT and MyShell. Audio foundation model.
☆36,974Apr 19, 2025Updated last year
snakers4 / silero-vad
View on GitHub
Silero VAD: pre-trained enterprise-grade Voice Activity Detector
☆9,612Updated this week
erew123 / alltalk_tts
View on GitHub
AllTalk is based on the Coqui TTS engine, similar to the Coqui_tts extension for Text generation webUI, however supports a variety of adv…
☆2,414Jan 9, 2026Updated 6 months ago
speaches-ai / speaches
View on GitHub
☆3,520Updated this week
collabora / WhisperLive
View on GitHub
A nearly-live implementation of OpenAI's Whisper.
☆4,138Updated this week
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
kyutai-labs / moshi
View on GitHub
Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audi…
☆10,611May 16, 2026Updated 2 months ago
fixie-ai / ultravox
View on GitHub
A fast multimodal LLM for real-time voice
☆4,476Dec 12, 2025Updated 7 months ago
huggingface / speech-to-speech
View on GitHub
Build local voice agents with open-source models
☆6,159Updated this week
m-bain / whisperX
View on GitHub
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
☆23,136Updated this week
livekit / agents
View on GitHub
A framework for building realtime voice AI agents 🤖🎙️📹
☆11,418Updated this week
suno-ai / bark
View on GitHub
🔊 Text-Prompted Generative Audio Model
☆39,204Aug 19, 2024Updated last year
remsky / Kokoro-FastAPI
View on GitHub
Dockerized FastAPI wrapper for Kokoro-82M text-to-speech model w/multiplatform CPU, AMD, NVIDIA GPU PyTorch support, handling, and auto-s…
☆5,216Updated this week
edwko / OuteTTS
View on GitHub
Interface for OuteTTS models.
☆1,435Mar 23, 2026Updated 3 months ago
SesameAILabs / csm
View on GitHub
A Conversational Speech Generation Model
☆14,701May 27, 2025Updated last year
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
KoljaB / RealtimeVoiceChat
View on GitHub
Have a natural, spoken conversation with AI!
☆3,799Jul 11, 2025Updated last year
KoljaB / LocalEmotionalAIVoiceChat
View on GitHub
Simulates talk with an AI that can express emotions
☆87Apr 4, 2026Updated 3 months ago
Vaibhavs10 / insanely-fast-whisper
View on GitHub
☆12,988Oct 25, 2025Updated 8 months ago
netease-youdao / EmotiVoice
View on GitHub
EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine
☆8,489Aug 13, 2024Updated last year
huggingface / distil-whisper
View on GitHub
Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.
☆4,090Jan 8, 2025Updated last year
neonbjb / tortoise-tts
View on GitHub
A multi-voice TTS system trained with an emphasis on quality
☆14,861Nov 19, 2024Updated last year
FunAudioLLM / CosyVoice
View on GitHub
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
☆22,251May 25, 2026Updated last month