edwko/OuteTTS

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/edwko/OuteTTS)

edwko / OuteTTS

Interface for OuteTTS models.

☆1,435

Alternatives and similar repositories for OuteTTS

Users that are interested in OuteTTS are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

canopyai / Orpheus-TTS
View on GitHub
Towards Human-Sounding Speech
☆6,244Dec 5, 2025Updated 7 months ago
Standard-Intelligence / hertz-dev
View on GitHub
first base model for full-duplex conversational audio
☆1,794Jan 5, 2025Updated last year
SWivid / F5-TTS
View on GitHub
Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
☆14,978Jul 5, 2026Updated 2 weeks ago
MYZY-AI / Muyan-TTS
View on GitHub
☆480May 19, 2025Updated last year
huggingface / parler-tts
View on GitHub
Inference and training library for high-quality TTS models.
☆5,581Dec 10, 2024Updated last year
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
yl4579 / StyleTTS2
View on GitHub
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models
☆6,311Aug 10, 2024Updated last year
jishengpeng / WavTokenizer
View on GitHub
[ICLR 2025] SOTA discrete acoustic codec models with 40/75 tokens per second for audio language modeling
☆1,305Mar 2, 2025Updated last year
zhenye234 / LLaSA_training
View on GitHub
LLaSA: Scaling Train-time and Inference-time Compute for LLaMA-based Speech Synthesis
☆660Jan 21, 2026Updated 5 months ago
lucidrains / e2-tts-pytorch
View on GitHub
Implementation of E2-TTS, "Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS", in Pytorch
☆516Dec 20, 2025Updated 7 months ago
k2-fsa / ZipVoice
View on GitHub
Fast and High-Quality Zero-Shot Text-to-Speech with Flow Matching
☆1,014Dec 2, 2025Updated 7 months ago
yl4579 / DMOSpeech2
View on GitHub
☆301Jul 22, 2025Updated 11 months ago
Marvis-Labs / marvis-tts
View on GitHub
☆365Aug 28, 2025Updated 10 months ago
kyutai-labs / moshi
View on GitHub
Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audi…
☆10,613May 16, 2026Updated 2 months ago
fishaudio / fish-speech
View on GitHub
SOTA Open Source TTS
☆31,327Jun 9, 2026Updated last month
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
zhenye234 / FlashSpeech
View on GitHub
ACM MM 2024 FlashSpeech: Efficient Zero-Shot Speech Synthesis
☆155Sep 20, 2024Updated last year
KdaiP / StableTTS
View on GitHub
Next-generation TTS model using flow-matching and DiT, inspired by Stable Diffusion 3
☆438Sep 13, 2024Updated last year
DigitalPhonetics / IMS-Toucan
View on GitHub
Controllable and fast Text-to-Speech for over 7000 languages!
☆2,206Jan 25, 2026Updated 5 months ago
flamed-tts / Flamed-TTS
View on GitHub
This repository implement a novel zero-shot TTS framework, named Flamed-TTS, focusing on the efficient generation and dynamic pacing in …
☆57Aug 9, 2025Updated 11 months ago
open-mmlab / Amphion
View on GitHub
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junio…
☆9,957Mar 25, 2026Updated 3 months ago
Zyphra / Zonos
View on GitHub
Zonos-v0.1 is a leading open-weight text-to-speech model trained on more than 200k hours of varied multilingual speech, delivering expres…
☆7,228Mar 5, 2025Updated last year
Plachtaa / seed-vc
View on GitHub
zero-shot voice conversion & singing voice conversion, with real-time support
☆3,878Apr 20, 2025Updated last year
Plachtaa / FAcodec
View on GitHub
Training code for FAcodec presented in NaturalSpeech3
☆244Aug 26, 2024Updated last year
metavoiceio / metavoice-src
View on GitHub
Foundational model for human-like, expressive TTS
☆4,203Jul 30, 2024Updated last year
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
shivammehta25 / Matcha-TTS
View on GitHub
[ICASSP 2024] 🍵 Matcha-TTS: A fast TTS architecture with conditional flow matching
☆1,332Updated this week
zhenye234 / X-Codec-2.0
View on GitHub
Codec for paper: LLaSA: Scaling Train-time and Inference-time Compute for LLaMA-based Speech Synthesis
☆360Jun 25, 2026Updated 3 weeks ago
gpt-omni / mini-omni
View on GitHub
open-source multimodal large language model that can hear, talk while thinking. Featuring real-time end-to-end speech input and streaming…
☆3,562Nov 5, 2024Updated last year
janhq / ichigo
View on GitHub
Local realtime voice AI
☆2,490Nov 26, 2025Updated 7 months ago
Camb-ai / MARS5-TTS
View on GitHub
MARS5 speech model (TTS) from CAMB.AI
☆2,816Aug 1, 2024Updated last year
ex3ndr / supervoice-hybrid
View on GitHub
My hybrid TTS network that combines, VALL-E, VoiceBox, SpeechFlow, Seamless and TortoiseTTS into one
☆26Aug 5, 2024Updated last year
myshell-ai / MeloTTS
View on GitHub
High-quality multi-lingual text-to-speech library by MyShell.ai. Support English, Spanish, French, Chinese, Japanese and Korean.
☆7,541Dec 24, 2024Updated last year
p0p4k / pflowtts_pytorch
View on GitHub
Unofficial implementation of NVIDIA P-Flow TTS paper
☆228Dec 24, 2024Updated last year
modelscope / ClearerVoice-Studio
View on GitHub
An AI-Powered Speech Processing Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Enhancement, Separation, and Target Spe…
☆4,314Aug 14, 2025Updated 11 months ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
fluxions-ai / vui
View on GitHub
Real-time voice assistant — WebRTC streaming, faster-whisper ASR, local LLM, Vui Nano (300M) TTS. OpenAI Realtime API compatible. Voice c…
☆726Jul 9, 2026Updated last week
FunAudioLLM / CosyVoice
View on GitHub
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
☆22,265May 25, 2026Updated last month
nari-labs / dia
View on GitHub
A TTS model capable of generating ultra-realistic dialogue in one pass.
☆19,341Nov 19, 2025Updated 8 months ago
xingchensong / S3Tokenizer
View on GitHub
Reverse Engineering of Supervised Semantic Speech Tokenizer (S3Tokenizer) proposed in CosyVoice
☆517Dec 22, 2025Updated 6 months ago
astramind-ai / Auralis
View on GitHub
A Fast TTS Engine
☆625Jan 23, 2025Updated last year
huggingface / speech-to-speech
View on GitHub
Build local voice agents with open-source models
☆6,165Updated this week
shivammehta25 / BetterFastSpeech2
View on GitHub
Just another FastSpeech 2 but cleaner code :)
☆29Jun 28, 2024Updated 2 years ago