SWivid/F5-TTS

Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"

☆14,976

Alternatives and similar repositories for F5-TTS

Users that are interested in F5-TTS are comparing it to the libraries listed below.

FunAudioLLM / CosyVoice
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
☆22,251 · May 25, 2026 · Updated last month
fishaudio / fish-speech
SOTA Open Source TTS
☆31,306 · Jun 9, 2026 · Updated last month
2noise / ChatTTS
A generative speech model for daily dialogue.
☆39,643 · Apr 10, 2026 · Updated 3 months ago
huggingface / parler-tts
Inference and training library for high-quality TTS models.
☆5,581 · Dec 10, 2024 · Updated last year
RVC-Boss / GPT-SoVITS
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
☆59,925 · Updated this week
SparkAudio / Spark-TTS
Spark-TTS Inference Code
☆10,998 · Apr 9, 2025 · Updated last year
coqui-ai / TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
☆45,779 · Aug 16, 2024 · Updated last year
open-mmlab / Amphion
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junio…
☆9,956 · Mar 25, 2026 · Updated 3 months ago
yl4579 / StyleTTS2
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models
☆6,312 · Aug 10, 2024 · Updated last year
myshell-ai / OpenVoice
Instant voice cloning by MIT and MyShell. Audio foundation model.
☆36,974 · Apr 19, 2025 · Updated last year
bytedance / LatentSync
Taming Stable Diffusion for Lip Sync!
☆5,890 · Jun 20, 2025 · Updated last year
FireRedTeam / FireRedTTS
An Open-Sourced LLM-empowered Foundation TTS System
☆907 · Sep 28, 2025 · Updated 9 months ago
canopyai / Orpheus-TTS
Towards Human-Sounding Speech
☆6,243 · Dec 5, 2025 · Updated 7 months ago
kyutai-labs / moshi
Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audi…
☆10,611 · May 16, 2026 · Updated 2 months ago
FunAudioLLM / SenseVoice
Open-source SenseVoiceSmall model for Mandarin, Cantonese, English, Japanese, and Korean ASR, language ID, emotion recognition, and audio…
☆8,883 · Updated this week
Plachtaa / seed-vc
zero-shot voice conversion & singing voice conversion, with real-time support
☆3,876 · Apr 20, 2025 · Updated last year
index-tts / index-tts
An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System
☆21,976 · Updated this week
zai-org / GLM-4-Voice
GLM-4-Voice | End-to-end Chinese-English spoken dialogue model
☆3,204 · Dec 5, 2024 · Updated last year
edwko / OuteTTS
Interface for OuteTTS models.
☆1,435 · Mar 23, 2026 · Updated 3 months ago
myshell-ai / MeloTTS
High-quality multi-lingual text-to-speech library by MyShell.ai. Support English, Spanish, French, Chinese, Japanese and Korean.
☆7,541 · Dec 24, 2024 · Updated last year
shivammehta25 / Matcha-TTS
[ICASSP 2024] 🍵 Matcha-TTS: A fast TTS architecture with conditional flow matching
☆1,332 · Updated this week
bytedance / MegaTTS3
☆6,084 · Jun 15, 2026 · Updated last month
stepfun-ai / Step-Audio
☆32 · Mar 16, 2026 · Updated 4 months ago
Zyphra / Zonos
Zonos-v0.1 is a leading open-weight text-to-speech model trained on more than 200k hours of varied multilingual speech, delivering expres…
☆7,229 · Mar 5, 2025 · Updated last year
suno-ai / bark
🔊 Text-Prompted Generative Audio Model
☆39,204 · Aug 19, 2024 · Updated last year
KlingAIResearch / LivePortrait
Bring portraits to life!
☆18,765 · Jun 1, 2026 · Updated last month
modelscope / FunASR
Open-source speech recognition toolkit for training, inference, streaming ASR, VAD, punctuation, speaker diarization pipelines, and OpenA…
☆19,332 · Updated this week
lucidrains / e2-tts-pytorch
Implementation of E2-TTS, "Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS", in Pytorch
☆516 · Dec 20, 2025 · Updated 6 months ago
antgroup / echomimic_v2
[CVPR 2025] EchoMimicV2: Towards Striking, Simplified, and Semi-Body Human Animation
☆4,617 · Feb 23, 2026 · Updated 4 months ago
nari-labs / dia
A TTS model capable of generating ultra-realistic dialogue in one pass.
☆19,338 · Nov 19, 2025 · Updated 8 months ago
modelscope / ClearerVoice-Studio
An AI-Powered Speech Processing Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Enhancement, Separation, and Target Spe…
☆4,313 · Aug 14, 2025 · Updated 11 months ago
antgroup / echomimic
[AAAI 2025] EchoMimic: Lifelike Audio-Driven Portrait Animations through Editable Landmark Conditioning
☆4,269 · Apr 7, 2026 · Updated 3 months ago
jasonppy / VoiceCraft
Zero-Shot Speech Editing and Text-to-Speech in the Wild
☆8,495 · May 30, 2026 · Updated last month
gpt-omni / mini-omni
open-source multimodal large language model that can hear, talk while thinking. Featuring real-time end-to-end speech input and streaming…
☆3,562 · Nov 5, 2024 · Updated last year
TMElyralab / MuseTalk
MuseTalk: Real-Time High Quality Lip Synchronization with Latent Space Inpainting
☆6,190 · Sep 26, 2025 · Updated 9 months ago
facefusion / facefusion
Industry leading face manipulation platform
☆29,319 · Updated this week
rany2 / edge-tts
Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key
☆11,525 · Mar 22, 2026 · Updated 3 months ago
SYSTRAN / faster-whisper
Faster Whisper transcription with CTranslate2
☆24,364 · Nov 19, 2025 · Updated 8 months ago
SesameAILabs / csm
A Conversational Speech Generation Model
☆14,701 · May 27, 2025 · Updated last year