neonbjb/tortoise-tts

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/neonbjb/tortoise-tts)

neonbjb / tortoise-tts

A multi-voice TTS system trained with an emphasis on quality

☆14,865

Alternatives and similar repositories for tortoise-tts

Users that are interested in tortoise-tts are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

coqui-ai / TTS
View on GitHub
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
☆45,827Aug 16, 2024Updated last year
suno-ai / bark
View on GitHub
🔊 Text-Prompted Generative Audio Model
☆39,214Aug 19, 2024Updated last year
152334H / tortoise-tts-fast
View on GitHub
Fast TorToiSe inference (5x or your money back!)
☆826Jul 10, 2024Updated 2 years ago
yl4579 / StyleTTS2
View on GitHub
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models
☆6,320Aug 10, 2024Updated last year
huggingface / parler-tts
View on GitHub
Inference and training library for high-quality TTS models.
☆5,580Dec 10, 2024Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
myshell-ai / OpenVoice
View on GitHub
Instant voice cloning by MIT and MyShell. Audio foundation model.
☆37,038Apr 19, 2025Updated last year
enhuiz / vall-e
View on GitHub
An unofficial PyTorch implementation of the audio LM VALL-E
☆2,979May 10, 2023Updated 3 years ago
jasonppy / VoiceCraft
View on GitHub
Zero-Shot Speech Editing and Text-to-Speech in the Wild
☆8,506May 30, 2026Updated last month
facebookresearch / audiocraft
View on GitHub
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor…
☆23,525Mar 3, 2026Updated 4 months ago
rhasspy / piper
View on GitHub
A fast, local neural text to speech system
☆11,270Aug 26, 2025Updated 11 months ago
metavoiceio / metavoice-src
View on GitHub
Foundational model for human-like, expressive TTS
☆4,203Jul 30, 2024Updated last year
serp-ai / bark-with-voice-clone
View on GitHub
🔊 Text-prompted Generative Audio Model - With the ability to clone voices
☆3,338Aug 24, 2025Updated 11 months ago
m-bain / whisperX
View on GitHub
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
☆23,304Jul 13, 2026Updated 2 weeks ago
oobabooga / textgen
View on GitHub
Open-source desktop app for local LLMs. Text, vision, tool-calling, OpenAI/Anthropic-compatible API. 100% private.
☆47,503Jun 2, 2026Updated last month
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
openai / whisper
View on GitHub
Robust Speech Recognition via Large-Scale Weak Supervision
☆105,839Apr 15, 2026Updated 3 months ago
ggml-org / whisper.cpp
View on GitHub
Port of OpenAI's Whisper model in C/C++
☆52,364Jul 11, 2026Updated 2 weeks ago
CorentinJ / Real-Time-Voice-Cloning
View on GitHub
Clone a voice in 5 seconds to generate arbitrary speech in real-time
☆60,068Mar 9, 2026Updated 4 months ago
WhisperSpeech / WhisperSpeech
View on GitHub
An Open Source text-to-speech system built by inverting Whisper.
☆4,625Dec 14, 2025Updated 7 months ago
lucidrains / audiolm-pytorch
View on GitHub
Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation out of Google Research, in Pytorch
☆2,623Jan 12, 2025Updated last year
Plachtaa / VALL-E-X
View on GitHub
An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io/vallex/
☆7,934Feb 11, 2024Updated 2 years ago
LAION-AI / Open-Assistant
View on GitHub
OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamical…
☆37,397Aug 17, 2024Updated last year
mozilla / TTS
View on GitHub
Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)
☆10,165Nov 9, 2023Updated 2 years ago
lifeiteng / vall-e
View on GitHub
PyTorch implementation of VALL-E(Zero-Shot Text-To-Speech), Reproduced Demo https://lifeiteng.github.io/valle/index.html
☆2,208Sep 10, 2025Updated 10 months ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
SWivid / F5-TTS
View on GitHub
Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
☆15,039Updated this week
neonbjb / ocotillo
View on GitHub
Performant and accurate speech recognition built on Pytorch
☆254May 19, 2022Updated 4 years ago
rsxdalv / TTS-WebUI
View on GitHub
A single Gradio + React WebUI with extensions for ACE-Step, OmniVoice, Kimi Audio, Piper TTS, GPT-SoVITS, CosyVoice, XTTSv2, DIA, Kokoro,…
☆3,216Updated this week
Rudrabha / Wav2Lip
View on GitHub
This repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Mult…
☆13,126Jun 22, 2025Updated last year
invoke-ai / InvokeAI
View on GitHub
Invoke is a leading creative engine for Stable Diffusion models, empowering professionals, artists, and enthusiasts to generate and creat…
☆27,670Updated this week
fishaudio / fish-speech
View on GitHub
SOTA Open Source TTS
☆31,398Updated this week
open-mmlab / Amphion
View on GitHub
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junio…
☆9,969Mar 25, 2026Updated 4 months ago
facebookresearch / seamless_communication
View on GitHub
Foundational Models for State-of-the-Art Speech and Text Translation
☆11,826Updated this week
haoheliu / AudioLDM
View on GitHub
AudioLDM: Generate speech, sound effects, music and beyond, with text.
☆2,905Jun 25, 2025Updated last year
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
SYSTRAN / faster-whisper
View on GitHub
Faster Whisper transcription with CTranslate2
☆24,595Nov 19, 2025Updated 8 months ago
Edresson / YourTTS
View on GitHub
YourTTS: Towards Zero-Shot Multi-Speaker TTS and Zero-Shot Voice Conversion for everyone
☆1,056Nov 4, 2024Updated last year
facebookresearch / encodec
View on GitHub
State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 kHz audio.
☆4,004Jan 4, 2024Updated 2 years ago
jaywalnut310 / vits
View on GitHub
VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech
☆7,888Dec 6, 2023Updated 2 years ago
nomic-ai / gpt4all
View on GitHub
GPT4All: Run Local LLMs on Any Device. Open-source and available for commercial use.
☆77,399May 27, 2025Updated last year
espnet / espnet
View on GitHub
End-to-End Speech Processing Toolkit
☆9,903Updated this week
speechbrain / speechbrain
View on GitHub
A PyTorch-based Speech Toolkit
☆11,717Jun 15, 2026Updated last month