jordandare/echo-tts

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/jordandare/echo-tts)

jordandare / echo-tts

Echo-TTS inference codebase

☆204

Alternatives and similar repositories for echo-tts

Users that are interested in echo-tts are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

KevinAHM / echo-tts-api
View on GitHub
Echo-TTS OpenAI Compatible Speech Endpoint w/ Streaming
☆29Apr 5, 2026Updated 3 months ago
Etherll / Timbre
View on GitHub
Extract a target speaker’s clean, non-overlapped speech from multi-speaker audio and export word-safe LJSpeech-style TTS datasets.
☆21Jun 14, 2026Updated last month
ysharma3501 / LayaCodec
View on GitHub
High fidelity neural audio codec for TTS models
☆36Dec 22, 2025Updated 7 months ago
Vyvo-Labs / CodecHub
View on GitHub
CodecHub: A Unified Library for Codec Models
☆25Dec 24, 2025Updated 7 months ago
yl4579 / DMOSpeech2
View on GitHub
☆302Jul 22, 2025Updated last year
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
Stylish-TTS / stylish-tts
View on GitHub
High quality text-to-speech based on StyleTTS 2.
☆78Apr 6, 2026Updated 3 months ago
ysharma3501 / MiraTTS
View on GitHub
A high quality and fast TTS repository
☆517Dec 22, 2025Updated 7 months ago
voicepowered-ai / VibeVoice-finetuning
View on GitHub
Unofficial WIP LoRa Finetuning repository for VibeVoice
☆369Sep 24, 2025Updated 10 months ago
ysharma3501 / FastNeuTTS
View on GitHub
A highly optimized engine for neutts-air model to generate minutes of audio in seconds. Over 200x realtime on modern hardware!
☆118Nov 24, 2025Updated 8 months ago
smallbraineng / smalltts
View on GitHub
superfast text to speech in any voice
☆62Feb 16, 2026Updated 5 months ago
ekwek1 / soprano-factory
View on GitHub
Soprano-Factory: Train your own 2000x realtime text-to-speech model
☆252Jan 13, 2026Updated 6 months ago
Vyvo-Labs / VyvoTTS
View on GitHub
VyvoTTS: LLM-Based Text-to-Speech Training Framework
☆257Apr 8, 2026Updated 3 months ago
KoljaB / WhoSpeaksLive
View on GitHub
Private, real-time speaker diarization on hardware you control. See who is speaking as it happens, no third-party cloud required.
☆18Updated this week
alisson-anjos / chatterbox-finetune
View on GitHub
SoTA open-source TTS
☆23Jun 17, 2025Updated last year
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
ysharma3501 / FlashSR
View on GitHub
Fast audio super resolution from 16khz to 48khz.
☆215Jan 3, 2026Updated 6 months ago
ekwek1 / soprano
View on GitHub
Soprano: Instant, Ultra-Realistic Text-to-Speech
☆1,374Jan 15, 2026Updated 6 months ago
stlohrey / chatterbox-finetuning
View on GitHub
SoTA open-source TTS
☆136Jun 7, 2025Updated last year
frothywater / kanade-tokenizer
View on GitHub
Kanade is a single-layer disentangled speech tokenizer that extracts compact tokens suitable for both generative and discriminative model…
☆108Jul 18, 2026Updated last week
k2-fsa / ZipVoice
View on GitHub
Fast and High-Quality Zero-Shot Text-to-Speech with Flow Matching
☆1,019Dec 2, 2025Updated 7 months ago
Audio-Foundation-Models / ConversationTTS
View on GitHub
☆101Jan 19, 2026Updated 6 months ago
Dawizzer / ComfyUI-Qwen3TTS-Emotional
View on GitHub
Voice cloning with 80+ emotions and multi-emotion mixing for ComfyUI
☆19Jan 25, 2026Updated 6 months ago
ysharma3501 / LinaCodec
View on GitHub
A highly compressive and high-quality neural audio codec for speech models.
☆269Jan 23, 2026Updated 6 months ago
meaningTeam / tidy-tunes
View on GitHub
Tidy Tunes is an easy-to-use pipeline for mining high-quality audio data for speech generation models. To do so, it chains multiple open …
☆23May 19, 2026Updated 2 months ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
davidbrowne17 / chatterbox-streaming
View on GitHub
Streaming and Fine-tuning for Chatterbox TTS
☆292Jun 15, 2025Updated last year
fakerybakery / openvoicelab
View on GitHub
A beginner-friendly inference to finetune & run inference on open TTS models 🗣️
☆30Feb 4, 2026Updated 5 months ago
duerig / StyleTTS2
View on GitHub
StyleTTS 2 Optimized Training Fork
☆32Feb 2, 2025Updated last year
Aratako / T5Gemma-TTS
View on GitHub
Multilingual TTS model with voice cloning and duration control, based on T5Gemma encoder-decoder LLM
☆311Apr 3, 2026Updated 3 months ago
vibevoice-community / VibeVoice
View on GitHub
VibeVoice: Expressive, longform conversational speech synthesis. (Community fork)
☆1,146Jun 12, 2026Updated last month
HeCheng0625 / Diffusion-Speech-Tokenizer
View on GitHub
This repository contains a series of works on diffusion-based speech tokenizers, including the official implementation of the paper: "TaD…
☆198Jan 25, 2026Updated 6 months ago
IDEA-Emdoor-Lab / UniTTS
View on GitHub
A TTS Trained on Universal Audio.
☆41Jun 6, 2025Updated last year
taylorchu / 2cent-tts
View on GitHub
☆58Feb 8, 2026Updated 5 months ago
AmphionTeam / TaDiCodec
View on GitHub
This repository contains a series of works on diffusion-based speech tokenizers, including the official implementation of the paper: "TaD…
☆77Jan 25, 2026Updated 6 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
Saganaki22 / ComfyUI-Step_Audio_EditX_TTS
View on GitHub
ComfyUI nodes for Step Audio EditX - State-of-the-art zero-shot voice cloning and audio editing with emotion, style, speed control, and m…
☆62Dec 4, 2025Updated 7 months ago
nineninesix-ai / kani-tts
View on GitHub
☆461Nov 2, 2025Updated 8 months ago
smulelabs / smule-renaissance
View on GitHub
Official Repository of Smule Renaissance, Smule's Vocal Restoration Models
☆43Oct 27, 2025Updated 8 months ago
WWWWxp / M3-TTS
View on GitHub
Pytorch Implementation of the paper "M3-TTS: Multi-modal DiT Alignment & Mel-latent for Zero-shot High-fidelity Speech Synthesis"
☆122Dec 18, 2025Updated 7 months ago
ysharma3501 / NovaSR
View on GitHub
A lightning fast audio upsampler.
☆775Feb 26, 2026Updated 4 months ago
idiap / knn-tts
View on GitHub
Simple and lightweight Zero-shot Text-to-Speech (TTS) synthesis model
☆36Apr 29, 2025Updated last year
pengzhendong / ngram-punctuator
View on GitHub
An N-gram punctuator for Chinese and English.
☆18Oct 14, 2025Updated 9 months ago