DigitalPhonetics/IMS-Toucan

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/DigitalPhonetics/IMS-Toucan)

DigitalPhonetics / IMS-Toucan

Controllable and fast Text-to-Speech for over 7000 languages!

☆2,206

Alternatives and similar repositories for IMS-Toucan

Users that are interested in IMS-Toucan are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

p0p4k / pflowtts_pytorch
View on GitHub
Unofficial implementation of NVIDIA P-Flow TTS paper
☆228Dec 24, 2024Updated last year
huggingface / parler-tts
View on GitHub
Inference and training library for high-quality TTS models.
☆5,582Dec 10, 2024Updated last year
keonlee9420 / Comprehensive-Transformer-TTS
View on GitHub
A Non-Autoregressive Transformer based Text-to-Speech, supporting a family of SOTA transformers with supervised and unsupervised duration…
☆328Sep 24, 2022Updated 3 years ago
Camb-ai / MARS5-TTS
View on GitHub
MARS5 speech model (TTS) from CAMB.AI
☆2,816Aug 1, 2024Updated last year
shivammehta25 / Matcha-TTS
View on GitHub
[ICASSP 2024] 🍵 Matcha-TTS: A fast TTS architecture with conditional flow matching
☆1,333Jul 13, 2026Updated last week
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
PolyAI-LDN / pheme
View on GitHub
☆258Mar 15, 2024Updated 2 years ago
yl4579 / StyleTTS2
View on GitHub
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models
☆6,312Aug 10, 2024Updated last year
Edresson / YourTTS
View on GitHub
YourTTS: Towards Zero-Shot Multi-Speaker TTS and Zero-Shot Voice Conversion for everyone
☆1,057Nov 4, 2024Updated last year
X-LANCE / VoiceFlow-TTS
View on GitHub
[ICASSP 2024] This is the official code for "VoiceFlow: Efficient Text-to-Speech with Rectified Flow Matching"
☆376Sep 3, 2024Updated last year
yl4579 / PL-BERT
View on GitHub
Phoneme-Level BERT for Enhanced Prosody of Text-to-Speech with Grapheme Predictions
☆268Jan 13, 2025Updated last year
adelacvg / NS2VC
View on GitHub
Unofficial implementation of NaturalSpeech2 for Voice Conversion and Text to Speech
☆236Feb 29, 2024Updated 2 years ago
adelacvg / ttts
View on GitHub
Train the next generation of TTS systems.
☆169Sep 13, 2024Updated last year
Rongjiehuang / GenerSpeech
View on GitHub
PyTorch Implementation of GenerSpeech (NeurIPS'22): a text-to-speech model towards zero-shot style transfer of OOD custom voice.
☆333Feb 9, 2024Updated 2 years ago
yl4579 / StyleTTS
View on GitHub
Official Implementation of StyleTTS
☆466Jan 13, 2025Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
wenet-e2e / speech-synthesis-paper
View on GitHub
List of speech synthesis papers.
☆1,074Jul 24, 2023Updated 2 years ago
metavoiceio / metavoice-src
View on GitHub
Foundational model for human-like, expressive TTS
☆4,203Jul 30, 2024Updated last year
spring-media / DeepForcedAligner
View on GitHub
☆81Aug 8, 2025Updated 11 months ago
rendchevi / nix-tts
View on GitHub
🐤 Nix-TTS: Lightweight and End-to-end Text-to-Speech via Module-wise Distillation
☆264Nov 15, 2025Updated 8 months ago
b04901014 / MQTTS
View on GitHub
☆260May 15, 2023Updated 3 years ago
KdaiP / StableTTS
View on GitHub
Next-generation TTS model using flow-matching and DiT, inspired by Stable Diffusion 3
☆438Sep 13, 2024Updated last year
maum-ai / phaseaug
View on GitHub
ICASSP 2023 Accepted
☆191May 6, 2024Updated 2 years ago
huggingface / dataspeech
View on GitHub
☆399Sep 3, 2024Updated last year
axelspringer / DeepPhonemizer
View on GitHub
Grapheme to phoneme conversion with deep learning.
☆432Dec 8, 2023Updated 2 years ago
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
lucidrains / naturalspeech2-pytorch
View on GitHub
Implementation of Natural Speech 2, Zero-shot Speech and Singing Synthesizer, in Pytorch
☆1,333Sep 24, 2023Updated 2 years ago
nii-yamagishilab / ZMM-TTS
View on GitHub
ZMM-TTS: Zero-shot Multilingual and Multispeaker Speech Synthesis Conditioned on Self-supervised Discrete Speech Representations
☆183Mar 6, 2024Updated 2 years ago
dunky11 / voicesmith
View on GitHub
[WIP] VoiceSmith makes training text to speech models easy.
☆231Oct 10, 2022Updated 3 years ago
KevinMIN95 / StyleSpeech
View on GitHub
Official implementation of Meta-StyleSpeech and StyleSpeech
☆253Feb 9, 2022Updated 4 years ago
lucidrains / spear-tts-pytorch
View on GitHub
Implementation of Spear-TTS - multi-speaker text-to-speech attention network, in Pytorch
☆277Oct 30, 2023Updated 2 years ago
VinAIResearch / XPhoneBERT
View on GitHub
XPhoneBERT: A Pre-trained Multilingual Model for Phoneme Representations for Text-to-Speech (INTERSPEECH 2023)
☆354Jul 22, 2024Updated last year
bootphon / phonemizer
View on GitHub
Simple text to phones converter for multiple languages
☆1,557Sep 26, 2024Updated last year
p0p4k / vits2_pytorch
View on GitHub
unofficial vits2-TTS implementation in pytorch
☆548Mar 28, 2024Updated 2 years ago
lingjzhu / CharsiuG2P
View on GitHub
Multilingual G2P in 100 languages
☆389May 26, 2023Updated 3 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
k2-fsa / libriheavy
View on GitHub
Libriheavy: a 50,000 hours ASR corpus with punctuation casing and context
☆220Sep 10, 2024Updated last year
xinjli / transphone
View on GitHub
phoneme tokenizer and grapheme-to-phoneme model for 8k languages
☆174Jun 9, 2023Updated 3 years ago
rishikksh20 / Avocodo-pytorch
View on GitHub
Avocodo: Generative Adversarial Network for Artifact-free Vocoder
☆122Jul 14, 2022Updated 4 years ago
WhisperSpeech / WhisperSpeech
View on GitHub
An Open Source text-to-speech system built by inverting Whisper.
☆4,625Dec 14, 2025Updated 7 months ago
liutaocode / TTS-arxiv-daily
View on GitHub
Automatically Update Text-to-speech (TTS) Papers Daily using Github Actions (Update Every 12th hours)
☆661Updated this week
cnaigithub / Auto_Tuning_Zeroshot_TTS_and_VC
View on GitHub
Official implementation of "Automatic Tuning of Loss Trade-offs without Hyper-parameter Search in End-to-End Zero-Shot Speech Synthesis",…
☆80May 29, 2023Updated 3 years ago
line / LibriTTS-P
View on GitHub
LibriTTS-P: A Corpus with Speaking Style and Speaker Identity Prompts for Text-to-Speech and Style Captioning
☆161Jun 13, 2024Updated 2 years ago