coqui-ai/Trainer

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/coqui-ai/Trainer)

coqui-ai / Trainer

🐸 - A general purpose model trainer, as flexible as it gets

☆233

Alternatives and similar repositories for Trainer

Users that are interested in Trainer are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

coqui-ai / coqpit
View on GitHub
Simple but maybe too simple config management through python data classes. We use it for machine learning.
☆107Apr 12, 2023Updated 3 years ago
coqui-ai / stt-model-manager
View on GitHub
Coqui STT Model Manager - install, manage and try out Coqui STT models from the Model Zoo
☆26Mar 24, 2023Updated 3 years ago
coqui-ai / TTS-papers
View on GitHub
🐸 collection of TTS papers
☆731Jul 4, 2024Updated 2 years ago
gpu-poor / gramvaani_hindi_asr
View on GitHub
This repo contains the baseline model recipes and pre-trained model for GramVanni hindi ASR challenge
☆16Mar 26, 2022Updated 4 years ago
choiHkk / nix-tts
View on GitHub
End-To-End SpeechSynthesis system with knowledge distillation
☆18Jul 16, 2022Updated 4 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
coqui-ai / xtts-streaming-server
View on GitHub
☆368Jun 26, 2024Updated 2 years ago
rhasspy / gruut
View on GitHub
A tokenizer, text cleaner, and phonemizer for many human languages.
☆330Nov 15, 2024Updated last year
coqui-ai / data-checker
View on GitHub
🫠 check your data, before you wreck your model
☆16Aug 11, 2022Updated 3 years ago
coqui-ai / STT-models
View on GitHub
Open models for Coqui STT
☆153May 9, 2023Updated 3 years ago
tts-tutorial / book
View on GitHub
☆63Sep 18, 2022Updated 3 years ago
Edresson / YourTTS
View on GitHub
YourTTS: Towards Zero-Shot Multi-Speaker TTS and Zero-Shot Voice Conversion for everyone
☆1,056Nov 4, 2024Updated last year
coqui-ai / open-speech-corpora
View on GitHub
💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies
☆1,398Jun 6, 2024Updated 2 years ago
p0p4k / Matcha-TTS-2
View on GitHub
E2E TTS using Conditional Flow Matching (Experimental*)
☆71Nov 10, 2023Updated 2 years ago
HumeAI / competitions
View on GitHub
Hume AI ML Competitions
☆31Apr 7, 2026Updated 3 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
tts-tutorial / interspeech2022
View on GitHub
☆162Sep 19, 2022Updated 3 years ago
Joshua-1995 / LearnableUpsamplingLayer-Pytorch
View on GitHub
Pytorch implementation of LearnableUpsamplingLayer (NaturalSpeech, Tan et al., 2022)
☆57Mar 12, 2024Updated 2 years ago
bshall / acoustic-model
View on GitHub
Acoustic models for: A Comparison of Discrete and Soft Speech Units for Improved Voice Conversion
☆105Mar 10, 2026Updated 4 months ago
speechio / asr-noises
View on GitHub
A handy dataset of noises for ASR
☆22May 29, 2019Updated 7 years ago
PriesiaMioShirakana / Pits-Japanese-Onnx
View on GitHub
PITS: Variational Pitch Inference for End-to-end Pitch-controllable TTS without External Pitch Predictor
☆17Apr 13, 2023Updated 3 years ago
sp-nitech / diffsptk
View on GitHub
A differentiable version of SPTK
☆201Jul 14, 2026Updated 2 weeks ago
CSLT-THU / IS2019-VAE
View on GitHub
Tensorflow and kaldi implementation of our paper "VAE-based regularization for deep speaker embedding"
☆11Mar 24, 2023Updated 3 years ago
iisys-hof / HUI-Audio-Corpus-German
View on GitHub
This is the official repository for the HUI-Audio-Corpus-German. The corresponding paper is in the process of publication. With the repo…
☆35Mar 31, 2023Updated 3 years ago
maum-ai / phaseaug
View on GitHub
ICASSP 2023 Accepted
☆191May 6, 2024Updated 2 years ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
rishikksh20 / Avocodo-pytorch
View on GitHub
Avocodo: Generative Adversarial Network for Artifact-free Vocoder
☆122Jul 14, 2022Updated 4 years ago
coqui-ai / TTS
View on GitHub
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
☆45,827Aug 16, 2024Updated last year
cnaigithub / Auto_Tuning_Zeroshot_TTS_and_VC
View on GitHub
Official implementation of "Automatic Tuning of Loss Trade-offs without Hyper-parameter Search in End-to-End Zero-Shot Speech Synthesis",…
☆80May 29, 2023Updated 3 years ago
yl4579 / PL-BERT
View on GitHub
Phoneme-Level BERT for Enhanced Prosody of Text-to-Speech with Grapheme Predictions
☆270Jan 13, 2025Updated last year
wenet-e2e / wetts
View on GitHub
Production First and Production Ready End-to-End Text-to-Speech Toolkit
☆416Nov 20, 2025Updated 8 months ago
NVIDIA / radtts
View on GitHub
Provides training, inference and voice conversion recipes for RADTTS and RADTTS++: Flow-based TTS models with Robust Alignment Learning, …
☆291Apr 6, 2023Updated 3 years ago
quickvc / QuickVC-VoiceConversion
View on GitHub
QuickVC: Any-to-many Voice Conversion Using Inverse Short-time Fourier Transform for Faster Conversion
☆261Jul 13, 2023Updated 3 years ago
maxrmorrison / promonet
View on GitHub
Prosody and Pronunciation Modification Network
☆64May 5, 2025Updated last year
AI-S2-Lab / FluentEditor
View on GitHub
[InterSpeech'2024] FluentEditor:Text-based Speech Editing by Considering Acoustic and Prosody Consistency
☆62Oct 23, 2024Updated last year
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
coqui-ai / STT
View on GitHub
🐸STT - The deep learning toolkit for Speech-to-Text. Training and deploying STT models has never been so easy.
☆2,596Mar 11, 2024Updated 2 years ago
AdolfVonKleist / RnnLMG2P
View on GitHub
Grapheme-to-Phoneme conversion with Joint-Sequence RnnLMs
☆30Dec 15, 2014Updated 11 years ago
exercise-book-yq / Supercodec
View on GitHub
☆51Mar 5, 2026Updated 4 months ago
YoungSeng / SRD-VC
View on GitHub
Speech Representation Disentanglement with Adversarial Mutual Information Learning for One-shot Voice Conversion (Interspeech 2022)
☆119Feb 7, 2024Updated 2 years ago
TehreemFarooqi / Preparing-a-speech-recognition-dataset-using-YouTube-videos
View on GitHub
Using YouTube to prepare a speech recognition dataset for any language
☆10Mar 30, 2021Updated 5 years ago
roholazandie / ryan-tts
View on GitHub
☆18Jan 17, 2022Updated 4 years ago
liusongxiang / Large-Audio-Models
View on GitHub
Keep track of big models in audio domain, including speech, singing, music etc.
☆515Jul 3, 2026Updated 3 weeks ago