πΈ - A general purpose model trainer, as flexible as it gets
β233Mar 7, 2024Updated 2 years ago
Alternatives and similar repositories for Trainer
Users that are interested in Trainer are comparing it to the libraries listed below
Sorting:
- Simple but maybe too simple config management through python data classes. We use it for machine learning.β108Apr 12, 2023Updated 2 years ago
- Coqui STT Model Manager - install, manage and try out Coqui STT models from the Model Zooβ26Mar 24, 2023Updated 2 years ago
- πΈ collection of TTS papersβ723Jul 4, 2024Updated last year
- π« check your data, before you wreck your modelβ16Aug 11, 2022Updated 3 years ago
- End-To-End SpeechSynthesis system with knowledge distillationβ18Jul 16, 2022Updated 3 years ago
- iSeparate library for the SDX2023 challengeβ14Dec 15, 2023Updated 2 years ago
- A tokenizer, text cleaner, and phonemizer for many human languages.β332Nov 15, 2024Updated last year
- This repo contains the baseline model recipes and pre-trained model for GramVanni hindi ASR challengeβ15Mar 26, 2022Updated 3 years ago
- Tensorflow and kaldi implementation of our paper "VAE-based regularization for deep speaker embedding"β11Mar 24, 2023Updated 2 years ago
- β363Jun 26, 2024Updated last year
- Acoustic models for: A Comparison of Discrete and Soft Speech Units for Improved Voice Conversionβ104Jul 12, 2023Updated 2 years ago
- π A list of accessible speech corpora for ASR, TTS, and other Speech Technologiesβ1,386Jun 6, 2024Updated last year
- Open models for Coqui STTβ153May 9, 2023Updated 2 years ago
- YourTTS: Towards Zero-Shot Multi-Speaker TTS and Zero-Shot Voice Conversion for everyoneβ1,052Nov 4, 2024Updated last year
- Tau LLM made with Unity 6 ML Agentsβ17Apr 24, 2025Updated 10 months ago
- E2E TTS using Conditional Flow Matching (Experimental*)β71Nov 10, 2023Updated 2 years ago
- β49Apr 1, 2025Updated 11 months ago
- β18Jan 17, 2022Updated 4 years ago
- PITS: Variational Pitch Inference for End-to-end Pitch-controllable TTS without External Pitch Predictorβ17Apr 13, 2023Updated 2 years ago
- β64Sep 18, 2022Updated 3 years ago
- Single Channel Speech Enhancement Methods and Toolboxβ39Feb 26, 2026Updated last week
- Phoneme-Level BERT for Enhanced Prosody of Text-to-Speech with Grapheme Predictionsβ268Jan 13, 2025Updated last year
- Official implementation of "Automatic Tuning of Loss Trade-offs without Hyper-parameter Search in End-to-End Zero-Shot Speech Synthesis",β¦β80May 29, 2023Updated 2 years ago
- β163Sep 19, 2022Updated 3 years ago
- Pytorch implementation of LearnableUpsamplingLayer (NaturalSpeech, Tan et al., 2022)β57Mar 12, 2024Updated last year
- Prosody and Pronunciation Modification Networkβ63May 5, 2025Updated 10 months ago
- The source code of our paper "Diffsound: discrete diffusion model for text-to-sound generation"β366Aug 3, 2023Updated 2 years ago
- This is the official repository for the HUI-Audio-Corpus-German. The corresponding paper is in the process of publication. With the repoβ¦β36Mar 31, 2023Updated 2 years ago
- A differentiable version of SPTKβ193Feb 26, 2026Updated last week
- β67Aug 16, 2023Updated 2 years ago
- ICASSP 2023 Acceptedβ190May 6, 2024Updated last year
- List of repositories relevant to VITS.β36Feb 26, 2023Updated 3 years ago
- HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesisβ2,320Jul 27, 2024Updated last year
- UT-Sarulab MOS prediction system using SSL modelsβ296Apr 11, 2024Updated last year
- [InterSpeech'2024] FluentEditor:Text-based Speech Editing by Considering Acoustic and Prosody Consistencyβ59Oct 23, 2024Updated last year
- This is an implementation for train hifigan part of XTTSv2 model using Coqui/TTS.β87Nov 12, 2024Updated last year
- Provides training, inference and voice conversion recipes for RADTTS and RADTTS++: Flow-based TTS models with Robust Alignment Learning, β¦β291Apr 6, 2023Updated 2 years ago
- A handy dataset of noises for ASRβ22May 29, 2019Updated 6 years ago
- Official Implementation of StyleTTSβ462Jan 13, 2025Updated last year