🐸 - A general purpose model trainer, as flexible as it gets
☆234 · Mar 7, 2024 · Updated 2 years ago
Alternatives and similar repositories for Trainer
Users who are interested in Trainer are comparing it to the libraries listed below.
- Simple but maybe too simple config management through python data classes. We use it for machine learning. ☆108 · Apr 12, 2023 · Updated 2 years ago
- Coqui STT Model Manager - install, manage and try out Coqui STT models from the Model Zoo ☆26 · Mar 24, 2023 · Updated 3 years ago
- check your data, before you wreck your model ☆16 · Aug 11, 2022 · Updated 3 years ago
- 🐸 collection of TTS papers ☆723 · Jul 4, 2024 · Updated last year
- This repo contains the baseline model recipes and pre-trained model for GramVanni hindi ASR challenge ☆15 · Mar 26, 2022 · Updated 4 years ago
- End-To-End SpeechSynthesis system with knowledge distillation ☆18 · Jul 16, 2022 · Updated 3 years ago
- ☆363 · Jun 26, 2024 · Updated last year
- A tokenizer, text cleaner, and phonemizer for many human languages. ☆331 · Nov 15, 2024 · Updated last year
- Open models for Coqui STT ☆153 · May 9, 2023 · Updated 2 years ago
- ☆64 · Sep 18, 2022 · Updated 3 years ago
- YourTTS: Towards Zero-Shot Multi-Speaker TTS and Zero-Shot Voice Conversion for everyone ☆1,053 · Nov 4, 2024 · Updated last year
- E2E TTS using Conditional Flow Matching (Experimental*) ☆71 · Nov 10, 2023 · Updated 2 years ago
- A list of accessible speech corpora for ASR, TTS, and other Speech Technologies ☆1,390 · Jun 6, 2024 · Updated last year
- Acoustic models for: A Comparison of Discrete and Soft Speech Units for Improved Voice Conversion ☆104 · Mar 10, 2026 · Updated 2 weeks ago
- ☆163 · Sep 19, 2022 · Updated 3 years ago
- Pytorch implementation of LearnableUpsamplingLayer (NaturalSpeech, Tan et al., 2022) ☆57 · Mar 12, 2024 · Updated 2 years ago
- Tensorflow and kaldi implementation of our paper "VAE-based regularization for deep speaker embedding" ☆11 · Mar 24, 2023 · Updated 3 years ago
- This is the official repository for the HUI-Audio-Corpus-German. The corresponding paper is in the process of publication. With the repo… ☆36 · Mar 31, 2023 · Updated 2 years ago
- PITS: Variational Pitch Inference for End-to-end Pitch-controllable TTS without External Pitch Predictor ☆17 · Apr 13, 2023 · Updated 2 years ago
- Single Channel Speech Enhancement Methods and Toolbox ☆48 · Feb 26, 2026 · Updated last month
- 🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production ☆44,896 · Aug 16, 2024 · Updated last year
- Avocodo: Generative Adversarial Network for Artifact-free Vocoder ☆122 · Jul 14, 2022 · Updated 3 years ago
- Phoneme-Level BERT for Enhanced Prosody of Text-to-Speech with Grapheme Predictions ☆268 · Jan 13, 2025 · Updated last year
- Official implementation of "Automatic Tuning of Loss Trade-offs without Hyper-parameter Search in End-to-End Zero-Shot Speech Synthesis",… ☆80 · May 29, 2023 · Updated 2 years ago
- Production First and Production Ready End-to-End Text-to-Speech Toolkit ☆417 · Nov 20, 2025 · Updated 4 months ago
- Provides training, inference and voice conversion recipes for RADTTS and RADTTS++: Flow-based TTS models with Robust Alignment Learning, … ☆291 · Apr 6, 2023 · Updated 2 years ago
- ICASSP 2023 Accepted ☆190 · May 6, 2024 · Updated last year
- A differentiable version of SPTK ☆196 · Feb 26, 2026 · Updated last month
- Prosody and Pronunciation Modification Network ☆63 · May 5, 2025 · Updated 10 months ago
- [InterSpeech'2024] FluentEditor: Text-based Speech Editing by Considering Acoustic and Prosody Consistency ☆60 · Oct 23, 2024 · Updated last year
- ☆51 · Mar 5, 2026 · Updated 3 weeks ago
- Speech Representation Disentanglement with Adversarial Mutual Information Learning for One-shot Voice Conversion (Interspeech 2022) ☆119 · Feb 7, 2024 · Updated 2 years ago
- ☆18 · Jan 17, 2022 · Updated 4 years ago
- 🐸STT - The deep learning toolkit for Speech-to-Text. Training and deploying STT models has never been so easy. ☆2,577 · Mar 11, 2024 · Updated 2 years ago
- HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis ☆2,332 · Jul 27, 2024 · Updated last year
- Official repository of https://doi.org/10.1109/TASLP.2022.3167258. More up-to-date code is in "refactor" branch. ☆194 · Jun 8, 2023 · Updated 2 years ago
- The source code of our paper "Diffsound: discrete diffusion model for text-to-sound generation" ☆366 · Aug 3, 2023 · Updated 2 years ago
- Grapheme-to-Phoneme conversion with Joint-Sequence RnnLMs ☆31 · Dec 15, 2014 · Updated 11 years ago
- Hume AI ML Competitions ☆28 · Oct 28, 2022 · Updated 3 years ago