πΈ - A general purpose model trainer, as flexible as it gets
β234Mar 7, 2024Updated 2 years ago
Alternatives and similar repositories for Trainer
Users that are interested in Trainer are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Simple but maybe too simple config management through python data classes. We use it for machine learning.β108Apr 12, 2023Updated 3 years ago
- Coqui STT Model Manager - install, manage and try out Coqui STT models from the Model Zooβ27Mar 24, 2023Updated 3 years ago
- π« check your data, before you wreck your modelβ16Aug 11, 2022Updated 3 years ago
- πΈ collection of TTS papersβ726Jul 4, 2024Updated last year
- This repo contains the baseline model recipes and pre-trained model for GramVanni hindi ASR challengeβ15Mar 26, 2022Updated 4 years ago
- AI Agents on DigitalOcean Gradient AI Platform β’ AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- End-To-End SpeechSynthesis system with knowledge distillationβ18Jul 16, 2022Updated 3 years ago
- β364Jun 26, 2024Updated last year
- A tokenizer, text cleaner, and phonemizer for many human languages.β331Nov 15, 2024Updated last year
- Open models for Coqui STTβ154May 9, 2023Updated 2 years ago
- β64Sep 18, 2022Updated 3 years ago
- YourTTS: Towards Zero-Shot Multi-Speaker TTS and Zero-Shot Voice Conversion for everyoneβ1,055Nov 4, 2024Updated last year
- E2E TTS using Conditional Flow Matching (Experimental*)β71Nov 10, 2023Updated 2 years ago
- π A list of accessible speech corpora for ASR, TTS, and other Speech Technologiesβ1,391Jun 6, 2024Updated last year
- Acoustic models for: A Comparison of Discrete and Soft Speech Units for Improved Voice Conversionβ104Mar 10, 2026Updated last month
- GPU virtual machines on DigitalOcean Gradient AI β’ AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- β163Sep 19, 2022Updated 3 years ago
- Pytorch implementation of LearnableUpsamplingLayer (NaturalSpeech, Tan et al., 2022)β57Mar 12, 2024Updated 2 years ago
- Tensorflow and kaldi implementation of our paper "VAE-based regularization for deep speaker embedding"β11Mar 24, 2023Updated 3 years ago
- This is the official repository for the HUI-Audio-Corpus-German. The corresponding paper is in the process of publication. With the repoβ¦β36Mar 31, 2023Updated 3 years ago
- PITS: Variational Pitch Inference for End-to-end Pitch-controllable TTS without External Pitch Predictorβ17Apr 13, 2023Updated 3 years ago
- Single Channel Speech Enhancement Methods and Toolboxβ50Apr 8, 2026Updated last week
- πΈπ¬ - a deep learning toolkit for Text-to-Speech, battle-tested in research and productionβ45,043Aug 16, 2024Updated last year
- Avocodo: Generative Adversarial Network for Artifact-free Vocoderβ122Jul 14, 2022Updated 3 years ago
- Phoneme-Level BERT for Enhanced Prosody of Text-to-Speech with Grapheme Predictionsβ268Jan 13, 2025Updated last year
- 1-Click AI Models by DigitalOcean Gradient β’ AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Official implementation of "Automatic Tuning of Loss Trade-offs without Hyper-parameter Search in End-to-End Zero-Shot Speech Synthesis",β¦β80May 29, 2023Updated 2 years ago
- iSeparate library for the SDX2023 challengeβ15Dec 15, 2023Updated 2 years ago
- Production First and Production Ready End-to-End Text-to-Speech Toolkitβ416Nov 20, 2025Updated 4 months ago
- Provides training, inference and voice conversion recipes for RADTTS and RADTTS++: Flow-based TTS models with Robust Alignment Learning, β¦β291Apr 6, 2023Updated 3 years ago
- A differentiable version of SPTKβ197Mar 26, 2026Updated 3 weeks ago
- ICASSP 2023 Acceptedβ190May 6, 2024Updated last year
- Prosody and Pronunciation Modification Networkβ63May 5, 2025Updated 11 months ago
- [InterSpeech'2024] FluentEditor:Text-based Speech Editing by Considering Acoustic and Prosody Consistencyβ62Oct 23, 2024Updated last year
- β50Mar 5, 2026Updated last month
- Serverless GPU API endpoints on Runpod - Bonus Credits β’ AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Speech Representation Disentanglement with Adversarial Mutual Information Learning for One-shot Voice Conversion (Interspeech 2022)β119Feb 7, 2024Updated 2 years ago
- β18Jan 17, 2022Updated 4 years ago
- πΈSTT - The deep learning toolkit for Speech-to-Text. Training and deploying STT models has never been so easy.β2,581Mar 11, 2024Updated 2 years ago
- HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesisβ2,340Jul 27, 2024Updated last year
- ππ§ A minimalistic tool to fine-tune your LLMsβ18Aug 17, 2023Updated 2 years ago
- Official repository of https://doi.org/10.1109/TASLP.2022.3167258. More up-to-date code is in "refactor" branch.β194Jun 8, 2023Updated 2 years ago
- The source code of our paper "Diffsound: discrete diffusion model for text-to-sound generation"β366Aug 3, 2023Updated 2 years ago