πΈ - A general purpose model trainer, as flexible as it gets
β234Mar 7, 2024Updated 2 years ago
Alternatives and similar repositories for Trainer
Users that are interested in Trainer are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Simple but maybe too simple config management through python data classes. We use it for machine learning.β108Apr 12, 2023Updated 3 years ago
- Coqui STT Model Manager - install, manage and try out Coqui STT models from the Model Zooβ27Mar 24, 2023Updated 3 years ago
- π« check your data, before you wreck your modelβ16Aug 11, 2022Updated 3 years ago
- πΈ collection of TTS papersβ730Jul 4, 2024Updated last year
- This repo contains the baseline model recipes and pre-trained model for GramVanni hindi ASR challengeβ16Mar 26, 2022Updated 4 years ago
- Managed hosting for WordPress and PHP on Cloudways β’ AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- End-To-End SpeechSynthesis system with knowledge distillationβ18Jul 16, 2022Updated 3 years ago
- β367Jun 26, 2024Updated last year
- A tokenizer, text cleaner, and phonemizer for many human languages.β330Nov 15, 2024Updated last year
- Open models for Coqui STTβ155May 9, 2023Updated 3 years ago
- β64Sep 18, 2022Updated 3 years ago
- YourTTS: Towards Zero-Shot Multi-Speaker TTS and Zero-Shot Voice Conversion for everyoneβ1,056Nov 4, 2024Updated last year
- E2E TTS using Conditional Flow Matching (Experimental*)β71Nov 10, 2023Updated 2 years ago
- π A list of accessible speech corpora for ASR, TTS, and other Speech Technologiesβ1,398Jun 6, 2024Updated 2 years ago
- Acoustic models for: A Comparison of Discrete and Soft Speech Units for Improved Voice Conversionβ104Mar 10, 2026Updated 3 months ago
- Wordpress hosting with auto-scaling - Free Trial Offer β’ AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- β163Sep 19, 2022Updated 3 years ago
- Pytorch implementation of LearnableUpsamplingLayer (NaturalSpeech, Tan et al., 2022)β57Mar 12, 2024Updated 2 years ago
- Tensorflow and kaldi implementation of our paper "VAE-based regularization for deep speaker embedding"β11Mar 24, 2023Updated 3 years ago
- This is the official repository for the HUI-Audio-Corpus-German. The corresponding paper is in the process of publication. With the repoβ¦β35Mar 31, 2023Updated 3 years ago
- PITS: Variational Pitch Inference for End-to-end Pitch-controllable TTS without External Pitch Predictorβ17Apr 13, 2023Updated 3 years ago
- Single Channel Speech Enhancement Methods and Toolboxβ54Apr 8, 2026Updated 2 months ago
- πΈπ¬ - a deep learning toolkit for Text-to-Speech, battle-tested in research and productionβ45,567Aug 16, 2024Updated last year
- Avocodo: Generative Adversarial Network for Artifact-free Vocoderβ122Jul 14, 2022Updated 3 years ago
- Phoneme-Level BERT for Enhanced Prosody of Text-to-Speech with Grapheme Predictionsβ269Jan 13, 2025Updated last year
- Deploy open-source AI quickly and easily - Special Bonus Offer β’ AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Official implementation of "Automatic Tuning of Loss Trade-offs without Hyper-parameter Search in End-to-End Zero-Shot Speech Synthesis",β¦β80May 29, 2023Updated 3 years ago
- iSeparate library for the SDX2023 challengeβ15Dec 15, 2023Updated 2 years ago
- Production First and Production Ready End-to-End Text-to-Speech Toolkitβ416Nov 20, 2025Updated 6 months ago
- Provides training, inference and voice conversion recipes for RADTTS and RADTTS++: Flow-based TTS models with Robust Alignment Learning, β¦β293Apr 6, 2023Updated 3 years ago
- ICASSP 2023 Acceptedβ191May 6, 2024Updated 2 years ago
- A differentiable version of SPTKβ201Jun 2, 2026Updated 2 weeks ago
- Prosody and Pronunciation Modification Networkβ63May 5, 2025Updated last year
- [InterSpeech'2024] FluentEditor:Text-based Speech Editing by Considering Acoustic and Prosody Consistencyβ62Oct 23, 2024Updated last year
- β51Mar 5, 2026Updated 3 months ago
- Managed Database hosting by DigitalOcean β’ AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Speech Representation Disentanglement with Adversarial Mutual Information Learning for One-shot Voice Conversion (Interspeech 2022)β119Feb 7, 2024Updated 2 years ago
- β18Jan 17, 2022Updated 4 years ago
- πΈSTT - The deep learning toolkit for Speech-to-Text. Training and deploying STT models has never been so easy.β2,587Mar 11, 2024Updated 2 years ago
- HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesisβ2,353Jul 27, 2024Updated last year
- Gui for users who use the coqui-TTS vits model.β15Sep 16, 2022Updated 3 years ago
- Official repository of https://doi.org/10.1109/TASLP.2022.3167258. More up-to-date code is in "refactor" branch.β195Jun 8, 2023Updated 3 years ago
- The source code of our paper "Diffsound: discrete diffusion model for text-to-sound generation"β367Aug 3, 2023Updated 2 years ago