coqui-ai / TrainerLinks
πΈ - A general purpose model trainer, as flexible as it gets
β218Updated last year
Alternatives and similar repositories for Trainer
Users that are interested in Trainer are comparing it to the libraries listed below
Sorting:
- Implementation of Spear-TTS - multi-speaker text-to-speech attention network, in Pytorchβ271Updated last year
- β257Updated last year
- Unofficial implementation of NVIDIA P-Flow TTS paperβ223Updated 5 months ago
- PyTorch Implementation of PortaSpeech: Portable and High-Quality Generative Text-to-Speechβ335Updated 3 years ago
- Provides training, inference and voice conversion recipes for RADTTS and RADTTS++: Flow-based TTS models with Robust Alignment Learning, β¦β286Updated 2 years ago
- PyTorch code implementation of EfficientSpeech - to be presented at ICASSP2023.β169Updated last year
- β363Updated 9 months ago
- PyTorch Implementation of FastSpeech 2 : Fast and High-Quality End-to-End Text to Speechβ230Updated 2 years ago
- A Non-Autoregressive Transformer based Text-to-Speech, supporting a family of SOTA transformers with supervised and unsupervised durationβ¦β326Updated 2 years ago
- NeMo text processing for ASR and TTSβ338Updated 3 weeks ago
- π€ Nix-TTS: Lightweight and End-to-end Text-to-Speech via Module-wise Distillationβ253Updated last year
- Official Implementation of StyleTTSβ432Updated 4 months ago
- SyntaSpeech: Syntax-aware Generative Adversarial Text-to-Speech; IJCAI 2022; Official codeβ202Updated 2 years ago
- [WIP] VoiceSmith makes training text to speech models easy.β225Updated 2 years ago
- UniSpeech - Large Scale Self-Supervised Learning for Speechβ463Updated last year
- Your one-stop solution for voice dataset creationβ119Updated last year
- Phoneme-Level BERT for Enhanced Prosody of Text-to-Speech with Grapheme Predictionsβ251Updated 4 months ago
- Grapheme to phoneme conversion with deep learning.β384Updated last year
- Fine-tune and evaluate Whisper models for Automatic Speech Recognition (ASR) on custom datasets or datasets from huggingface.β308Updated 2 years ago
- Performant and accurate speech recognition built on Pytorchβ253Updated 3 years ago
- PyTorch Implementation of Non-autoregressive Expressive (emotional, conversational) TTS based on FastSpeech2, supporting English, Korean,β¦β301Updated 3 years ago
- πΈSTT integration examplesβ128Updated 2 years ago
- Official Implementation of StyleTTS-VCβ182Updated 4 months ago
- A fully working pytorch implementation of NaturalSpeech (Tan et al., 2022)β475Updated last year
- A tokenizer, text cleaner, and phonemizer for many human languages.β315Updated 6 months ago
- Faster Tortoise inference then Tortoise Fast Forkβ126Updated last year
- The reproduced code for Google's SoundStormβ267Updated last year
- β294Updated 11 months ago
- Putting flows on top of neural transducers for better TTSβ62Updated last week
- Scripts for computing the Intelligibility and CLVP scores for evaluating TTS modelsβ155Updated last year