coqui-ai / Trainer
🐸 - A general-purpose model trainer, as flexible as it gets
★216 · Updated last year
Alternatives and similar repositories for Trainer
Users that are interested in Trainer are comparing it to the libraries listed below
- Implementation of Spear-TTS - multi-speaker text-to-speech attention network, in PyTorch ★269 · Updated last year
- Official Implementation of StyleTTS ★431 · Updated 4 months ago
- Unofficial implementation of NVIDIA P-Flow TTS paper ★222 · Updated 4 months ago
- Scripts for computing the Intelligibility and CLVP scores for evaluating TTS models ★154 · Updated last year
- TorToiSe fine-tuning with DLAS ★220 · Updated 9 months ago
- [WIP] VoiceSmith makes training text to speech models easy. ★224 · Updated 2 years ago
- Performant and accurate speech recognition built on PyTorch ★253 · Updated 2 years ago
- Faster Tortoise inference than Tortoise Fast Fork ★128 · Updated last year
- iSTFTNet: Fast and Lightweight Mel-spectrogram Vocoder Incorporating Inverse Short-time Fourier Transform ★246 · Updated 2 years ago
- PyTorch Implementation of PortaSpeech: Portable and High-Quality Generative Text-to-Speech ★335 · Updated 3 years ago
- 🐤 Nix-TTS: Lightweight and End-to-end Text-to-Speech via Module-wise Distillation ★251 · Updated last year
- ★353 · Updated last year
- ★359 · Updated 8 months ago
- Provides training, inference and voice conversion recipes for RADTTS and RADTTS++: Flow-based TTS models with Robust Alignment Learning, … ★286 · Updated 2 years ago
- Your one-stop solution for voice dataset creation ★119 · Updated last year
- SyntaSpeech: Syntax-aware Generative Adversarial Text-to-Speech; IJCAI 2022; Official code ★201 · Updated 2 years ago
- Official repository of DailyTalk: Spoken Dialogue Dataset for Conversational Text-to-Speech, ICASSP 2023 ★216 · Updated 2 years ago
- Phoneme-Level BERT for Enhanced Prosody of Text-to-Speech with Grapheme Predictions ★248 · Updated 4 months ago
- The reproduced code for Google's SoundStorm ★267 · Updated last year
- Open models for Coqui STT ★138 · Updated 2 years ago
- A Non-Autoregressive Transformer based Text-to-Speech, supporting a family of SOTA transformers with supervised and unsupervised duration… ★325 · Updated 2 years ago
- A TTS model that makes a speaker speak new languages ★76 · Updated 10 months ago
- PyTorch Implementation of FastSpeech 2: Fast and High-Quality End-to-End Text to Speech ★230 · Updated 2 years ago
- Google's SoundStorm: Efficient Parallel Audio Generation ★132 · Updated last year
- The Open Source Code of UniAudio ★562 · Updated 9 months ago
- Zero-shot multimodal punctuation insertion and truecasing using Whisper ★114 · Updated 2 years ago
- [ICASSP 2024] This is the official code for "VoiceFlow: Efficient Text-to-Speech with Rectified Flow Matching" ★343 · Updated 8 months ago
- Reproducible experimental protocols for multimedia (audio, video, text) database ★100 · Updated 3 months ago
- ★256 · Updated last year
- LightSpeech: Lightweight and Fast Text to Speech with Neural Architecture Search ★87 · Updated 3 years ago