coqui-ai / TrainerLinks
πΈ - A general purpose model trainer, as flexible as it gets
β218Updated last year
Alternatives and similar repositories for Trainer
Users that are interested in Trainer are comparing it to the libraries listed below
Sorting:
- PyTorch Implementation of PortaSpeech: Portable and High-Quality Generative Text-to-Speechβ338Updated 3 years ago
- π€ Nix-TTS: Lightweight and End-to-end Text-to-Speech via Module-wise Distillationβ254Updated last year
- Implementation of Spear-TTS - multi-speaker text-to-speech attention network, in Pytorchβ271Updated last year
- Official Implementation of StyleTTSβ435Updated 5 months ago
- NeMo text processing for ASR and TTSβ342Updated last week
- A Non-Autoregressive Transformer based Text-to-Speech, supporting a family of SOTA transformers with supervised and unsupervised durationβ¦β326Updated 2 years ago
- A tokenizer, text cleaner, and phonemizer for many human languages.β317Updated 7 months ago
- Tacotron 2 - PyTorch implementation with faster-than-realtime inference modified to enable cross lingual voice cloning.β361Updated 2 years ago
- Implementation of Voicebox, new SOTA Text-to-speech network from MetaAI, in Pytorchβ656Updated 8 months ago
- SyntaSpeech: Syntax-aware Generative Adversarial Text-to-Speech; IJCAI 2022; Official codeβ202Updated 2 years ago
- Phoneme-Level BERT for Enhanced Prosody of Text-to-Speech with Grapheme Predictionsβ258Updated 5 months ago
- [ICASSP 2024] This is the official code for "VoiceFlow: Efficient Text-to-Speech with Rectified Flow Matching"β347Updated 9 months ago
- iSTFTNet : Fast and Lightweight Mel-spectrogram Vocoder Incorporating Inverse Short-time Fourier Transformβ250Updated 2 years ago
- Repository for the paper: VoiceMe: Personalized voice generation in TTSβ125Updated 3 years ago
- Grapheme to phoneme conversion with deep learning.β388Updated last year
- β365Updated 9 months ago
- Segment an audio file and obtain utterance alignments. (Python package)β337Updated last year
- PyTorch Implementation of FastSpeech 2 : Fast and High-Quality End-to-End Text to Speechβ229Updated 3 years ago
- Unofficial implementation of NVIDIA P-Flow TTS paperβ225Updated 6 months ago
- PyTorch code implementation of EfficientSpeech - to be presented at ICASSP2023.β170Updated last year
- [WIP] VoiceSmith makes training text to speech models easy.β225Updated 2 years ago
- A fully working pytorch implementation of NaturalSpeech (Tan et al., 2022)β475Updated last year
- πΈSTT integration examplesβ129Updated 2 years ago
- An official reimplementation of the method described in the INTERSPEECH 2021 paper - Speech Resynthesis from Discrete Disentangled Self-Sβ¦β407Updated last year
- Official Implementation of StyleTTS-VCβ184Updated 5 months ago
- Go from raw audio files to a text-audio dataset automatically with OpenAI's Whisper.β137Updated last year
- β359Updated last year
- On-device voice activity detection (VAD) powered by deep learningβ218Updated last week
- PyTorch Implementation of Non-autoregressive Expressive (emotional, conversational) TTS based on FastSpeech2, supporting English, Korean,β¦β302Updated 3 years ago
- Provides training, inference and voice conversion recipes for RADTTS and RADTTS++: Flow-based TTS models with Robust Alignment Learning, β¦β286Updated 2 years ago