coqui-ai / TrainerLinks
πΈ - A general purpose model trainer, as flexible as it gets
β222Updated last year
Alternatives and similar repositories for Trainer
Users that are interested in Trainer are comparing it to the libraries listed below
Sorting:
- Implementation of Spear-TTS - multi-speaker text-to-speech attention network, in Pytorchβ273Updated last year
- [WIP] VoiceSmith makes training text to speech models easy.β225Updated 2 years ago
- β262Updated last year
- β377Updated last year
- SyntaSpeech: Syntax-aware Generative Adversarial Text-to-Speech; IJCAI 2022; Official codeβ202Updated 3 years ago
- β274Updated last year
- Official Implementation of StyleTTSβ447Updated 8 months ago
- π€ Nix-TTS: Lightweight and End-to-end Text-to-Speech via Module-wise Distillationβ257Updated 2 years ago
- NeMo text processing for ASR and TTSβ373Updated last week
- PyTorch code implementation of EfficientSpeech - to be presented at ICASSP2023.β178Updated last year
- Promting Whisper for Audio-Visual Speech Recognition, Code-Switched Speech Recognition, and Zero-Shot Speech Translationβ148Updated last year
- Google's SoundStorm: Efficient Parallel Audio Generationβ132Updated 2 years ago
- PyTorch Implementation of PortaSpeech: Portable and High-Quality Generative Text-to-Speechβ341Updated 3 years ago
- Scripts for computing the Intelligibility and CLVP scores for evaluating TTS modelsβ163Updated last year
- Your one-stop solution for voice dataset creationβ124Updated last year
- A tokenizer, text cleaner, and phonemizer for many human languages.β325Updated 10 months ago
- Repository for the paper: VoiceMe: Personalized voice generation in TTSβ125Updated 3 years ago
- A Non-Autoregressive Transformer based Text-to-Speech, supporting a family of SOTA transformers with supervised and unsupervised durationβ¦β325Updated 2 years ago
- Desktop application for neural speech synthesis written in C++β214Updated 2 years ago
- Phoneme-Level BERT for Enhanced Prosody of Text-to-Speech with Grapheme Predictionsβ260Updated 8 months ago
- Provides training, inference and voice conversion recipes for RADTTS and RADTTS++: Flow-based TTS models with Robust Alignment Learning, β¦β290Updated 2 years ago
- Unofficial implementation of NVIDIA P-Flow TTS paperβ229Updated 8 months ago
- Reproducible experimental protocols for multimedia (audio, video, text) databaseβ107Updated last week
- β127Updated 3 weeks ago
- Barkify: an unoffical training implementation of Bark TTS by suno-aiβ127Updated 2 years ago
- Monotonic Alignment Searchβ97Updated 3 months ago
- A live speech recognition using Facebooks wav2vec 2.0 model.β364Updated last year
- Official implementation of the TTS model Lina-Speechβ168Updated 8 months ago
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GPβ¦β103Updated 11 months ago
- PyTorch Implementation of FastSpeech 2 : Fast and High-Quality End-to-End Text to Speechβ229Updated 3 years ago