coqui-ai / Trainer
🐸 - A general purpose model trainer, as flexible as it gets
(★231, updated last year)
Alternatives and similar repositories for Trainer
Users interested in Trainer are comparing it to the libraries listed below.
- Implementation of Spear-TTS - multi-speaker text-to-speech attention network, in Pytorch (★275, updated 2 years ago)
- (★385, updated last year)
- (★261, updated last year)
- Official Implementation of StyleTTS (★456, updated 11 months ago)
- Scripts for computing the Intelligibility and CLVP scores for evaluating TTS models (★172, updated 2 years ago)
- NeMo text processing for ASR and TTS (★411, updated this week)
- [WIP] VoiceSmith makes training text to speech models easy. (★228, updated 3 years ago)
- Nix-TTS: Lightweight and End-to-end Text-to-Speech via Module-wise Distillation (★261, updated last month)
- Prompting Whisper for Audio-Visual Speech Recognition, Code-Switched Speech Recognition, and Zero-Shot Speech Translation (★151, updated last year)
- PyTorch code implementation of EfficientSpeech - to be presented at ICASSP2023. (★178, updated last year)
- SyntaSpeech: Syntax-aware Generative Adversarial Text-to-Speech; IJCAI 2022; Official code (★203, updated 3 years ago)
- A Non-Autoregressive Transformer based Text-to-Speech, supporting a family of SOTA transformers with supervised and unsupervised duration… (★328, updated 3 years ago)
- A tokenizer, text cleaner, and phonemizer for many human languages. (★331, updated last year)
- Live speech recognition using Facebook's wav2vec 2.0 model. (★375, updated last year)
- Desktop application for neural speech synthesis written in C++ (★213, updated 2 years ago)
- Your one-stop solution for voice dataset creation (★128, updated 2 years ago)
- PyTorch Implementation of PortaSpeech: Portable and High-Quality Generative Text-to-Speech (★341, updated 3 years ago)
- (★158, updated last month)
- [ICASSP 2024] This is the official code for "VoiceFlow: Efficient Text-to-Speech with Rectified Flow Matching" (★365, updated last year)
- PyTorch Implementation of FastSpeech 2 : Fast and High-Quality End-to-End Text to Speech (★232, updated 3 years ago)
- Go from raw audio files to a text-audio dataset automatically with OpenAI's Whisper. (★137, updated 2 years ago)
- Phoneme-Level BERT for Enhanced Prosody of Text-to-Speech with Grapheme Predictions (★264, updated 11 months ago)
- Provides training, inference and voice conversion recipes for RADTTS and RADTTS++: Flow-based TTS models with Robust Alignment Learning, … (★292, updated 2 years ago)
- Reproducible experimental protocols for multimedia (audio, video, text) database (★111, updated last month)
- A curated list of awesome voice conversion, projects and communities. (★258, updated last month)
- Unofficial implementation of NVIDIA P-Flow TTS paper (★231, updated last year)
- (★319, updated last year)
- Implementation of Meta-Voicebox : The first generative AI model for speech to generalize across tasks with state-of-the-art performance. (★589, updated 2 years ago)
- (★275, updated last year)
- On-device voice activity detection (VAD) powered by deep learning (★241, updated last week)