coqui-ai / TrainerLinks
πΈ  - A general purpose model trainer, as flexible as it gets
β227Updated last year
Alternatives and similar repositories for Trainer
Users that are interested in Trainer are comparing it to the libraries listed below
Sorting:
- Implementation of Spear-TTS - multi-speaker text-to-speech attention network, in Pytorchβ274Updated 2 years ago
- [WIP] VoiceSmith makes training text to speech models easy.β226Updated 3 years ago
- A tokenizer, text cleaner, and phonemizer for many human languages.β326Updated 11 months ago
- β378Updated last year
- PyTorch code implementation of EfficientSpeech - to be presented at ICASSP2023.β177Updated last year
- β262Updated last year
- π€ Nix-TTS: Lightweight and End-to-end Text-to-Speech via Module-wise Distillationβ259Updated 2 years ago
- Google's SoundStorm: Efficient Parallel Audio Generationβ132Updated 2 years ago
- Official Implementation of StyleTTSβ453Updated 9 months ago
- NeMo text processing for ASR and TTSβ380Updated this week
- PyTorch Implementation of PortaSpeech: Portable and High-Quality Generative Text-to-Speechβ341Updated 3 years ago
- Scripts for computing the Intelligibility and CLVP scores for evaluating TTS modelsβ163Updated last year
- β275Updated last year
- Your one-stop solution for voice dataset creationβ127Updated last year
- SyntaSpeech: Syntax-aware Generative Adversarial Text-to-Speech; IJCAI 2022; Official codeβ202Updated 3 years ago
- Promting Whisper for Audio-Visual Speech Recognition, Code-Switched Speech Recognition, and Zero-Shot Speech Translationβ151Updated last year
- A Non-Autoregressive Transformer based Text-to-Speech, supporting a family of SOTA transformers with supervised and unsupervised durationβ¦β326Updated 3 years ago
- Repository for the paper: VoiceMe: Personalized voice generation in TTSβ126Updated 3 years ago
- β145Updated 2 weeks ago
- On-device voice activity detection (VAD) powered by deep learningβ232Updated last month
- Provides training, inference and voice conversion recipes for RADTTS and RADTTS++: Flow-based TTS models with Robust Alignment Learning, β¦β290Updated 2 years ago
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GPβ¦β102Updated last year
- Unofficial implementation of NVIDIA P-Flow TTS paperβ230Updated 10 months ago
- Performant and accurate speech recognition built on Pytorchβ254Updated 3 years ago
- Phoneme-Level BERT for Enhanced Prosody of Text-to-Speech with Grapheme Predictionsβ262Updated 9 months ago
- Implementation of Meta-Voicebox : The first generative AI model for speech to generalize across tasks with state-of-the-art performance.β589Updated 2 years ago
- Tacotron 2 - PyTorch implementation with faster-than-realtime inference modified to enable cross lingual voice cloning.β360Updated 2 years ago
- Barkify: an unoffical training implementation of Bark TTS by suno-aiβ127Updated 2 years ago
- PyTorch Implementation of FastSpeech 2 : Fast and High-Quality End-to-End Text to Speechβ229Updated 3 years ago
- Desktop application for neural speech synthesis written in C++β213Updated 2 years ago