coqui-ai / Trainer
🐸 - A general purpose model trainer, as flexible as it gets
☆233 Updated last year
Alternatives and similar repositories for Trainer
Users who are interested in Trainer are comparing it to the libraries listed below.
- Implementation of Spear-TTS - multi-speaker text-to-speech attention network, in Pytorch ☆277 Updated 2 years ago
- ☆258 Updated last year
- ☆386 Updated last year
- PyTorch code implementation of EfficientSpeech - to be presented at ICASSP2023. ☆179 Updated last year
- 🐤 Nix-TTS: Lightweight and End-to-end Text-to-Speech via Module-wise Distillation ☆260 Updated 2 months ago
- NeMo text processing for ASR and TTS ☆418 Updated this week
- [WIP] VoiceSmith makes training text to speech models easy. ☆228 Updated 3 years ago
- Official Implementation of StyleTTS ☆460 Updated last year
- A tokenizer, text cleaner, and phonemizer for many human languages. ☆332 Updated last year
- Prompting Whisper for Audio-Visual Speech Recognition, Code-Switched Speech Recognition, and Zero-Shot Speech Translation ☆150 Updated 2 years ago
- Your one-stop solution for voice dataset creation ☆128 Updated 2 years ago
- SyntaSpeech: Syntax-aware Generative Adversarial Text-to-Speech; IJCAI 2022; Official code ☆203 Updated 3 years ago
- Repository for the paper: VoiceMe: Personalized voice generation in TTS ☆126 Updated 3 years ago
- PyTorch Implementation of PortaSpeech: Portable and High-Quality Generative Text-to-Speech ☆341 Updated 3 years ago
- Putting flows on top of neural transducers for better TTS ☆64 Updated last week
- Faster Tortoise inference than Tortoise Fast Fork ☆127 Updated last year
- Google's SoundStorm: Efficient Parallel Audio Generation ☆131 Updated 2 years ago
- Phoneme-Level BERT for Enhanced Prosody of Text-to-Speech with Grapheme Predictions ☆266 Updated last year
- Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code ☆154 Updated last year
- Scripts for computing the Intelligibility and CLVP scores for evaluating TTS models ☆175 Updated 2 years ago
- Desktop application for neural speech synthesis written in C++ ☆212 Updated last week
- ☆172 Updated this week
- On-device voice activity detection (VAD) powered by deep learning ☆243 Updated last week
- A Non-Autoregressive Transformer based Text-to-Speech, supporting a family of SOTA transformers with supervised and unsupervised duration… ☆328 Updated 3 years ago
- ☆204 Updated 3 years ago
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP… ☆104 Updated last year
- ☆275 Updated last year
- The reproduced code for Google's SoundStorm ☆270 Updated 2 years ago
- ☆257 Updated 2 years ago
- ☆357 Updated last year