coqui-ai / Trainer
🐸 - A general-purpose model trainer, as flexible as it gets
★222 · Updated last year
Alternatives and similar repositories for Trainer
Users who are interested in Trainer are comparing it to the libraries listed below.
- Implementation of Spear-TTS - multi-speaker text-to-speech attention network, in PyTorch ★271 · Updated last year
- Official Implementation of StyleTTS ★439 · Updated 6 months ago
- NeMo text processing for ASR and TTS ★351 · Updated this week
- ★260 · Updated last year
- 🐤 Nix-TTS: Lightweight and End-to-end Text-to-Speech via Module-wise Distillation ★255 · Updated last year
- PyTorch code implementation of EfficientSpeech - to be presented at ICASSP 2023. ★174 · Updated last year
- Prompting Whisper for Audio-Visual Speech Recognition, Code-Switched Speech Recognition, and Zero-Shot Speech Translation ★148 · Updated last year
- ★374 · Updated 11 months ago
- [WIP] VoiceSmith makes training text to speech models easy. ★225 · Updated 2 years ago
- Open models for Coqui STT ★141 · Updated 2 years ago
- Go from raw audio files to a text-audio dataset automatically with OpenAI's Whisper. ★137 · Updated last year
- ★116 · Updated last week
- Your one-stop solution for voice dataset creation ★122 · Updated last year
- A tokenizer, text cleaner, and phonemizer for many human languages. ★321 · Updated 8 months ago
- On-device voice activity detection (VAD) powered by deep learning ★223 · Updated this week
- Live speech recognition using Facebook's wav2vec 2.0 model. ★362 · Updated last year
- A curated list of awesome voice activity detection ★59 · Updated 8 months ago
- SyntaSpeech: Syntax-aware Generative Adversarial Text-to-Speech; IJCAI 2022; Official code ★201 · Updated 2 years ago
- Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code ★149 · Updated last year
- Google's SoundStorm: Efficient Parallel Audio Generation ★132 · Updated 2 years ago
- PyTorch Implementation of PortaSpeech: Portable and High-Quality Generative Text-to-Speech ★339 · Updated 3 years ago
- Desktop application for neural speech synthesis written in C++ ★215 · Updated 2 years ago
- Scripts for computing the Intelligibility and CLVP scores for evaluating TTS models ★162 · Updated last year
- ★273 · Updated last year
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP… ★101 · Updated 10 months ago
- Repository for the paper: VoiceMe: Personalized voice generation in TTS ★125 · Updated 3 years ago
- Provides training, inference and voice conversion recipes for RADTTS and RADTTS++: Flow-based TTS models with Robust Alignment Learning, … ★288 · Updated 2 years ago
- Finetune VITS and MMS using HuggingFace's tools ★160 · Updated last year
- Putting flows on top of neural transducers for better TTS ★62 · Updated last month
- Implementation of Meta-Voicebox: The first generative AI model for speech to generalize across tasks with state-of-the-art performance. ★583 · Updated 2 years ago