coqui-ai / TrainerLinks
πΈ - A general purpose model trainer, as flexible as it gets
β220Updated last year
Alternatives and similar repositories for Trainer
Users that are interested in Trainer are comparing it to the libraries listed below
Sorting:
- Implementation of Spear-TTS - multi-speaker text-to-speech attention network, in Pytorchβ271Updated last year
- β370Updated 10 months ago
- Official Implementation of StyleTTSβ439Updated 6 months ago
- β260Updated last year
- NeMo text processing for ASR and TTSβ347Updated this week
- π€ Nix-TTS: Lightweight and End-to-end Text-to-Speech via Module-wise Distillationβ254Updated last year
- PyTorch Implementation of PortaSpeech: Portable and High-Quality Generative Text-to-Speechβ338Updated 3 years ago
- PyTorch code implementation of EfficientSpeech - to be presented at ICASSP2023.β173Updated last year
- A tokenizer, text cleaner, and phonemizer for many human languages.β320Updated 8 months ago
- [WIP] VoiceSmith makes training text to speech models easy.β225Updated 2 years ago
- A fully working pytorch implementation of NaturalSpeech (Tan et al., 2022)β474Updated last year
- A live speech recognition using Facebooks wav2vec 2.0 model.β361Updated last year
- Grapheme to phoneme conversion with deep learning.β389Updated last year
- SyntaSpeech: Syntax-aware Generative Adversarial Text-to-Speech; IJCAI 2022; Official codeβ202Updated 2 years ago
- A Non-Autoregressive Transformer based Text-to-Speech, supporting a family of SOTA transformers with supervised and unsupervised durationβ¦β326Updated 2 years ago
- Provides training, inference and voice conversion recipes for RADTTS and RADTTS++: Flow-based TTS models with Robust Alignment Learning, β¦β287Updated 2 years ago
- Phoneme-Level BERT for Enhanced Prosody of Text-to-Speech with Grapheme Predictionsβ260Updated 6 months ago
- Scripts for computing the Intelligibility and CLVP scores for evaluating TTS modelsβ159Updated last year
- Reproducible experimental protocols for multimedia (audio, video, text) databaseβ104Updated 5 months ago
- Performant and accurate speech recognition built on Pytorchβ253Updated 3 years ago
- On-device voice activity detection (VAD) powered by deep learningβ220Updated last week
- Your one-stop solution for voice dataset creationβ121Updated last year
- Promting Whisper for Audio-Visual Speech Recognition, Code-Switched Speech Recognition, and Zero-Shot Speech Translationβ147Updated last year
- Desktop application for neural speech synthesis written in C++β215Updated 2 years ago
- Implementation of Meta-Voicebox : The first generative AI model for speech to generalize across tasks with state-of-the-art performance.β582Updated 2 years ago
- β359Updated last year
- Unofficial implementation of NVIDIA P-Flow TTS paperβ225Updated 6 months ago
- Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of codeβ149Updated last year
- β195Updated 3 years ago
- An Open-source Streaming High-fidelity Neural Audio Codecβ479Updated 4 months ago