coqui-ai / Trainer
🐸 - A general-purpose model trainer, as flexible as it gets
★233 · Updated last year
Alternatives and similar repositories for Trainer
Users who are interested in Trainer compare it to the libraries listed below.
- ★389 · Updated last year
- ★258 · Updated last year
- Performant and accurate speech recognition built on PyTorch ★254 · Updated 3 years ago
- Official Implementation of StyleTTS ★460 · Updated last year
- Your one-stop solution for voice dataset creation ★128 · Updated 2 years ago
- Implementation of Spear-TTS - multi-speaker text-to-speech attention network, in PyTorch ★277 · Updated 2 years ago
- Nix-TTS: Lightweight and End-to-end Text-to-Speech via Module-wise Distillation ★260 · Updated 2 months ago
- PyTorch Implementation of PortaSpeech: Portable and High-Quality Generative Text-to-Speech ★341 · Updated 3 years ago
- [WIP] VoiceSmith makes training text to speech models easy. ★228 · Updated 3 years ago
- Live speech recognition using Facebook's wav2vec 2.0 model. ★376 · Updated 2 years ago
- Go from raw audio files to a text-audio dataset automatically with OpenAI's Whisper. ★137 · Updated 2 years ago
- A tokenizer, text cleaner, and phonemizer for many human languages. ★331 · Updated last year
- Scripts for computing the Intelligibility and CLVP scores for evaluating TTS models ★175 · Updated 2 years ago
- Implementation of Meta-Voicebox: The first generative AI model for speech to generalize across tasks with state-of-the-art performance. ★588 · Updated 2 years ago
- PyTorch code implementation of EfficientSpeech - to be presented at ICASSP2023. ★179 · Updated last year
- Prompting Whisper for Audio-Visual Speech Recognition, Code-Switched Speech Recognition, and Zero-Shot Speech Translation ★150 · Updated 2 years ago
- NeMo text processing for ASR and TTS ★424 · Updated last week
- SyntaSpeech: Syntax-aware Generative Adversarial Text-to-Speech; IJCAI 2022; Official code ★203 · Updated 3 years ago
- ★357 · Updated last year
- Provides training, inference and voice conversion recipes for RADTTS and RADTTS++: Flow-based TTS models with Robust Alignment Learning, … ★292 · Updated 2 years ago
- Desktop application for neural speech synthesis written in C++ ★212 · Updated last week
- Putting flows on top of neural transducers for better TTS ★65 · Updated 3 weeks ago
- Open models for Coqui STT ★153 · Updated 2 years ago
- ★275 · Updated last year
- Google's SoundStorm: Efficient Parallel Audio Generation ★131 · Updated 2 years ago
- A toolkit for processing speech data and creating speech datasets ★200 · Updated last week
- Phoneme-Level BERT for Enhanced Prosody of Text-to-Speech with Grapheme Predictions ★267 · Updated last year
- CVSS: A Massively Multilingual Speech-to-Speech Translation Corpus ★219 · Updated 3 years ago
- A Non-Autoregressive Transformer based Text-to-Speech, supporting a family of SOTA transformers with supervised and unsupervised duration… ★328 · Updated 3 years ago
- Tacotron 2 - PyTorch implementation with faster-than-realtime inference, modified to enable cross-lingual voice cloning. ★360 · Updated 2 years ago