coqui-ai / Trainer
πΈ - A general purpose model trainer, as flexible as it gets
β205Updated 11 months ago
Alternatives and similar repositories for Trainer:
Users that are interested in Trainer are comparing it to the libraries listed below
- Implementation of Spear-TTS - multi-speaker text-to-speech attention network, in Pytorchβ266Updated last year
- On-device voice activity detection (VAD) powered by deep learningβ197Updated this week
- β350Updated 10 months ago
- NeMo text processing for ASR and TTSβ304Updated 2 weeks ago
- π€ Nix-TTS: Lightweight and End-to-end Text-to-Speech via Module-wise Distillationβ246Updated last year
- β254Updated 11 months ago
- SyntaSpeech: Syntax-aware Generative Adversarial Text-to-Speech; IJCAI 2022; Official codeβ199Updated 2 years ago
- β344Updated 5 months ago
- Official Implementation of StyleTTSβ417Updated last month
- Faster Tortoise inference then Tortoise Fast Forkβ128Updated 9 months ago
- A tokenizer, text cleaner, and phonemizer for many human languages.β301Updated 3 months ago
- A live speech recognition using Facebooks wav2vec 2.0 model.β340Updated last year
- Phoneme-Level BERT for Enhanced Prosody of Text-to-Speech with Grapheme Predictionsβ233Updated last month
- A Non-Autoregressive Transformer based Text-to-Speech, supporting a family of SOTA transformers with supervised and unsupervised durationβ¦β324Updated 2 years ago
- PyTorch Implementation of Non-autoregressive Expressive (emotional, conversational) TTS based on FastSpeech2, supporting English, Korean,β¦β295Updated 3 years ago
- Open models for Coqui STTβ127Updated last year
- Scripts for computing the Intelligibility and CLVP scores for evaluating TTS modelsβ148Updated last year
- Unofficial implementation of NVIDIA P-Flow TTS paperβ220Updated last month
- Desktop application for neural speech synthesis written in C++β213Updated last year
- TorToiSe fine-tuning with DLASβ218Updated 6 months ago
- PyTorch Implementation of FastSpeech 2 : Fast and High-Quality End-to-End Text to Speechβ229Updated 2 years ago
- Provides training, inference and voice conversion recipes for RADTTS and RADTTS++: Flow-based TTS models with Robust Alignment Learning, β¦β285Updated last year
- [WIP] VoiceSmith makes training text to speech models easy.β224Updated 2 years ago
- The reproduced code for Google's SoundStormβ264Updated last year
- π Create labeled datasets, enhance audio quality, identify speakers, support diverse dataset types. π§π₯π Advanced audio processing.β235Updated 8 months ago
- Your one-stop solution for voice dataset creationβ117Updated last year
- Grapheme to phoneme conversion with deep learning.β375Updated last year
- Neural HMMs are all you need (for high-quality attention-free TTS)β157Updated last week
- Repository for the paper: VoiceMe: Personalized voice generation in TTSβ126Updated 2 years ago
- β69Updated 2 months ago