MontrealCorpusTools / mfa-models
Collection of pretrained models for the Montreal Forced Aligner
☆133Updated 8 months ago
Alternatives and similar repositories for mfa-models:
Users that are interested in mfa-models are comparing it to the libraries listed below
- ☆112Updated 2 years ago
- Charsiu: A neural phonetic aligner.☆294Updated 2 years ago
- Libriheavy: a 50,000 hours ASR corpus with punctuation casing and context☆191Updated 6 months ago
- PyTorch Implementation of ByteDance's Cross-speaker Emotion Transfer Based on Speaker Condition Layer Normalization and Semi-Supervised T…☆192Updated 2 years ago
- Multilingual G2P in 100 languages☆304Updated last year
- ☆115Updated 2 years ago
- UT-Sarulab MOS prediction system using SSL models☆216Updated 11 months ago
- Official Implement of Multi-Stage Multi-Codebook (MSMC) TTS☆162Updated 11 months ago
- Include Basis-MelGAN, MelGAN, HifiGAN and Multiband-HifiGAN, maybe NHV in the future.☆154Updated 3 years ago
- phoneme tokenizer and grapheme-to-phoneme model for 8k languages☆156Updated last year
- a curated list of speech datasets (110+ datasets, 75+ easy to download)☆125Updated 2 years ago
- The official implementation of VAENAR-TTS, a VAE based non-autoregressive TTS model.☆145Updated 3 years ago
- This is the GitHub page for publicly available emotional speech data.☆342Updated 3 years ago
- Byte-based multilingual transformer TTS for low-resource/few-shot language adaptation.☆88Updated 2 years ago
- Speech Representation Disentanglement with Adversarial Mutual Information Learning for One-shot Voice Conversion (Interspeech 2022)☆115Updated last year
- Phoneme-Level BERT for Enhanced Prosody of Text-to-Speech with Grapheme Predictions☆241Updated 2 months ago
- ☆75Updated 2 years ago
- Layer-wise analysis of self-supervised pre-trained speech representations☆102Updated 4 months ago
- A Full Text-Dependent End to End Mispronunciation Detection and Diagnosis with Easy Data Augment Techniques☆59Updated 3 years ago
- Official implementation for the paper Fine-grained style control in transformer-based text-to-speech synthesis.☆88Updated 3 years ago
- Unofficial implementation of miipher☆119Updated 10 months ago
- This is the Python library for an unsupervised, fast method for robust voice activity detection (rVAD), as in the paper rVAD: An Unsuperv…☆133Updated 2 months ago
- Easy-to-Use Speech MOS predictors☆270Updated last year
- iSTFTNet : Fast and Lightweight Mel-spectrogram Vocoder Incorporating Inverse Short-time Fourier Transform☆242Updated last year
- Predicts the level of noise and reverberation on your audiofiles☆144Updated 9 months ago
- Foreign Accent Conversion by Synthesizing Speech from Phonetic Posteriorgrams (Interspeech'19)☆140Updated last year
- ZMM-TTS: Zero-shot Multilingual and Multispeaker Speech Synthesis Conditioned on Self-supervised Discrete Speech Representations☆147Updated last year
- ☆63Updated 6 months ago
- It's a repository for implementations of neural speech editing algorithms.☆194Updated last year
- Chinese Text Normalization and Dataset☆82Updated 2 years ago