MontrealCorpusTools / mfa-models
Collection of pretrained models for the Montreal Forced Aligner
☆130, updated 7 months ago
Alternatives and similar repositories for mfa-models:
Users who are interested in mfa-models are comparing it to the libraries listed below:
- ☆111, updated 2 years ago
- Libriheavy: a 50,000-hour ASR corpus with punctuation, casing, and context ☆189, updated 5 months ago
- A curated list of speech datasets (110+ datasets, 75+ easy to download) ☆122, updated 2 years ago
- A repository for implementations of neural speech editing algorithms. ☆193, updated last year
- Charsiu: A neural phonetic aligner. ☆292, updated 2 years ago
- FACodec: Speech Codec with Attribute Factorization used for NaturalSpeech 3 ☆187, updated 10 months ago
- A sequence-to-sequence voice conversion toolkit. ☆93, updated 7 months ago
- Includes Basis-MelGAN, MelGAN, HifiGAN, and Multiband-HifiGAN; NHV may be added in the future. ☆154, updated 3 years ago
- UT-Sarulab MOS prediction system using SSL models ☆208, updated 10 months ago
- Multilingual G2P in 100 languages ☆299, updated last year
- Easy-to-Use Speech MOS predictors ☆264, updated last year
- The official implementation of VAENAR-TTS, a VAE-based non-autoregressive TTS model. ☆145, updated 3 years ago
- Official implementation of Multi-Stage Multi-Codebook (MSMC) TTS ☆162, updated 10 months ago
- ☆115, updated 2 years ago
- Deep Neural Pitch Extractor for Voice Conversion and TTS Training ☆121, updated 2 years ago
- Official repository of STYLER: Style Factor Modeling with Rapidity and Robustness via Speech Decomposition for Expressive and Controllabl… ☆159, updated 3 years ago
- ZMM-TTS: Zero-shot Multilingual and Multispeaker Speech Synthesis Conditioned on Self-supervised Discrete Speech Representations ☆143, updated 11 months ago
- Evaluation and Benchmarking of Speech Super-resolution Methods ☆146, updated 2 years ago
- Official implementation of Meta-StyleSpeech and StyleSpeech ☆242, updated 3 years ago
- This is the implementation for "ControlVC: Zero-Shot Voice Conversion with Time-Varying Controls on Pitch and Rhythm" ☆130, updated last year
- ☆74, updated 2 years ago
- Predicts the level of noise and reverberation in your audio files ☆144, updated 8 months ago
- Audio Codec Speech processing Universal PERformance Benchmark ☆238, updated 3 months ago
- A Non-Autoregressive End-to-End Text-to-Speech (text-to-wav), supporting a family of SOTA unsupervised duration modeling methods. This project g… ☆146, updated 2 years ago
- Multilingual speech aligner ☆72, updated last year
- Reference-aware automatic speech evaluation toolkit ☆142, updated 2 months ago
- Kaldi-compatible online & offline feature extraction with PyTorch, supporting CUDA, batch processing, chunk processing, and autograd - P… ☆194, updated 2 weeks ago
- Chinese Text Normalization and Dataset ☆82, updated 2 years ago
- Byte-based multilingual transformer TTS for low-resource/few-shot language adaptation. ☆88, updated 2 years ago
- Unofficial implementation of miipher ☆119, updated 10 months ago