AndreevP / wvmosView external linksLinks
MOS score prediction by fine-tuned wav2vec2.0 model
☆174Oct 20, 2022Updated 3 years ago
Alternatives and similar repositories for wvmos
Users that are interested in wvmos are comparing it to the libraries listed below
Sorting:
- Deep Speech Distances PyTorch☆29Feb 21, 2022Updated 3 years ago
- HiFi++: a Unified Framework for Neural Vocoding, Bandwidth Extension and Speech Enhancement☆158Jul 16, 2022Updated 3 years ago
- Deep Neural Pitch Extractor for Voice Conversion and TTS Training☆146Aug 22, 2022Updated 3 years ago
- Unified Speech Language Model for paper "SpeechTokenizer: Unified Speech Tokenizer for Speech Large Language Models"(ICLR 2024)☆152Sep 14, 2023Updated 2 years ago
- ☆55Jan 13, 2023Updated 3 years ago
- Official implementation of the paper: "LDNet: Unified Listener Dependent Modeling in MOS Prediction for Synthetic Speech"☆68Dec 13, 2021Updated 4 years ago
- Avocodo: Generative Adversarial Network for Artifact-free Vocoder☆122Jul 14, 2022Updated 3 years ago
- My hybrid TTS network that combines, VALL-E, VoiceBox, SpeechFlow, Seamless and TortoiseTTS into one☆26Aug 5, 2024Updated last year
- E2E TTS using Conditional Flow Matching (Experimental*)☆71Nov 10, 2023Updated 2 years ago
- Official Implementation of StyleTTS-VC☆196Jan 14, 2025Updated last year
- UT-Sarulab MOS prediction system using SSL models☆294Apr 11, 2024Updated last year
- ☆46Apr 16, 2023Updated 2 years ago
- ☆19Mar 22, 2024Updated last year
- Official implementation of the source-filter HiFiGAN vocoder☆267Jul 29, 2023Updated 2 years ago
- A differentiable version of SPTK☆192Feb 3, 2026Updated last week
- Evaluation and Benchmarking of Speech Super-resolution Methods☆153Jun 17, 2022Updated 3 years ago
- Adaptive Vocoder for Custom Voice☆61Sep 22, 2022Updated 3 years ago
- NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment☆911Dec 1, 2024Updated last year
- Official implementation of the paper "Laughter Synthesis using Pseudo Phonetic Tokens with a Large-scale In-the-wild Laughter Corpus" acc…☆77Jul 16, 2023Updated 2 years ago
- ☆32Jan 6, 2022Updated 4 years ago
- ICASSP 2023 Accepted☆189May 6, 2024Updated last year
- Official implementation of "Avocodo: Generative Adversarial Network for Artifact-Free Vocoder" (AAAI2023)☆153Feb 1, 2023Updated 3 years ago
- VoiceBox neural network implementation☆110Aug 2, 2024Updated last year
- ☆26Jun 5, 2024Updated last year
- Unofficial implementation of NaturalSpeech2 for Voice Conversion and Text to Speech☆237Feb 29, 2024Updated last year
- It's a repository for implementations of neural speech editing algorithms.☆203Jan 9, 2024Updated 2 years ago
- Unofficial implementation of HiFi-GAN+ from the paper "Bandwidth Extension is All You Need" by Su, et al.☆223Oct 20, 2023Updated 2 years ago
- Implementation of Acoustic BPE (Shen et al., 2024), extended for RVQ-based Neural Audio Codecs☆77Dec 3, 2025Updated 2 months ago
- [ICASSP 2024] This is the official code for "VoiceFlow: Efficient Text-to-Speech with Rectified Flow Matching"☆366Sep 3, 2024Updated last year
- [InterSpeech'2024] FluentEditor:Text-based Speech Editing by Considering Acoustic and Prosody Consistency☆59Oct 23, 2024Updated last year
- ☆26Sep 22, 2022Updated 3 years ago
- ☆80Aug 8, 2025Updated 6 months ago
- Easy-to-Use Speech MOS predictors☆346Oct 24, 2023Updated 2 years ago
- Joint CTC-S2S Phoneme-level ASR for Voice Conversion and TTS (Text-Mel Alignment)☆124Jun 16, 2022Updated 3 years ago
- ☆61Nov 4, 2023Updated 2 years ago
- Generative Expressive Conversational Speech Synthesis (Accepted by MM'2024)☆62Nov 1, 2024Updated last year
- ☆86May 21, 2023Updated 2 years ago
- ☆44Sep 19, 2024Updated last year
- [ACL 2024] This is the Pytorch code for our paper "StyleDubber: Towards Multi-Scale Style Learning for Movie Dubbing"☆97Nov 14, 2024Updated last year