AndreevP / wvmos
MOS score prediction by fine-tuned wav2vec2.0 model
☆156Updated 2 years ago
Alternatives and similar repositories for wvmos:
Users that are interested in wvmos are comparing it to the libraries listed below
- UT-Sarulab MOS prediction system using SSL models☆217Updated 11 months ago
- Evaluation and Benchmarking of Speech Super-resolution Methods☆148Updated 2 years ago
- Reference-aware automatic speech evaluation toolkit☆144Updated 3 months ago
- Fre-GAN: Adversarial Frequency-consistent Audio Synthesis☆103Updated 3 years ago
- HiFi++: a Unified Framework for Neural Vocoding, Bandwidth Extension and Speech Enhancement☆155Updated 2 years ago
- ☆91Updated last year
- Official repository for the paper "Chunked Autoregressive GAN for Conditional Waveform Synthesis"☆187Updated 2 years ago
- Yin pitch estimator in PyTorch☆114Updated 2 years ago
- Unofficial SoundStream implementation of Pytorch with training code and 16kHz pretrained checkpoint☆67Updated last year
- A pytorch implementation of MBNET: MOS PREDICTION FOR SYNTHESIZED SPEECH WITH MEAN-BIAS NETWORK☆60Updated 3 years ago
- UTokyo-SaruLab MOS Prediction System☆160Updated last month
- ☆163Updated 2 years ago
- HiFTNet: A Fast High-Quality Neural Vocoder with Harmonic-plus-Noise Filter and Inverse Short Time Fourier Transform☆156Updated 2 months ago
- An implementation of Microsoft's "AdaSpeech: Adaptive Text to Speech for Custom Voice"☆97Updated 2 years ago
- [AAAI 2024] Code for CTX-vec2wav in UniCATS☆128Updated 9 months ago
- This is the implementation for "ControlVC: Zero-Shot Voice Conversion with Time-Varying Controls on Pitch and Rhythm"☆132Updated last year
- Avocodo: Generative Adversarial Network for Artifact-free Vocoder☆118Updated 2 years ago
- ☆54Updated last year
- Pytorch implementation of subband decomposition☆92Updated 2 years ago
- LibriTTS-P: A Corpus with Speaking Style and Speaker Identity Prompts for Text-to-Speech and Style Captioning☆126Updated 9 months ago
- This github repo is for Neurips 2021 and Interspeech 2022 papers on Non-Matching Reference based estimation of speech quality assessment.…☆97Updated last year
- ☆123Updated 2 months ago
- Official implementation of the paper "BigCodec: Pushing the Limits of Low-Bitrate Neural Speech Codec"☆147Updated 6 months ago
- An official implementation of "UnitSpeech: Speaker-adaptive Speech Synthesis with Untranscribed Data"☆133Updated last year
- A simple package for Guided source separation (GSS)☆118Updated 10 months ago
- [InterSpeech 24] FreeV: Free Lunch For Vocoders Through Pseudo Inversed Mel Filter☆85Updated 8 months ago
- ☆112Updated 2 years ago
- Unofficial implementation of miipher☆120Updated 11 months ago
- Libriheavy: a 50,000 hours ASR corpus with punctuation casing and context☆192Updated 6 months ago
- Computes the Mel-Cepstral Distance of two WAV files based on the paper "Mel-Cepstral Distance Measure for Objective Speech Quality Assess…☆52Updated 3 months ago