MOS score prediction by fine-tuned wav2vec2.0 model
☆176Oct 20, 2022Updated 3 years ago
Alternatives and similar repositories for wvmos
Users that are interested in wvmos are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Deep Speech Distances PyTorch☆29Feb 21, 2022Updated 4 years ago
- HiFi++: a Unified Framework for Neural Vocoding, Bandwidth Extension and Speech Enhancement☆159Jul 16, 2022Updated 3 years ago
- Official implementation of the paper: "LDNet: Unified Listener Dependent Modeling in MOS Prediction for Synthetic Speech"☆68Dec 13, 2021Updated 4 years ago
- Deep Neural Pitch Extractor for Voice Conversion and TTS Training☆147Aug 22, 2022Updated 3 years ago
- Unified Speech Language Model for paper "SpeechTokenizer: Unified Speech Tokenizer for Speech Large Language Models"(ICLR 2024)☆152Sep 14, 2023Updated 2 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- A differentiable version of SPTK☆196Feb 26, 2026Updated last month
- NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment☆922Dec 1, 2024Updated last year
- My hybrid TTS network that combines, VALL-E, VoiceBox, SpeechFlow, Seamless and TortoiseTTS into one☆26Aug 5, 2024Updated last year
- ☆55Jan 13, 2023Updated 3 years ago
- Avocodo: Generative Adversarial Network for Artifact-free Vocoder☆122Jul 14, 2022Updated 3 years ago
- E2E TTS using Conditional Flow Matching (Experimental*)☆71Nov 10, 2023Updated 2 years ago
- UT-Sarulab MOS prediction system using SSL models☆297Apr 11, 2024Updated last year
- Official Implementation of StyleTTS-VC☆198Jan 14, 2025Updated last year
- Adaptive Vocoder for Custom Voice☆61Sep 22, 2022Updated 3 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Evaluation and Benchmarking of Speech Super-resolution Methods☆153Jun 17, 2022Updated 3 years ago
- ☆46Apr 16, 2023Updated 2 years ago
- ☆19Mar 22, 2024Updated 2 years ago
- Official implementation of the source-filter HiFiGAN vocoder☆270Jul 29, 2023Updated 2 years ago
- Official implementation of "Avocodo: Generative Adversarial Network for Artifact-Free Vocoder" (AAAI2023)☆154Feb 1, 2023Updated 3 years ago
- ☆87May 21, 2023Updated 2 years ago
- ☆26Sep 22, 2022Updated 3 years ago
- Official implementation of the paper "Laughter Synthesis using Pseudo Phonetic Tokens with a Large-scale In-the-wild Laughter Corpus" acc…☆77Jul 16, 2023Updated 2 years ago
- ☆32Jan 6, 2022Updated 4 years ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- iSTFTNet : Fast and Lightweight Mel-spectrogram Vocoder Incorporating Inverse Short-time Fourier Transform☆274Jul 15, 2025Updated 8 months ago
- ICASSP 2023 Accepted☆190May 6, 2024Updated last year
- Pytorch implementation of BigVSAN☆203Dec 9, 2025Updated 3 months ago
- [ICASSP 2024] This is the official code for "VoiceFlow: Efficient Text-to-Speech with Rectified Flow Matching"☆372Sep 3, 2024Updated last year
- The open source code for SimpleSpeech series☆145Oct 8, 2024Updated last year
- Generative Expressive Conversational Speech Synthesis (Accepted by MM'2024)☆62Nov 1, 2024Updated last year
- Unofficial implementation of NaturalSpeech2 for Voice Conversion and Text to Speech☆237Feb 29, 2024Updated 2 years ago
- Easy-to-Use Speech MOS predictors☆348Oct 24, 2023Updated 2 years ago
- ☆61Nov 4, 2023Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Unofficial implementation of HiFi-GAN+ from the paper "Bandwidth Extension is All You Need" by Su, et al.☆223Oct 20, 2023Updated 2 years ago
- ☆26Jun 5, 2024Updated last year
- PyTorch Implementation of ProDiff (ACM-MM'22) with a Extremely-Fast diffusion speech synthesis pipeline☆432Apr 19, 2023Updated 2 years ago
- [AAAI 2024] CTX-txt2vec, the acoustic model in UniCATS☆64Nov 18, 2024Updated last year
- ☆259May 15, 2023Updated 2 years ago
- VoiceBox neural network implementation☆110Aug 2, 2024Updated last year
- Joint CTC-S2S Phoneme-level ASR for Voice Conversion and TTS (Text-Mel Alignment)☆125Jun 16, 2022Updated 3 years ago