nii-yamagishilab / mos-finetune-sslLinks
☆106Updated 2 years ago
Alternatives and similar repositories for mos-finetune-ssl
Users that are interested in mos-finetune-ssl are comparing it to the libraries listed below
Sorting:
- Speech Human Evaluation Estimation Toolkit (SHEET)☆121Updated 3 weeks ago
- Reference-aware automatic speech evaluation toolkit☆164Updated 10 months ago
- A pytorch implementation of MBNET: MOS PREDICTION FOR SYNTHESIZED SPEECH WITH MEAN-BIAS NETWORK☆61Updated 4 years ago
- ☆58Updated last year
- ☆55Updated 10 months ago
- Unofficial SoundStream implementation of Pytorch with training code and 16kHz pretrained checkpoint☆74Updated 2 years ago
- This github repo is for Neurips 2021 and Interspeech 2022 papers on Non-Matching Reference based estimation of speech quality assessment.…☆104Updated 2 years ago
- ☆37Updated 4 years ago
- [AAAI 2024] Code for CTX-vec2wav in UniCATS☆130Updated last year
- Alignment files of LibriTTS.☆64Updated 5 years ago
- ☆32Updated 11 months ago
- This repository contains a set of codes to run (i.e., train, perform inference with, evaluate) a diarization method called EEND-vector-cl…☆76Updated 3 years ago
- ADAPTING SELF-SUPERVISED MODELS TO MULTI-TALKER SPEECH RECOGNITION USING SPEAKER EMBEDDINGS☆31Updated 2 years ago
- S3PRL-VC: A Voice Conversion Toolkit based on S3PRL☆101Updated last year
- ☆63Updated 2 years ago
- Expressive Anechoic Recordings of Speech (EARS)☆193Updated last year
- MOS score prediction by fine-tuned wav2vec2.0 model☆169Updated 3 years ago
- VoicePAT is a modular and efficient toolkit for voice privacy research, with main focus on speaker anonymization.☆51Updated last year
- A Python library for computing the Mel-Cepstral Distance (Mel-Cepstral Distortion, MCD) between two inputs. This implementation is based …☆57Updated 2 months ago
- Layer-wise analysis of self-supervised pre-trained speech representations☆117Updated last year
- UT-Sarulab MOS prediction system using SSL models☆274Updated last year
- A simple package for Guided source separation (GSS)☆128Updated last year
- ☆35Updated 2 years ago
- ☆57Updated 6 months ago
- High fidelity, lightweight, end-to-end, streaming, convolution-based neural audio codec☆110Updated 4 months ago
- Baseline Recipe for VoicePrivacy Challenge 2024: anonymization systems and evaluation software☆60Updated 8 months ago
- Implementation of the paper "Self-supervised Learning with Random-projection Quantizer for Speech Recognition" in Pytorch.☆87Updated 2 years ago
- A toolkit for any-to-any encoder-decoder voice conversion systems☆84Updated 2 years ago
- The official source code of UniAudio☆94Updated last year
- Official implementation of DNSMOS Pro (accepted at INTERSPEECH 2024).☆65Updated 4 months ago