nii-yamagishilab / SSL-SASLinks
Language independent SSL-based Speaker Anonymization system
☆19Updated last year
Alternatives and similar repositories for SSL-SAS
Users that are interested in SSL-SAS are comparing it to the libraries listed below
Sorting:
- SA-toolkit: Speaker speech anonymization toolkit in python☆28Updated 3 months ago
- Baseline Recipe for VoicePrivacy Challenge 2024: anonymization systems and evaluation software☆61Updated 10 months ago
- ☆32Updated last year
- (ICASSP 2025) Learning Source Disentanglement in Neural Audio Codec☆44Updated 7 months ago
- Prediction of sound event bounding boxes (SEBBs)☆31Updated last year
- Audio Research in US. US-based professors who work on audio (music, speech, acoustics). For students who would like to apply for RA, PhD,…☆27Updated last month
- Official implementation of DNSMOS Pro (accepted at INTERSPEECH 2024).☆73Updated 6 months ago
- VoicePAT is a modular and efficient toolkit for voice privacy research, with main focus on speaker anonymization.☆53Updated last year
- Objective metrics used in several text-to-speech (TTS) papers.☆51Updated 6 months ago
- ☆35Updated 2 years ago
- TS-SEP: Joint Diarization and Separation Conditioned on Estimated Speaker Embeddings☆36Updated 2 months ago
- ☆59Updated 2 months ago
- Qualtric or Qualtreat? Generate Qualtrics listening tests for Text-To-Speech evaluations.☆36Updated last year
- ☆45Updated 2 years ago
- ☆26Updated last year
- An attention-based backend allowing efficient fine-tuning of transformer models for speaker verification☆23Updated last year
- ☆61Updated last year
- HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis☆44Updated 4 years ago
- A pytorch implementation of MBNET: MOS PREDICTION FOR SYNTHESIZED SPEECH WITH MEAN-BIAS NETWORK☆61Updated 4 years ago
- Source code and demo for INTERPSEECH 2023 paper: DuTa-VC: A Duration-aware Typical-to-atypical Voice Conversion Approach with Diffusion P…☆37Updated 2 years ago
- This is the official train-dev-test release of the Interspeech2024 Discrete Speech Representation Challenge.☆32Updated last year
- ICASSP 2024 - Generative De-Quantization for Neural Speech Codec via Latent Diffusion.☆55Updated last month
- Unofficial implementation of NANSY++ in Pytorch Lightning☆50Updated last year
- A CSRankings-like index for speech researchers☆35Updated last year
- An unofficial PyTorch implementation of Mix-Phoneme-Bert☆40Updated 2 years ago
- Generation scripts for EARS-WHAM and EARS-Reverb☆41Updated 5 months ago
- ☆54Updated 2 years ago
- A repo containing download guidance and corresponding scripts of the VoxBlink dataset.☆28Updated last year
- Typing to Listen at the Cocktail Party: Text-Guided Target Speaker Extraction (LLM-TSE)☆42Updated 2 years ago
- SLT 2024 Mandarin Stuttering Event Detection and Automatic Speech Recognition Challenge☆12Updated last year