Voice-Privacy-Challenge / Voice-Privacy-Challenge-2024
Baseline Recipe for VoicePrivacy Challenge 2024: anonymization systems and evaluation software
☆52 Updated 2 months ago
Alternatives and similar repositories for Voice-Privacy-Challenge-2024:
Users who are interested in Voice-Privacy-Challenge-2024 are comparing it to the repositories listed below.
- VoicePAT is a modular and efficient toolkit for voice privacy research, with a main focus on speaker anonymization. ☆51 Updated 11 months ago
- This repository includes the code to reproduce our paper "RawBoost: A Raw Data Boosting and Augmentation Method applied to Automatic Spea… ☆60 Updated last year
- Reference-aware automatic speech evaluation toolkit ☆153 Updated 4 months ago
- ☆48 Updated 7 months ago
- High-Fidelity Neural Phonetic Posteriorgrams ☆109 Updated 2 months ago
- Code for vec2wav 2.0, a speech token vocoder for VC. Paper: https://arxiv.org/abs/2409.01995 ☆76 Updated 4 months ago
- ☆55 Updated 10 months ago
- Official data preparation scripts for the URGENT 2024 Challenge ☆76 Updated 3 months ago
- A repo containing download guidance and corresponding scripts for the VoxBlink dataset. ☆26 Updated last year
- Speech Human Evaluation Estimation Toolkit (SHEET) ☆65 Updated 5 months ago
- A Python library for computing the Mel-Cepstral Distance (Mel-Cepstral Distortion, MCD) between two inputs. This implementation is based … ☆51 Updated last week
- A PyTorch implementation of the paper "ANSD-MA-MSE: Adaptive Neural Speaker Diarization Using Memory-Aware Multi-Speaker Embedding" ☆56 Updated 7 months ago
- ☆51 Updated 5 months ago
- ☆43 Updated 2 years ago
- ☆33 Updated 4 years ago
- Official implementation of DNSMOS Pro (accepted at INTERSPEECH 2024). ☆31 Updated last month
- SA-toolkit: Speaker speech anonymization toolkit in Python ☆23 Updated last month
- ☆96 Updated last year
- [AAAI 2024] Code for CTX-vec2wav in UniCATS ☆129 Updated 10 months ago
- Used to evaluate a WaveNet vocoder by RMSE F0, MCD, RMSE AP... ☆15 Updated 5 years ago
- An attention-based backend allowing efficient fine-tuning of transformer models for speaker verification ☆20 Updated 7 months ago
- MANNER: Multi-view Attention Network for Noise ERasure (Speech enhancement in time-domain) ☆60 Updated 2 years ago
- Official implementation for Fast-HuBERT: An Efficient Training Framework for Self-Supervised Speech Representation Learning ☆86 Updated 5 months ago
- BAE-NET: A LOW COMPLEXITY AND HIGH FIDELITY BANDWIDTH-ADAPTIVE NEURAL NETWORK FOR SPEECH SUPER-RESOLUTION ☆67 Updated 8 months ago
- Clustering-based methods for overlapping diarization ☆81 Updated last year
- UTokyo-SaruLab MOS Prediction System ☆173 Updated 3 weeks ago
- ☆80 Updated 8 months ago
- Layer-wise analysis of self-supervised pre-trained speech representations ☆101 Updated 6 months ago
- Advances in audio anti-spoofing and deepfake detection using graph neural networks and self-supervised learning ☆22 Updated last year
- A PyTorch implementation of MBNET: MOS PREDICTION FOR SYNTHESIZED SPEECH WITH MEAN-BIAS NETWORK ☆60 Updated 3 years ago