Voice-Privacy-Challenge / Voice-Privacy-Challenge-2022View external linksLinks
Baseline Recipe for VoicePrivacy Challenge 2022: anonymization systems and evaluation software
☆68Oct 17, 2024Updated last year
Alternatives and similar repositories for Voice-Privacy-Challenge-2022
Users that are interested in Voice-Privacy-Challenge-2022 are comparing it to the libraries listed below
Sorting:
- VoicePAT is a modular and efficient toolkit for voice privacy research, with main focus on speaker anonymization.☆56May 14, 2024Updated last year
- Privacy-preserving Voice Analysis via Disentangled Representations☆11Aug 30, 2021Updated 4 years ago
- SA-toolkit: Speaker speech anonymization toolkit in python☆30Sep 18, 2025Updated 4 months ago
- Baseline Recipe for VoicePrivacy Challenge 2024: anonymization systems and evaluation software☆61Jan 30, 2025Updated last year
- Project for HIDING SPEAKER’S SEX IN SPEECH USING ZERO-EVIDENCE SPEAKER REPRESENTATION IN AN ANALYSIS/SYNTHESIS PIPELINE☆15Nov 30, 2022Updated 3 years ago
- Unsupervised speech activity detection system.☆11Jul 2, 2018Updated 7 years ago
- Baseline Recipe for VoicePrivacy Challenge 2020: https://www.voiceprivacychallenge.org/vp2020/docs/VoicePrivacy_2020_Eval_Plan_v1_3.pdf☆64Jul 6, 2023Updated 2 years ago
- [ASRU 2023] Code of paper SALT: Distinguishable Speaker Anonymization Through Latent Space Transformation☆21Aug 13, 2024Updated last year
- **ICASSP 2022** 《Toward Degradation-Robust Voice Conversion》Using speech enhancement and end-to-end denoising training to improve degrada…☆24Sep 27, 2022Updated 3 years ago
- Repo of the paper "Towards Building an End-to-End Multilingual Automatic Lyrics Transcription Model""☆14Jun 28, 2024Updated last year
- Please visit https://thuhcsi.github.io/SnakeGAN/☆37Apr 25, 2023Updated 2 years ago
- Implementation of the paper "Can Large Language Models Predict Audio Effects Parameters from Natural Language?"☆26May 27, 2025Updated 8 months ago
- A Python library for computing the Mel-Cepstral Distance (Mel-Cepstral Distortion, MCD) between two inputs. This implementation is based …☆64Aug 24, 2025Updated 5 months ago
- Speaker anonymization pipeline for hiding the identity of the speaker of a recording by changing the voice in it.☆91Jul 4, 2025Updated 7 months ago
- ☆24Dec 20, 2022Updated 3 years ago
- Code and data repository for paper "VoxCeleb enrichment for Age and Gender recognition" submitted at ASRU 2021☆71Dec 18, 2021Updated 4 years ago
- ☆28Dec 14, 2021Updated 4 years ago
- Development Toolkit for the VoxCeleb Speaker Recognition Challenge 2020☆43Jul 17, 2020Updated 5 years ago
- Collect Voice Conversion researches☆96Updated this week
- Python implementation of a few speech intelligibility prediction algorithms☆15May 29, 2024Updated last year
- Analysis and investigating the confounding effect of accents in end-to-end Automatic Speech Recognition models.☆15Jun 27, 2020Updated 5 years ago
- A toolkit for non-parallel voice conversion based on vector-quantized variational autoencoder☆171Jul 25, 2024Updated last year
- Upsampling Artifacts in Neural Audio Synthesis – https://arxiv.org/abs/2010.14356☆82Feb 9, 2021Updated 5 years ago
- Voice conversion training with 109 speakers with limited training samples☆35Dec 21, 2020Updated 5 years ago
- Speech enhancement system for the CHiME-5 dinner party scenario☆109Feb 6, 2025Updated last year
- UT-Sarulab MOS prediction system using SSL models☆294Apr 11, 2024Updated last year
- Evaluation and Benchmarking of Speech Super-resolution Methods☆153Jun 17, 2022Updated 3 years ago
- Audio-JEPA is an adaptation of the Joint-Embedding Predictive Architecture (JEPA) for self-supervised audio representation learning. Buil…☆40Jun 17, 2025Updated 7 months ago
- DSing ASR task: Resources and Baseline for an unaccompanied singing ASR.☆19Nov 23, 2021Updated 4 years ago
- Official implementation of the paper "SPEAKER VGG CCT: Cross-corpus Speech Emotion Recognition with Speaker Embedding and Vision Transfor…☆24Feb 17, 2023Updated 2 years ago
- Overlapped Speech detection in Multi-party Conversations☆22Feb 20, 2018Updated 7 years ago
- ☆23Jan 6, 2023Updated 3 years ago
- Corpus of oral arguments (recorded speech + official transcripts) of the United States Supreme Court☆22Dec 8, 2022Updated 3 years ago
- SChunk-Encoder (Transformer or Conformer) for streaming E2E ASR☆11Oct 21, 2022Updated 3 years ago
- ☆10Jul 24, 2019Updated 6 years ago
- Deep speaker embeddings in PyTorch, including x-vectors. Code used in this work: https://arxiv.org/abs/2007.16196☆320Nov 11, 2020Updated 5 years ago
- This is the GitHub page for publicly available emotional speech data.☆381Jan 6, 2022Updated 4 years ago
- Joint CTC-S2S Phoneme-level ASR for Voice Conversion and TTS (Text-Mel Alignment)☆124Jun 16, 2022Updated 3 years ago
- Evaluation script for VoxMovies dataset in PyTorch☆23Jan 12, 2024Updated 2 years ago