SA-toolkit: Speaker speech anonymization toolkit in python
☆30Sep 18, 2025Updated 5 months ago
Alternatives and similar repositories for SA-toolkit
Users that are interested in SA-toolkit are comparing it to the libraries listed below
Sorting:
- Language independent SSL-based Speaker Anonymization system☆19May 28, 2024Updated last year
- Baseline Recipe for VoicePrivacy Challenge 2024: anonymization systems and evaluation software☆61Updated this week
- Speaker anonymization pipeline for hiding the identity of the speaker of a recording by changing the voice in it.☆92Jul 4, 2025Updated 8 months ago
- Baseline Recipe for VoicePrivacy Challenge 2022: anonymization systems and evaluation software☆68Oct 17, 2024Updated last year
- An implementation of Neural Style Transfer for Audio using Pytorch.☆10Dec 14, 2017Updated 8 years ago
- Project for HIDING SPEAKER’S SEX IN SPEECH USING ZERO-EVIDENCE SPEAKER REPRESENTATION IN AN ANALYSIS/SYNTHESIS PIPELINE☆15Nov 30, 2022Updated 3 years ago
- VoicePAT is a modular and efficient toolkit for voice privacy research, with main focus on speaker anonymization.☆57May 14, 2024Updated last year
- A unified framework for Low-resource Audio Processing and Evaluation (SSL Pre-training and Downstream Fine-tuning)☆29Jul 9, 2024Updated last year
- Code for ICASSP 2024 Paper: RECAP: Retrieval-Augmented Audio Captioning☆16Jun 23, 2024Updated last year
- The official implementation of the method discussed in the paper Improving Spoken Language Identification with Map-Mix(work accepted at I…☆18Feb 17, 2023Updated 3 years ago
- This repository contains a short introduction on the topic of audio and speech processing -- from basics to applications.☆21Dec 20, 2023Updated 2 years ago
- Wav2vec 2.0 Self-Supervised Pretraining☆58Feb 6, 2025Updated last year
- ☆24Dec 20, 2022Updated 3 years ago
- MichiAI: A Low Latency, Full Duplex Speech LLM with zero coherence loss☆82Feb 6, 2026Updated 3 weeks ago
- Official repository for "Speaking Style Conversion With Discrete Self-Supervised Units" (EMNLP 2023). https://arxiv.org/abs/2212.09730☆131Dec 8, 2023Updated 2 years ago
- ☆31Dec 2, 2020Updated 5 years ago
- The implementation of paper "SpeechTripleNet: End-to-End Disentangled Speech Representation Learning for Content, Timbre and Prosody"☆34Nov 23, 2023Updated 2 years ago
- [Interspeech 2025] Official implementation of "Training-Free Voice Conversion with Factorized Optimal Transport"☆43Sep 24, 2025Updated 5 months ago
- ☆16Jun 12, 2025Updated 8 months ago
- Term Project at GTCMT exploring phase based features for Singing Voice Detection with Neural Networks☆11Apr 20, 2018Updated 7 years ago
- Self-Supervised Speech/Sound Pre-training and Representation Learning Toolkit☆13Nov 18, 2022Updated 3 years ago
- ☆11Aug 20, 2025Updated 6 months ago
- Repo for papers to read on adversarial attack and defense techniques in the audio domain.☆41Dec 6, 2020Updated 5 years ago
- PyTorch implementation of the ICASSP-24 paper: "Improving Audio Captioning Models with Fine-grained Audio Features, Text Embedding Superv…☆38Jan 6, 2024Updated 2 years ago
- Temporal summarization framework☆10Dec 4, 2023Updated 2 years ago
- Source code and demo for INTERSPEECH 2024 paper: Noise-robust Speech Separation with Fast Generative Correction☆46Nov 19, 2024Updated last year
- Implementation of Emo-StarGAN☆46Dec 19, 2023Updated 2 years ago
- ☆163Sep 19, 2022Updated 3 years ago
- ☆16Sep 29, 2025Updated 5 months ago
- ☆10Feb 22, 2016Updated 10 years ago
- Spectral and other frequency-based calculation objects developed by Tristan Murail☆10May 6, 2022Updated 3 years ago
- BEGANSing - Korean SVS + SVC + AudioSR☆11Feb 17, 2024Updated 2 years ago
- ☆13Oct 3, 2025Updated 5 months ago
- [ICASSP 2025] Official implementation of "ViolinDiff: Enhancing Expressive Violin Synthesis with Pitch Bend Conditioning".☆15Feb 2, 2025Updated last year
- One command to start a streaming ASR server.☆12Oct 2, 2024Updated last year
- Workshop material: Composing 3D music using SuperCollider and ambisonics☆11Oct 1, 2022Updated 3 years ago
- Voevodsky's notes on type systems. This version contains more material than the one on his website.☆11Jun 15, 2014Updated 11 years ago
- A real-time voice conversion model based on VITS.☆14Aug 1, 2024Updated last year
- This repository contains the code for the paper "Exploiting Foundation Models and Speech Enhancement for Parkinson's Disease Detection fr…☆11Dec 19, 2025Updated 2 months ago