Language independent SSL-based Speaker Anonymization system
☆19 · May 28, 2024 · Updated last year
Alternatives and similar repositories for SSL-SAS
Users that are interested in SSL-SAS are comparing it to the libraries listed below.
- Project for HIDING SPEAKER’S SEX IN SPEECH USING ZERO-EVIDENCE SPEAKER REPRESENTATION IN AN ANALYSIS/SYNTHESIS PIPELINE ☆15 · Nov 30, 2022 · Updated 3 years ago
- Speaker anonymization pipeline for hiding the identity of the speaker of a recording by changing the voice in it. ☆98 · Jul 4, 2025 · Updated 9 months ago
- VoicePAT is a modular and efficient toolkit for voice privacy research, with main focus on speaker anonymization. ☆58 · May 14, 2024 · Updated last year
- Self-Supervised Speech/Sound Pre-training and Representation Learning Toolkit ☆13 · Nov 18, 2022 · Updated 3 years ago
- Lightweight speaker anonymization [IEEE SLT2021] ☆27 · Jun 6, 2022 · Updated 3 years ago
- This repository includes the code to reproduce our paper [Explainable deepfake and spoofing detection: an attack analysis using SHapley A… ☆12 · Jan 24, 2024 · Updated 2 years ago
- ☆16 · Feb 19, 2026 · Updated last month
- SA-toolkit: Speaker speech anonymization toolkit in python ☆32 · Sep 18, 2025 · Updated 7 months ago
- An implementation of Neural Style Transfer for Audio using Pytorch. ☆11 · Dec 14, 2017 · Updated 8 years ago
- Dialog Acts SEGmentation: Tools for dialog act research ☆14 · Mar 21, 2025 · Updated last year
- Textless Speech-to-Music Retrieval Using Emotion Similarity [ICASSP23] ☆17 · Aug 16, 2023 · Updated 2 years ago
- A Chinese Expressive Long-dialogue Speech Dataset with Scripts ☆21 · Nov 11, 2024 · Updated last year
- My personal website ☆11 · Dec 22, 2024 · Updated last year
- The code of the paper "Minimizing the Accumulated Trajectory Error to Improve Dataset Distillation" (CVPR2023) ☆18 · Mar 21, 2023 · Updated 3 years ago
- ☆46 · Apr 16, 2023 · Updated 3 years ago
- [ASRU 2023] Code of paper SALT: Distinguishable Speaker Anonymization Through Latent Space Transformation ☆21 · Aug 13, 2024 · Updated last year
- Download AudioSet for Vision-Audio-Text Pre-training ☆13 · May 16, 2022 · Updated 3 years ago
- Simple LPC vocoder in Python ☆13 · Jan 7, 2022 · Updated 4 years ago
- Privacy-preserving Voice Analysis via Disentangled Representations ☆11 · Aug 30, 2021 · Updated 4 years ago
- Robust Reinforcement Learning Benchmark ☆12 · Sep 22, 2024 · Updated last year
- [ICLR 2024] DMBP: Diffusion Model-Based Predictor for Robust Offline Reinforcement Learning against State Observations Perturbations. ☆17 · May 24, 2024 · Updated last year
- KDSS is the framework for knowledge distillation from LLMs ☆12 · Nov 5, 2025 · Updated 5 months ago
- [EMNLP 2025 Findings] A complete cross-modal RAG system for end-to-end speech-to-speech large models, including ASR-based Retrieval and E… ☆31 · Jul 11, 2025 · Updated 9 months ago
- Transformer-based visually grounded speech models ☆19 · Sep 22, 2022 · Updated 3 years ago
- Code for paper titled "Perception of prosodic variation for speech synthesis using an unsupervised discrete representation of F0" submitt… ☆17 · May 24, 2020 · Updated 5 years ago
- ☆19 · Apr 28, 2023 · Updated 2 years ago
- ☆20 · Jun 5, 2022 · Updated 3 years ago
- ☆10 · Oct 25, 2019 · Updated 6 years ago
- Synthesized singing voice demos of WeSinger 2 paper. ☆26 · Feb 20, 2023 · Updated 3 years ago
- ☆11 · May 23, 2023 · Updated 2 years ago
- ICASSP 2021 accepted papers in terms of voice conversion (VC) ☆18 · Apr 11, 2021 · Updated 5 years ago
- Optimizing speaker verification and spoofing countermeasure systems together with REINFORCE ☆13 · Mar 31, 2021 · Updated 5 years ago
- A robust pitch tracker using synchro-squeezed fft and frequency domain autocorrelation ☆36 · Jan 17, 2024 · Updated 2 years ago
- We introduce a way to extend sparse dictionary learning to deep architectures. ☆17 · Jan 13, 2022 · Updated 4 years ago
- A fundamental frequency estimation algorithm using features from the magnitude and phase spectrogram. ☆24 · Mar 29, 2021 · Updated 5 years ago
- PyTorch Implementation of Daft-Exprt: Robust Prosody Transfer Across Speakers for Expressive Speech Synthesis ☆55 · Oct 15, 2021 · Updated 4 years ago
- Code for the paper "Getting the most out of your tokenizer for pre-training and domain adaptation" ☆22 · Feb 14, 2024 · Updated 2 years ago
- ☆26 · Sep 22, 2022 · Updated 3 years ago
- Whisper combined with Silero VAD, for improved long-form transcriptions ☆54 · Dec 11, 2022 · Updated 3 years ago