DigitalPhonetics/VoicePAT

VoicePAT is a modular and efficient toolkit for voice privacy research, with a main focus on speaker anonymization.

☆59

Alternatives and similar repositories for VoicePAT

Users who are interested in VoicePAT are comparing it to the libraries listed below.

Voice-Privacy-Challenge / Voice-Privacy-Challenge-2024
Baseline Recipe for VoicePrivacy Challenge 2024: anonymization systems and evaluation software
☆63 • Jul 13, 2026 • Updated last week
DigitalPhonetics / speaker-anonymization
Speaker anonymization pipeline for hiding the identity of the speaker of a recording by changing the voice in it.
☆104 • Jul 4, 2025 • Updated last year
Voice-Privacy-Challenge / Voice-Privacy-Challenge-2022
Baseline Recipe for VoicePrivacy Challenge 2022: anonymization systems and evaluation software
☆69 • Jul 13, 2026 • Updated last week
deep-privacy / SA-toolkit
SA-toolkit: Speaker speech anonymization toolkit in Python
☆33 • Sep 18, 2025 • Updated 10 months ago
BakerBunker / SALT
[ASRU 2023] Code of paper SALT: Distinguishable Speaker Anonymization Through Latent Space Transformation
☆23 • Aug 13, 2024 • Updated last year
liuhuadai / ViT-TTS
PyTorch Implementation of ViT-TTS (EMNLP'23)
☆11 • Oct 20, 2023 • Updated 2 years ago
NTIA / alignnet
Train no-reference speech quality estimators with multiple datasets via learned, per-dataset alignments.
☆18 • Aug 1, 2025 • Updated 11 months ago
ashi-ta / speechGLUE
SpeechGLUE is a speech version of the GLUE benchmark, driven by text-to-speech.
☆13 • Jun 2, 2023 • Updated 3 years ago
PecholaL / MAIN-VC
Lightweight Speech Representation Learning for One-Shot Voice Conversion
☆23 • Dec 12, 2024 • Updated last year
Voice-Privacy-Challenge / Voice-Privacy-Challenge-2020
Baseline Recipe for VoicePrivacy Challenge 2020: https://www.voiceprivacychallenge.org/vp2020/docs/VoicePrivacy_2020_Eval_Plan_v1_3.pdf
☆63 • Jul 13, 2026 • Updated last week
Wataru-Nakata / ssl-vocoders
Implementation of vocoders empowered with PyTorch Lightning
☆18 • Jan 27, 2024 • Updated 2 years ago
Voice-Privacy-Challenge / Voice-Privacy-Challenge-2026
Baseline Recipe for VoicePrivacy Challenge 2026: anonymization systems and evaluation software
☆17 • Updated this week
ex3ndr / supervoice-librilight-preprocessed
60k hours of phoneme-aligned audio from audio books
☆19 • Jul 27, 2024 • Updated last year
rhoposit / icassp2021
☆15 • May 8, 2021 • Updated 5 years ago
tzuhsien / Voice-conversion-evaluation
An evaluation toolkit for voice conversion models.
☆42 • Jul 11, 2021 • Updated 5 years ago
0nutation / SLMTokBench
SLMTokBench for paper "SpeechTokenizer: Unified Speech Tokenizer for Speech Large Language Models"
☆37 • Aug 29, 2023 • Updated 2 years ago
nii-yamagishilab / VCC2020-database
☆53 • Dec 18, 2020 • Updated 5 years ago
uthree / tinyvc
a lightweight voice conversion
☆87 • Feb 25, 2026 • Updated 5 months ago
unilight / seq2seq-vc
A sequence-to-sequence voice conversion toolkit.
☆113 • Mar 15, 2026 • Updated 4 months ago
nii-yamagishilab / SSL-SAS
Language independent SSL-based Speaker Anonymization system
☆20 • May 28, 2024 • Updated 2 years ago
ljuvela / SourceFilterNeuralFormants
☆21 • Sep 20, 2024 • Updated last year
MingjieChen / EasyVC
A toolkit for any-to-any encoder-decoder voice conversion systems
☆83 • Aug 10, 2023 • Updated 2 years ago
gpu-poor / gramvaani_hindi_asr
This repo contains the baseline model recipes and pre-trained model for the GramVaani Hindi ASR challenge
☆16 • Mar 26, 2022 • Updated 4 years ago
hayeong0 / Diff-HierVC
Official PyTorch Implementation of "Diff-HierVC: Diffusion-based Hierarchical Voice Conversion with Robust Pitch Generation and Masked Pr…
☆237 • Jul 3, 2024 • Updated 2 years ago
soumimaiti / speechlmscore_tool
☆34 • Nov 24, 2024 • Updated last year
JusperLee / Look2hear
A toolkit for researchers in multimodal sound separation.
☆16 • Oct 20, 2023 • Updated 2 years ago
apple / pytorch-speech-features
☆87 • Apr 2, 2024 • Updated 2 years ago
hmohebbi / disentangling_representations
☆14 • Oct 3, 2025 • Updated 9 months ago
pguyot / zamia-speech
Open tools and data for cloudless automatic speech recognition
☆13 • Oct 1, 2019 • Updated 6 years ago
PecholaL / IDEAW
Robust Neural Audio Watermarking with Invertible Dual-Embedding
☆32 • Nov 11, 2024 • Updated last year
placebokkk / ctc-asr
PyTorch CTC implementation for ASR. Uses Eesen's FST decoder framework
☆10 • Feb 27, 2020 • Updated 6 years ago
yangdongchao / ALMTokenizer
The demo page for ALMTokenizer
☆59 • Apr 14, 2025 • Updated last year
sholokhovalexey / online-speaker-clustering
[ICASSP'23] Online speaker clustering
☆18 • Feb 22, 2026 • Updated 5 months ago
papercup-open-source / subscale-wavernn
Implementation of the subscale framework from the WaveRNN paper, building on top of Fatchord's WaveRNN repo
☆19 • Oct 8, 2020 • Updated 5 years ago
rishikksh20 / Phone-Level-Mixture-Density-Network-for-TTS
Rich Prosody Diversity Modelling with Phone-level Mixture Density Network
☆45 • Dec 1, 2021 • Updated 4 years ago
kosta-pmf / audio-watermarking
Implementations of different audio watermarking techniques
☆26 • Oct 17, 2022 • Updated 3 years ago
patrickvonplaten / audio-gen-dreambooth
☆23 • Jun 13, 2023 • Updated 3 years ago
Alizeded / BERP
[TASLP 2025] The PyTorch implementation of BERP: A Blind Estimator of Room Parameters
☆21 • Aug 16, 2025 • Updated 11 months ago
shivammehta25 / BetterFastSpeech2
Just another FastSpeech 2 but cleaner code :)
☆29 • Jun 28, 2024 • Updated 2 years ago