VoicePAT is a modular and efficient toolkit for voice privacy research, with a main focus on speaker anonymization.
☆57 · May 14, 2024 · Updated last year
Alternatives and similar repositories for VoicePAT
Users that are interested in VoicePAT are comparing it to the libraries listed below.
- Baseline Recipe for VoicePrivacy Challenge 2024: anonymization systems and evaluation software ☆62 · Feb 28, 2026 · Updated 3 weeks ago
- Speaker anonymization pipeline for hiding the identity of the speaker of a recording by changing the voice in it. ☆92 · Jul 4, 2025 · Updated 8 months ago
- Baseline Recipe for VoicePrivacy Challenge 2022: anonymization systems and evaluation software ☆69 · Oct 17, 2024 · Updated last year
- SA-toolkit: Speaker speech anonymization toolkit in Python ☆30 · Sep 18, 2025 · Updated 6 months ago
- Language independent SSL-based Speaker Anonymization system ☆19 · May 28, 2024 · Updated last year
- PyTorch Implementation of ViT-TTS (EMNLP'23) ☆11 · Oct 20, 2023 · Updated 2 years ago
- [ASRU 2023] Code of paper SALT: Distinguishable Speaker Anonymization Through Latent Space Transformation ☆21 · Aug 13, 2024 · Updated last year
- Train no-reference speech quality estimators with multiple datasets via learned, per-dataset alignments. ☆18 · Aug 1, 2025 · Updated 7 months ago
- SpeechGLUE is a speech version of the GLUE benchmark, driven by text-to-speech. ☆13 · Jun 2, 2023 · Updated 2 years ago
- Lightweight Speech Representation Learning for One-Shot Voice Conversion ☆24 · Dec 12, 2024 · Updated last year
- Baseline Recipe for VoicePrivacy Challenge 2020: https://www.voiceprivacychallenge.org/vp2020/docs/VoicePrivacy_2020_Eval_Plan_v1_3.pdf ☆64 · Jul 6, 2023 · Updated 2 years ago
- Implementation of vocoders empowered with PyTorch Lightning ☆18 · Jan 27, 2024 · Updated 2 years ago
- 60k hours of phoneme-aligned audio from audiobooks ☆19 · Jul 27, 2024 · Updated last year
- ☆15 · May 8, 2021 · Updated 4 years ago
- SLMTokBench for paper "SpeechTokenizer: Unified Speech Tokenizer for Speech Large Language Models" ☆37 · Aug 29, 2023 · Updated 2 years ago
- ☆20 · Sep 20, 2024 · Updated last year
- ☆53 · Dec 18, 2020 · Updated 5 years ago
- ☆14 · Oct 3, 2025 · Updated 5 months ago
- A lightweight voice conversion ☆86 · Feb 25, 2026 · Updated 3 weeks ago
- A sequence-to-sequence voice conversion toolkit. ☆108 · Mar 15, 2026 · Updated last week
- This repo contains the baseline model recipes and pre-trained model for GramVanni Hindi ASR challenge ☆15 · Mar 26, 2022 · Updated 3 years ago
- ☆32 · Nov 24, 2024 · Updated last year
- Official PyTorch Implementation of "Diff-HierVC: Diffusion-based Hierarchical Voice Conversion with Robust Pitch Generation and Masked Pr… ☆235 · Jul 3, 2024 · Updated last year
- A toolkit for any-to-any encoder-decoder voice conversion systems ☆84 · Aug 10, 2023 · Updated 2 years ago
- A toolkit for researchers in multimodal sound separation. ☆16 · Oct 20, 2023 · Updated 2 years ago
- ☆85 · Apr 2, 2024 · Updated last year
- Open tools and data for cloudless automatic speech recognition ☆11 · Oct 1, 2019 · Updated 6 years ago
- Robust Neural Audio Watermarking with Invertible Dual-Embedding ☆30 · Nov 11, 2024 · Updated last year
- [ICASSP'23] Online speaker clustering ☆17 · Feb 22, 2026 · Updated last month
- PyTorch CTC implementation for ASR. Uses eesen's FST decoder framework ☆10 · Feb 27, 2020 · Updated 6 years ago
- Rich Prosody Diversity Modelling with Phone-level Mixture Density Network ☆45 · Dec 1, 2021 · Updated 4 years ago
- Implementation of the subscale framework from the WaveRNN paper, building on top of Fatchord's WaveRNN repo ☆19 · Oct 8, 2020 · Updated 5 years ago
- MTalk-Bench: Evaluating Speech-to-Speech Models in Multi-Turn Dialogues via Arena-style and Rubrics Protocols ☆18 · Nov 19, 2025 · Updated 4 months ago
- [TASLP 2025] The PyTorch implementation of BERP: A Blind Estimator of Room Parameters ☆21 · Aug 16, 2025 · Updated 7 months ago
- ☆44 · Sep 19, 2024 · Updated last year
- Implementations of different audio watermarking techniques ☆25 · Oct 17, 2022 · Updated 3 years ago
- Generative Expressive Conversational Speech Synthesis (Accepted by MM'2024) ☆78 · Nov 1, 2024 · Updated last year
- ☆23 · Jun 13, 2023 · Updated 2 years ago
- My hybrid TTS network that combines VALL-E, VoiceBox, SpeechFlow, Seamless and TortoiseTTS into one ☆26 · Aug 5, 2024 · Updated last year