VoicePAT is a modular and efficient toolkit for voice privacy research, with a main focus on speaker anonymization.
☆59 · May 14, 2024 · Updated 2 years ago
Alternatives and similar repositories for VoicePAT
Users that are interested in VoicePAT are comparing it to the libraries listed below.
- Baseline Recipe for VoicePrivacy Challenge 2024: anonymization systems and evaluation software ☆62 · Feb 28, 2026 · Updated 2 months ago
- Speaker anonymization pipeline for hiding the identity of the speaker of a recording by changing the voice in it. ☆99 · Jul 4, 2025 · Updated 10 months ago
- Baseline Recipe for VoicePrivacy Challenge 2022: anonymization systems and evaluation software ☆69 · Oct 17, 2024 · Updated last year
- SA-toolkit: Speaker speech anonymization toolkit in python ☆33 · Sep 18, 2025 · Updated 8 months ago
- Language independent SSL-based Speaker Anonymization system ☆20 · May 28, 2024 · Updated last year
- PyTorch Implementation of ViT-TTS (EMNLP'23) ☆11 · Oct 20, 2023 · Updated 2 years ago
- [ASRU 2023] Code of paper SALT: Distinguishable Speaker Anonymization Through Latent Space Transformation ☆22 · Aug 13, 2024 · Updated last year
- Train no-reference speech quality estimators with multiple datasets via learned, per-dataset alignments. ☆18 · Aug 1, 2025 · Updated 9 months ago
- SpeechGLUE is a speech version of the GLUE benchmark, driven by text-to-speech. ☆13 · Jun 2, 2023 · Updated 2 years ago
- Lightweight Speech Representation Learning for One-Shot Voice Conversion ☆24 · Dec 12, 2024 · Updated last year
- Baseline Recipe for VoicePrivacy Challenge 2020: https://www.voiceprivacychallenge.org/vp2020/docs/VoicePrivacy_2020_Eval_Plan_v1_3.pdf ☆63 · Jul 6, 2023 · Updated 2 years ago
- Implementation of vocoders empowered with pytorch lightning ☆18 · Jan 27, 2024 · Updated 2 years ago
- 60k hours of phoneme-aligned audio from audio books ☆19 · Jul 27, 2024 · Updated last year
- ☆15 · May 8, 2021 · Updated 5 years ago
- SLMTokBench for paper "SpeechTokenizer: Unified Speech Tokenizer for Speech Large Language Models" ☆37 · Aug 29, 2023 · Updated 2 years ago
- ☆20 · Sep 20, 2024 · Updated last year
- ☆14 · Oct 3, 2025 · Updated 7 months ago
- ☆53 · Dec 18, 2020 · Updated 5 years ago
- a lightweight voice conversion ☆86 · Feb 25, 2026 · Updated 3 months ago
- A sequence-to-sequence voice conversion toolkit. ☆113 · Mar 15, 2026 · Updated 2 months ago
- This repo contains the baseline model recipes and pre-trained model for GramVanni hindi ASR challenge ☆16 · Mar 26, 2022 · Updated 4 years ago
- ☆32 · Nov 24, 2024 · Updated last year
- Official Pytorch Implementation of "Diff-HierVC: Diffusion-based Hierarchical Voice Conversion with Robust Pitch Generation and Masked Pr… ☆238 · Jul 3, 2024 · Updated last year
- A toolkit for any-to-any encoder-decoder voice conversion systems ☆84 · Aug 10, 2023 · Updated 2 years ago
- A toolkit for researchers in the multimodal sound separation. ☆16 · Oct 20, 2023 · Updated 2 years ago
- ☆86 · Apr 2, 2024 · Updated 2 years ago
- Open tools and data for cloudless automatic speech recognition ☆12 · Oct 1, 2019 · Updated 6 years ago
- Robust Neural Audio Watermarking with Invertible Dual-Embedding ☆32 · Nov 11, 2024 · Updated last year
- [ICASSP'23] Online speaker clustering ☆18 · Feb 22, 2026 · Updated 3 months ago
- pytorch CTC implementation for ASR. Use eesen's fst decoder framework ☆10 · Feb 27, 2020 · Updated 6 years ago
- Rich Prosody Diversity Modelling with Phone-level Mixture Density Network ☆45 · Dec 1, 2021 · Updated 4 years ago
- Implementation of the subscale framework from the WaveRNN paper, building on top of Fatchord's WaveRNN repo ☆19 · Oct 8, 2020 · Updated 5 years ago
- MTalk-Bench: Evaluating Speech-to-Speech Models in Multi-Turn Dialogues via Arena-style and Rubrics Protocols ☆19 · Nov 19, 2025 · Updated 6 months ago
- [TASLP 2025] The pytorch implementation of BERP: A Blind Estimator of Room Parameters ☆21 · Aug 16, 2025 · Updated 9 months ago
- Implementations of different audio watermarking techniques ☆26 · Oct 17, 2022 · Updated 3 years ago
- ☆44 · Sep 19, 2024 · Updated last year
- Generative Expressive Conversational Speech Synthesis (Accepted by MM'2024) ☆79 · Nov 1, 2024 · Updated last year
- ☆23 · Jun 13, 2023 · Updated 2 years ago
- My hybrid TTS network that combines, VALL-E, VoiceBox, SpeechFlow, Seamless and TortoiseTTS into one ☆26 · Aug 5, 2024 · Updated last year