VoicePAT is a modular and efficient toolkit for voice privacy research, with main focus on speaker anonymization.
☆58May 14, 2024Updated last year
Alternatives and similar repositories for VoicePAT
Users that are interested in VoicePAT are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Baseline Recipe for VoicePrivacy Challenge 2024: anonymization systems and evaluation software☆62Feb 28, 2026Updated last month
- Speaker anonymization pipeline for hiding the identity of the speaker of a recording by changing the voice in it.☆98Jul 4, 2025Updated 9 months ago
- Baseline Recipe for VoicePrivacy Challenge 2022: anonymization systems and evaluation software☆69Oct 17, 2024Updated last year
- SA-toolkit: Speaker speech anonymization toolkit in python☆32Sep 18, 2025Updated 6 months ago
- Language independent SSL-based Speaker Anonymization system☆19May 28, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- PyTorch Implementation of ViT-TTS (EMNLP'23)☆11Oct 20, 2023Updated 2 years ago
- [ASRU 2023] Code of paper SALT: Distinguishable Speaker Anonymization Through Latent Space Transformation☆21Aug 13, 2024Updated last year
- Train no-reference speech quality estimators with multiple datasets via learned, per-dataset alignments.☆18Aug 1, 2025Updated 8 months ago
- SpeechGLUE is a speech version of the GLUE benchmark, driven by text-to-speech.☆13Jun 2, 2023Updated 2 years ago
- Lightweight Speech Representation Learning for One-Shot Voice Conversion☆24Dec 12, 2024Updated last year
- Baseline Recipe for VoicePrivacy Challenge 2020: https://www.voiceprivacychallenge.org/vp2020/docs/VoicePrivacy_2020_Eval_Plan_v1_3.pdf☆64Jul 6, 2023Updated 2 years ago
- Implementation of vocoders empowered with pytorch lightning☆18Jan 27, 2024Updated 2 years ago
- 60k hours of phoneme-aligned audio from audio books☆19Jul 27, 2024Updated last year
- ☆15May 8, 2021Updated 4 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- SLMTokBench for paper "SpeechTokenizer: Unified Speech Tokenizer for Speech Large Language Models"☆37Aug 29, 2023Updated 2 years ago
- ☆20Sep 20, 2024Updated last year
- ☆14Oct 3, 2025Updated 6 months ago
- ☆53Dec 18, 2020Updated 5 years ago
- a lightweight voice conversion☆86Feb 25, 2026Updated last month
- A sequence-to-sequence voice conversion toolkit.☆110Mar 15, 2026Updated 3 weeks ago
- This repo contains the baseline model recipes and pre-trained model for GramVanni hindi ASR challenge☆15Mar 26, 2022Updated 4 years ago
- ☆32Nov 24, 2024Updated last year
- Official Pytorch Implementation of "Diff-HierVC: Diffusion-based Hierarchical Voice Conversion with Robust Pitch Generation and Masked Pr…☆235Jul 3, 2024Updated last year
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- A toolkit for any-to-any encoder-decoder voice conversion systems☆84Aug 10, 2023Updated 2 years ago
- A toolkit for researchers in the multimodal sound separation.☆16Oct 20, 2023Updated 2 years ago
- ☆85Apr 2, 2024Updated 2 years ago
- Open tools and data for cloudless automatic speech recognition☆11Oct 1, 2019Updated 6 years ago
- Robust Neural Audio Watermarking with Invertible Dual-Embedding☆31Nov 11, 2024Updated last year
- [ICASSP'23] Online speaker clustering☆17Feb 22, 2026Updated last month
- pytorch CTC implementation for ASR. Use eesen's fst decoder framework☆10Feb 27, 2020Updated 6 years ago
- Rich Prosody Diversity Modelling with Phone-level Mixture Density Network☆45Dec 1, 2021Updated 4 years ago
- Implementation of the subscale framework from the WaveRNN paper, building on top of Fatchord's WaveRNN repo☆19Oct 8, 2020Updated 5 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- MTalk-Bench: Evaluating Speech-to-Speech Models in Multi-Turn Dialogues via Arena-style and Rubrics Protocols☆18Nov 19, 2025Updated 4 months ago
- [TASLP 2025] The pytorch implementation of BERP: A Blind Estimator of Room Parameters☆21Aug 16, 2025Updated 7 months ago
- Implementations of different audio watermarking techniques☆25Oct 17, 2022Updated 3 years ago
- ☆44Sep 19, 2024Updated last year
- Generative Expressive Conversational Speech Synthesis (Accepted by MM'2024)☆78Nov 1, 2024Updated last year
- ☆23Jun 13, 2023Updated 2 years ago
- My hybrid TTS network that combines, VALL-E, VoiceBox, SpeechFlow, Seamless and TortoiseTTS into one☆26Aug 5, 2024Updated last year