Diffusion Model for Voice Conversion
☆69Mar 14, 2024Updated 2 years ago
Alternatives and similar repositories for Diff-VC
Users that are interested in Diff-VC are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Source code and demo for INTERPSEECH 2023 paper: DuTa-VC: A Duration-aware Typical-to-atypical Voice Conversion Approach with Diffusion P…☆37Dec 5, 2023Updated 2 years ago
- Official Pytorch Implementation of "Diff-HierVC: Diffusion-based Hierarchical Voice Conversion with Robust Pitch Generation and Masked Pr…☆235Jul 3, 2024Updated last year
- This is the implementation of the paper "Emotion Intensity and its Control for Emotional Voice Conversion".☆95Feb 9, 2022Updated 4 years ago
- This is the implementation of our Interspeech 2021 paper: Limited data emotional voice conversion leveraging text-to-speech: two-stage se…☆87Dec 31, 2022Updated 3 years ago
- My personal implementation of SVTR model for handwritten OCR☆15Mar 1, 2024Updated 2 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- A curated list of awesome voice conversion, projects and communities.☆263Nov 18, 2025Updated 4 months ago
- ☆12Nov 7, 2024Updated last year
- This is the main repository of open-sourced speech technology by Huawei Noah's Ark Lab.☆602Sep 18, 2023Updated 2 years ago
- ☆17Mar 20, 2026Updated last week
- An unofficial implementation of the paper "One-shot Voice Conversion by Separating Speaker and Content Representations with Instance Norm…☆117May 27, 2021Updated 4 years ago
- Unsupervised Rhythm Modeling for Voice Conversion☆86Aug 3, 2023Updated 2 years ago
- pytorch implementation of "Emotional Voice Conversion using Multitask Learning with Text-to-Speech", Accepted to ICASSP 2020☆30Jul 6, 2023Updated 2 years ago
- ☆23Dec 14, 2021Updated 4 years ago
- Repo of the paper "Towards Building an End-to-End Multilingual Automatic Lyrics Transcription Model""☆15Jun 28, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- ☆55Jul 16, 2025Updated 8 months ago
- Speech Representation Disentanglement with Adversarial Mutual Information Learning for One-shot Voice Conversion (Interspeech 2022)☆119Feb 7, 2024Updated 2 years ago
- Text-to-dysarthric speech (TTDS) synthesis. An implementation using the Grad-TTS model with the TORGO database.☆12Mar 15, 2025Updated last year
- [INTERSPEECH 2024] Official code for VoxSim: A perceptual voice similarity dataset☆12Sep 29, 2025Updated 5 months ago
- S3PRL-VC: A Voice Conversion Toolkit based on S3PRL☆101Mar 15, 2026Updated last week
- Using joint training speaker encoder with consistency loss to achieve cross-lingual voice conversion and expressive voice conversion☆155Oct 16, 2023Updated 2 years ago
- A sequence-to-sequence voice conversion toolkit.☆108Mar 15, 2026Updated last week
- Official Implementation of StyleTTS-VC☆198Jan 14, 2025Updated last year
- The open source code for LLM-Codec☆145Aug 18, 2024Updated last year
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- Official implementation of VQMIVC: One-shot (any-to-any) Voice Conversion @ Interspeech 2021 + Online playing demo!☆360Apr 27, 2022Updated 3 years ago
- An official implementation of Style-Talker for Spoken Dialogue Generation☆23Jan 12, 2025Updated last year
- [ICASSP 2024] This is the official code for "VoiceFlow: Efficient Text-to-Speech with Rectified Flow Matching"☆372Sep 3, 2024Updated last year
- ☆82Jan 22, 2025Updated last year
- ☆40Jan 24, 2023Updated 3 years ago
- Codebase and project page for EDMSound☆35Nov 20, 2023Updated 2 years ago
- This is the implementation our Interspeech 2022 paper " Disentanglement of Emotional Style and Speaker Identity for Expressive Voice Conv…☆21Sep 18, 2023Updated 2 years ago
- Implementation of Emo-StarGAN☆46Dec 19, 2023Updated 2 years ago
- Attention-Enhanced Short-Time Wiener Solution for Acoustic Echo Cancellation☆26Nov 12, 2025Updated 4 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- Official implementation of SpeechSplit2☆135Oct 22, 2022Updated 3 years ago
- Lightweight Speech Representation Learning for One-Shot Voice Conversion☆24Dec 12, 2024Updated last year
- [ICASSP 2025] FreeSVC: Towards Zero-shot Multilingual Singing Voice Conversion☆91Jul 23, 2025Updated 8 months ago
- Training code and trained checkpoints for ASGAN.☆62Dec 27, 2023Updated 2 years ago
- A toolkit for any-to-any encoder-decoder voice conversion systems☆84Aug 10, 2023Updated 2 years ago
- Adversarial Training of Denoising Diffusion Model Using Dual Discriminators for High-Fidelity Multi-Speaker TTS☆40Aug 4, 2023Updated 2 years ago
- Baseline for DCASE 2024 Task 9: "Language-Queried Audio Source Separation"☆26Mar 27, 2024Updated 2 years ago