Diffusion Model for Voice Conversion
☆71 Mar 14, 2024 Updated 2 years ago
Alternatives and similar repositories for Diff-VC
Users who are interested in Diff-VC are comparing it to the libraries listed below.
- Official PyTorch Implementation of "Diff-HierVC: Diffusion-based Hierarchical Voice Conversion with Robust Pitch Generation and Masked Pr… ☆236 Jul 3, 2024 Updated last year
- Source code and demo for INTERSPEECH 2023 paper: DuTa-VC: A Duration-aware Typical-to-atypical Voice Conversion Approach with Diffusion P… ☆37 Dec 5, 2023 Updated 2 years ago
- This is the implementation of the paper "Emotion Intensity and its Control for Emotional Voice Conversion". ☆95 Feb 9, 2022 Updated 4 years ago
- This is the implementation of our Interspeech 2021 paper: Limited data emotional voice conversion leveraging text-to-speech: two-stage se… ☆87 Dec 31, 2022 Updated 3 years ago
- Keyword Spotting using BCResNet and Arcface Loss ☆13 Jan 28, 2022 Updated 4 years ago
- My personal implementation of SVTR model for handwritten OCR ☆15 Mar 1, 2024 Updated 2 years ago
- A curated list of awesome voice conversion, projects and communities. ☆264 Nov 18, 2025 Updated 5 months ago
- ☆12 Nov 7, 2024 Updated last year
- This is the main repository of open-sourced speech technology by Huawei Noah's Ark Lab. ☆604 Sep 18, 2023 Updated 2 years ago
- ☆16 Mar 20, 2026 Updated last month
- An unofficial implementation of the paper "One-shot Voice Conversion by Separating Speaker and Content Representations with Instance Norm… ☆118 May 27, 2021 Updated 4 years ago
- Unsupervised Rhythm Modeling for Voice Conversion ☆85 Aug 3, 2023 Updated 2 years ago
- PyTorch implementation of "Emotional Voice Conversion using Multitask Learning with Text-to-Speech", accepted to ICASSP 2020 ☆30 Jul 6, 2023 Updated 2 years ago
- ☆23 Dec 14, 2021 Updated 4 years ago
- Repo of the paper "Towards Building an End-to-End Multilingual Automatic Lyrics Transcription Model" ☆15 Jun 28, 2024 Updated last year
- ☆54 Jul 16, 2025 Updated 9 months ago
- Speech Representation Disentanglement with Adversarial Mutual Information Learning for One-shot Voice Conversion (Interspeech 2022) ☆119 Feb 7, 2024 Updated 2 years ago
- [INTERSPEECH 2024] Official code for VoxSim: A perceptual voice similarity dataset ☆13 Sep 29, 2025 Updated 7 months ago
- Text-to-dysarthric speech (TTDS) synthesis. An implementation using the Grad-TTS model with the TORGO database. ☆12 Mar 15, 2025 Updated last year
- S3PRL-VC: A Voice Conversion Toolkit based on S3PRL ☆101 Mar 15, 2026 Updated last month
- Using joint training speaker encoder with consistency loss to achieve cross-lingual voice conversion and expressive voice conversion ☆154 Oct 16, 2023 Updated 2 years ago
- A sequence-to-sequence voice conversion toolkit. ☆112 Mar 15, 2026 Updated last month
- Official Implementation of StyleTTS-VC ☆198 Jan 14, 2025 Updated last year
- "Music Style Transfer with Time-Varying Inversion of Diffusion Models" ☆60 Jul 23, 2024 Updated last year
- The open source code for LLM-Codec ☆147 Aug 18, 2024 Updated last year
- An official implementation of Style-Talker for Spoken Dialogue Generation ☆23 Jan 12, 2025 Updated last year
- [ICASSP 2024] This is the official code for "VoiceFlow: Efficient Text-to-Speech with Rectified Flow Matching" ☆372 Sep 3, 2024 Updated last year
- Official implementation of VQMIVC: One-shot (any-to-any) Voice Conversion @ Interspeech 2021 + Online playing demo! ☆361 Apr 27, 2022 Updated 4 years ago
- ☆82 Jan 22, 2025 Updated last year
- ☆40 Jan 24, 2023 Updated 3 years ago
- Codebase and project page for EDMSound ☆35 Nov 20, 2023 Updated 2 years ago
- This is the implementation of our Interspeech 2022 paper "Disentanglement of Emotional Style and Speaker Identity for Expressive Voice Conv… ☆21 Sep 18, 2023 Updated 2 years ago
- Implementation of Emo-StarGAN ☆46 Dec 19, 2023 Updated 2 years ago
- Attention-Enhanced Short-Time Wiener Solution for Acoustic Echo Cancellation ☆28 Nov 12, 2025 Updated 5 months ago
- Official implementation of SpeechSplit2 ☆136 Oct 22, 2022 Updated 3 years ago
- Lightweight Speech Representation Learning for One-Shot Voice Conversion ☆24 Dec 12, 2024 Updated last year
- [ICASSP 2025] FreeSVC: Towards Zero-shot Multilingual Singing Voice Conversion ☆92 Jul 23, 2025 Updated 9 months ago
- This is a PyTorch implementation of the paper: StarGAN-VC: Non-parallel many-to-many voice conversion with star generative adversarial ne… ☆523 Oct 11, 2019 Updated 6 years ago
- Training code and trained checkpoints for ASGAN. ☆62 Dec 27, 2023 Updated 2 years ago