ryuzho/DiffVC

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/ryuzho/DiffVC)

ryuzho / DiffVC

Diffusion Model for Voice Conversion

☆17

Alternatives and similar repositories for DiffVC

Users that are interested in DiffVC are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

AJOU-DEVELOPERS / Do-IT-Page
View on GitHub
아주대학교 개발 중앙동아리 Do-IT 웹 페이지 프로젝트
☆12Apr 4, 2022Updated 4 years ago
shengcanxu / canoSpeech
View on GitHub
text to speech
☆10Mar 19, 2024Updated 2 years ago
ga642381 / RobustVC
View on GitHub
**ICASSP 2022** 《Toward Degradation-Robust Voice Conversion》Using speech enhancement and end-to-end denoising training to improve degrada…
☆24Sep 27, 2022Updated 3 years ago
MaxMax2016 / max-vc
View on GitHub
singing voice conversion without f0
☆23May 10, 2023Updated 3 years ago
vtuber-plan / vcvits
View on GitHub
Non Parallel Voice Conversion based on VITS
☆24Mar 31, 2023Updated 3 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
innnky / vispeech
View on GitHub
基于vits fastspeech2 visinger的tts模型
☆24Mar 9, 2023Updated 3 years ago
MiscellaneousStuff / PhoneLM
View on GitHub
(R&D) Text to speech using phonemes as inputs and audio codec codes as outputs. Loosely based on MegaByte, VALL-E and Encodec.
☆48Sep 4, 2023Updated 2 years ago
MelissaChen15 / control-vc
View on GitHub
This is the implementation for "ControlVC: Zero-Shot Voice Conversion with Time-Varying Controls on Pitch and Rhythm"
☆132Nov 29, 2023Updated 2 years ago
reppy4620 / convnext_tts
View on GitHub
Unofficial implementation of ConvNeXt-TTS powered by lightning
☆18Oct 20, 2024Updated last year
Li-Muyang / MLP4Rec
View on GitHub
IJCAI 2022 MLP4Rec
☆17Sep 5, 2022Updated 3 years ago
Topaz1618 / CycleganSA
View on GitHub
💻 🐈 Added a self-attention layer to the CycleGAN implementation (PyTorch).
☆13May 31, 2024Updated 2 years ago
redbug312 / midi-visualizer
View on GitHub
Visualize MIDIs as piano tutorials
☆15Aug 16, 2022Updated 3 years ago
ZihaoZhao / Pytorch-ASR-WaveNet
View on GitHub
A Pytorch implementation of WaveNet ASR (Automatic Speech Recognition)
☆13Sep 22, 2021Updated 4 years ago
ajaybati / miipher2.0
View on GitHub
Reimplementation of Miipher
☆30Aug 16, 2023Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
liuhuang31 / g2pw_once
View on GitHub
G2pw's inference speed is accelerated by about 8-10 times. Change loop generated predictive data to only once and model loop prediction b…
☆14Dec 30, 2023Updated 2 years ago
tony10101105 / HEAR-2021-NeurIPS-Challenge---NTU-GURA
View on GitHub
☆13Mar 7, 2022Updated 4 years ago
freenowill / AutoVC-WavRNN
View on GitHub
voice conversion system
☆25Jun 10, 2020Updated 6 years ago
exercise-book-yq / FreeCodec
View on GitHub
FREECODEC: A DISENTANGLED NEURAL SPEECH CODEC WITH FEWER TOKENS
☆24Sep 9, 2024Updated last year
howard1337 / S2VC
View on GitHub
☆100Jul 22, 2021Updated 5 years ago
hubertsiuzdak / voice-conversion
View on GitHub
Voice conversion using deep adversarial learning
☆17Oct 29, 2021Updated 4 years ago
bottlecapper / EmoCycleGAN
View on GitHub
Emotional Speech Conversion using Nonparallel Data
☆17Apr 10, 2019Updated 7 years ago
Devil4ngle / SquadMortarOverlay
View on GitHub
Squad Mortar Overlay: Overlays Squad map with the SquadCalc map
☆13Jun 14, 2026Updated last month
jasonppy / FaST-VGS-Family
View on GitHub
Transformer-based visually grounded speech models
☆19Sep 22, 2022Updated 3 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
ktho22 / vctts
View on GitHub
pytorch implementation of "Emotional Voice Conversion using Multitask Learning with Text-to-Speech", Accepted to ICASSP 2020
☆30Jul 6, 2023Updated 3 years ago
jisang93 / VISinger
View on GitHub
Unofficial pytorch implementation of VISinger: Variational Inference with Adversarial Learning for End-to-end Singing Voice Synthesis (IC…
☆20May 12, 2023Updated 3 years ago
jxmorris12 / synthviz
View on GitHub
visualize MIDI files from piano MIDI or audio
☆22Feb 22, 2022Updated 4 years ago
anton-kashkin / hifi_vc
View on GitHub
☆25Jan 24, 2023Updated 3 years ago
yqzhishen / onnxcrepe
View on GitHub
ONNX deployment of the CREPE pitch tracker
☆27Oct 27, 2022Updated 3 years ago
philgzl / brever
View on GitHub
Speech enhancement in noisy and reverberant environments using deep neural networks
☆23Oct 10, 2025Updated 9 months ago
ConsistencyVC / ConsistencyVC-voive-conversion
View on GitHub
Using joint training speaker encoder with consistency loss to achieve cross-lingual voice conversion and expressive voice conversion
☆154Oct 16, 2023Updated 2 years ago
cyhuang-tw / AdaIN-VC
View on GitHub
An unofficial implementation of the paper "One-shot Voice Conversion by Separating Speaker and Content Representations with Instance Norm…
☆119May 27, 2021Updated 5 years ago
b-sigpro / sed-hsmm
View on GitHub
Onset-and-Offset-Aware Sound Event Detection
☆21Feb 10, 2025Updated last year
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
shivammehta25 / BetterFastSpeech2
View on GitHub
Just another FastSpeech 2 but cleaner code :)
☆29Jun 28, 2024Updated 2 years ago
audiodemo / voice-conversion
View on GitHub
Vocoder-Free Non-Parallel Conversion of Whispered Speech With Masked Cycle-Consistent Generative Adversarial Networks
☆17Aug 18, 2023Updated 2 years ago
suzuki256 / dog-dataset
View on GitHub
☆47Jul 15, 2022Updated 4 years ago
MaxMax2016 / Glow-SVC
View on GitHub
4G GPU & 10 Minutes for train
☆12Aug 9, 2023Updated 2 years ago
xieyuankun / FSD-Dataset
View on GitHub
This repository presents FSD dataset for song deepfake detection.
☆24Aug 18, 2025Updated 11 months ago
YangZhengyi98 / RecInterpreter
View on GitHub
☆25Nov 16, 2023Updated 2 years ago
indri-voice / audiotoken
View on GitHub
Audio tokenization, in the fastest way possible!
☆54Aug 26, 2024Updated last year