WangHelin1997/DuTa-VC

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/WangHelin1997/DuTa-VC)

WangHelin1997 / DuTa-VC

Source code and demo for INTERPSEECH 2023 paper: DuTa-VC: A Duration-aware Typical-to-atypical Voice Conversion Approach with Diffusion Probabilistic Model

☆38

Alternatives and similar repositories for DuTa-VC

Users that are interested in DuTa-VC are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

WangHelin1997 / Aty-TTS
View on GitHub
Aty-TTS: Improving fairness for spoken language understanding in atypical speech with Text-to-Speech
☆11May 14, 2025Updated last year
WingZLeung / TTDS
View on GitHub
Text-to-dysarthric speech (TTDS) synthesis. An implementation using the Grad-TTS model with the TORGO database.
☆13Mar 15, 2025Updated last year
chaufanglin / Normal2Whisper
View on GitHub
Implementation of "Improving Whispered Speech Recognition Performance using Pseudo-whispered based Data Augmentation"
☆14Oct 31, 2024Updated last year
zengchang233 / CrossSinger
View on GitHub
The source code for the paper CrossSinger (asru2023)
☆18Oct 12, 2023Updated 2 years ago
arnabdas8901 / StarGAN-VC_PlusPlus
View on GitHub
☆11Aug 11, 2023Updated 2 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
MiscellaneousStuff / PhoneLM
View on GitHub
(R&D) Text to speech using phonemes as inputs and audio codec codes as outputs. Loosely based on MegaByte, VALL-E and Encodec.
☆48Sep 4, 2023Updated 2 years ago
audiodemo / voice-conversion
View on GitHub
Vocoder-Free Non-Parallel Conversion of Whispered Speech With Masked Cycle-Consistent Generative Adversarial Networks
☆17Aug 18, 2023Updated 2 years ago
p1an-lin-jung / wv_tts
View on GitHub
☆19Mar 22, 2024Updated 2 years ago
NKU-HLT / PB-DSR
View on GitHub
[Interspeech 2024] Enhancing Dysarthric Speech Recognition for Unseen Speakers via Prototype-Based Adaptation
☆14Nov 28, 2024Updated last year
thuhcsi / SnakeGAN
View on GitHub
Please visit https://thuhcsi.github.io/SnakeGAN/
☆37Apr 25, 2023Updated 3 years ago
trinhtuanvubk / Diff-VC
View on GitHub
Diffusion Model for Voice Conversion
☆72Mar 14, 2024Updated 2 years ago
ZehuaKcrissLi / GTR-Voice
View on GitHub
☆16Nov 11, 2024Updated last year
odunola499 / f5-lora
View on GitHub
☆19Nov 18, 2025Updated 8 months ago
WangHelin1997 / Automatic_Speech_Annotator
View on GitHub
Automatic speech annotator processing speech with voice activaty detection, overlapping speech detection, speaker diarization and automat…
☆33Jun 14, 2024Updated 2 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
xiuwenz2 / SAPC-template
View on GitHub
☆17Mar 20, 2026Updated 4 months ago
reppy4620 / convnext_tts
View on GitHub
Unofficial implementation of ConvNeXt-TTS powered by lightning
☆18Oct 20, 2024Updated last year
MelissaChen15 / control-vc
View on GitHub
This is the implementation for "ControlVC: Zero-Shot Voice Conversion with Time-Varying Controls on Pitch and Rhythm"
☆132Nov 29, 2023Updated 2 years ago
zengchang233 / xiaoicesing2
View on GitHub
The source code for the paper XiaoiceSing2 (interspeech2023)
☆49Jan 15, 2024Updated 2 years ago
sony / diffiner
View on GitHub
☆68Aug 16, 2023Updated 2 years ago
huaidanquede / Dense-TSNet
View on GitHub
offical code for Dense-TSNet
☆12Sep 17, 2024Updated last year
uthree / ddsp-vocoder
View on GitHub
☆12Nov 7, 2024Updated last year
thuhcsi / LightGrad
View on GitHub
☆68Jul 23, 2023Updated 3 years ago
ga642381 / RobustVC
View on GitHub
**ICASSP 2022** 《Toward Degradation-Robust Voice Conversion》Using speech enhancement and end-to-end denoising training to improve degrada…
☆24Sep 27, 2022Updated 3 years ago
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
adelacvg / NS2VC
View on GitHub
Unofficial implementation of NaturalSpeech2 for Voice Conversion and Text to Speech
☆237Feb 29, 2024Updated 2 years ago
interactiveaudiolab / ppgs
View on GitHub
High-Fidelity Neural Phonetic Posteriorgrams
☆124Feb 23, 2025Updated last year
revsic / torch-nansy
View on GitHub
Torch implementation of NANSY, Neural Analysis and Synthesis, arXiv:2110.14513
☆64Feb 13, 2023Updated 3 years ago
YoungSeng / SRD-VC
View on GitHub
Speech Representation Disentanglement with Adversarial Mutual Information Learning for One-shot Voice Conversion (Interspeech 2022)
☆119Feb 7, 2024Updated 2 years ago
Aria-K-Alethia / laughter-synthesis
View on GitHub
Official implementation of the paper "Laughter Synthesis using Pseudo Phonetic Tokens with a Large-scale In-the-wild Laughter Corpus" acc…
☆77Jul 16, 2023Updated 3 years ago
revsic / torch-whisper-guided-vc
View on GitHub
Torch implementation of Whisper-guided DDPM based Voice Conversion
☆49Mar 7, 2023Updated 3 years ago
cjerry1243 / TransferLearning-CLVC
View on GitHub
Transfer Learning from Monolingual ASR to Transcription-free Cross-lingual Voice Conversion
☆40Oct 22, 2022Updated 3 years ago
lucadellalib / discrete-wavlm-codec
View on GitHub
A neural speech codec based on discrete WavLM representations
☆26Aug 28, 2024Updated last year
Honee-W / CPTNN
View on GitHub
unofficial implementation of "CPTNN: CROSS-PARALLEL TRANSFORMER NEURAL NETWORK FOR TIME-DOMAIN SPEECH ENHANCEMENT"
☆15Nov 14, 2023Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
p0p4k / Matcha-TTS-2
View on GitHub
E2E TTS using Conditional Flow Matching (Experimental*)
☆71Nov 10, 2023Updated 2 years ago
xinshengwang / ICASSP2021_paper_list-VC
View on GitHub
ICASSP 2021 accepted papers in term of voice conversion (VC)
☆18Apr 11, 2021Updated 5 years ago
bshall / urhythmic
View on GitHub
Unsupervised Rhythm Modeling for Voice Conversion
☆85Aug 3, 2023Updated 2 years ago
huutuongtu / Lightvoc
View on GitHub
LIGHTVOC AN UPSAMPLING-FREE GAN VOCODER BASED ON CONFORMER AND INVERSE SHORT-TIME FOURIER TRANSFORM
☆18May 17, 2024Updated 2 years ago
RF5 / transfusion-asr
View on GitHub
Transcribing Speech with Multinomial Diffusion, training code and models.
☆80Sep 27, 2023Updated 2 years ago
BUTSpeechFIT / TS_SUPERB
View on GitHub
☆16Apr 2, 2025Updated last year
NUS-HPC-AI-Lab / MoST
View on GitHub
MoST: Mixing Speech and Text with Modality-Aware Mixture of Experts
☆33Jan 15, 2026Updated 6 months ago