zcf28/StyleGAN-VC

Voice Conversion method based on speaker style

☆14

Alternatives and similar repositories for StyleGAN-VC

Users who are interested in StyleGAN-VC are comparing it to the repositories listed below.

MingjieChen / VoiceConversionGANs
GAN series for voice conversion on VCC2018 dataset
☆17 · Aug 27, 2020 · Updated 5 years ago
chaufanglin / Normal2Whisper
Implementation of "Improving Whispered Speech Recognition Performance using Pseudo-whispered based Data Augmentation"
☆14 · Oct 31, 2024 · Updated last year
RMSnow / HAT
Official repository for "Structure-Enhanced Pop Music Generation via Harmony-Aware Learning", ACM MM 2022.
☆14 · Mar 22, 2023 · Updated 3 years ago
TheShadow29 / VC-with-GAN
Voice Conversion with GANs
☆15 · Jul 5, 2018 · Updated 8 years ago
Honee-W / CPTNN
Unofficial implementation of "CPTNN: CROSS-PARALLEL TRANSFORMER NEURAL NETWORK FOR TIME-DOMAIN SPEECH ENHANCEMENT"
☆15 · Nov 14, 2023 · Updated 2 years ago
jshong0907 / SingingVoiceConversion
2019_ML_Course Singing Voice Conversion Using Cycle-GAN:VC2
☆17 · Dec 30, 2020 · Updated 5 years ago
gteu / realtime-ppg-vc
Voice conversion model for real-time speech synthesis using PPG (Phonetic PosteriorGram) as an intermediate feature, written in PyTorch.
☆29 · Mar 3, 2022 · Updated 4 years ago
cyhuang-tw / AutoVC
An unofficial implementation of the paper "AutoVC: Zero-Shot Voice Style Transfer with Only Autoencoder Loss".
☆34 · Apr 26, 2021 · Updated 5 years ago
dwgnr / speech-conversion
Whisper to Normal Speech Conversion with SC-MelGAN and SC-VQ-VAE
☆15 · Dec 3, 2022 · Updated 3 years ago
hubertsiuzdak / voice-conversion
Voice conversion using deep adversarial learning
☆17 · Oct 29, 2021 · Updated 4 years ago
lucadellalib / discrete-wavlm-codec
A neural speech codec based on discrete WavLM representations
☆26 · Aug 28, 2024 · Updated last year
LJY-M / ppg_tacotron
An implementation of deep-voice-conversion using PyTorch
☆19 · Dec 10, 2021 · Updated 4 years ago
jeremychee4 / AffectSpeech
AffectSpeech: A Large-Scale Emotional Speech Dataset with Fine-Grained Textual Descriptions for Speech Emotion Captioning and Synthesis
☆68 · Jun 12, 2026 · Updated last month
SolomidHero / real-time-voice-conversion
Toolbox for easy and qualitative one-shot voice conversion
☆48 · Dec 5, 2021 · Updated 4 years ago
freenowill / AutoVC-WavRNN
Voice conversion system
☆25 · Jun 10, 2020 · Updated 6 years ago
himajin2045 / voice-conversion
Voice Conversion pipeline consisting of GE2E speaker encoder, AutoVC conversion model and MelGAN vocoder.
☆23 · Jan 24, 2021 · Updated 5 years ago
k2kobayashi / crank
A toolkit for non-parallel voice conversion based on vector-quantized variational autoencoder
☆171 · Jul 25, 2024 · Updated last year
KunZhou9646 / seq2seq-EVC
This is the implementation of our Interspeech 2021 paper: Limited data emotional voice conversion leveraging text-to-speech: two-stage se…
☆87 · Dec 31, 2022 · Updated 3 years ago
qiuqiao / DDSP-HiFiGAN
Vocoder based on PC-DDSP and nsf-HiFiGAN
☆19 · Jul 17, 2023 · Updated 3 years ago
avi33 / universalmelgan
This is an unofficial implementation of universal melgan according to https://arxiv.org/abs/2011.09631
☆23 · Aug 15, 2022 · Updated 3 years ago
anas-rz / specmix-pytorch
A Mixed Sample Data Augmentation method for Training with Time-Frequency Domain Features
☆10 · Oct 5, 2022 · Updated 3 years ago
georgid / AlignmentEvaluation
Scripts for computing common lyrics-to-audio alignment evaluation metrics. Usable evaluation for any token-based alignment (e.g. if tok…
☆18 · Oct 27, 2020 · Updated 5 years ago
deezer / MultilingualLyricsToAudioAlignment
DALI datasets split used to train models presented in the paper Multilingual lyrics-to-audio alignment (ISMIR 2020).
☆13 · May 25, 2021 · Updated 5 years ago
liusongxiang / ppg-vc
PPG-Based Voice Conversion
☆348 · Jul 22, 2022 · Updated 3 years ago
qiuqiangkong / dcase2019_task1
☆20 · May 13, 2019 · Updated 7 years ago
WangHelin1997 / DuTa-VC
Source code and demo for INTERSPEECH 2023 paper: DuTa-VC: A Duration-aware Typical-to-atypical Voice Conversion Approach with Diffusion P…
☆38 · Dec 5, 2023 · Updated 2 years ago
shackysureshot / StarGAN-Voice-Conversion-2
A PyTorch implementation of StarGAN-VC2
☆150 · Sep 11, 2020 · Updated 5 years ago
MingjieChen / LowResourceVC
Voice conversion training with 109 speakers with limited training samples
☆35 · Dec 21, 2020 · Updated 5 years ago
Livefull / SphereDiar
☆11 · May 4, 2020 · Updated 6 years ago
rkmt / wesper-demo
☆36 · Dec 25, 2023 · Updated 2 years ago
shuheikatoinfo / UtterTune
LoRA-based phoneme/prosody control for LLM-based TTS with no G2P - Lightweight adapter for edit and control the target language's phoneme…
☆26 · Jul 8, 2026 · Updated last week
d-dimos / microprocessors_laboratory_ntua
[ECE NTUA] Microprocessors Laboratory - Exercise Sets & Solutions (2020-2021)
☆14 · Jul 29, 2021 · Updated 4 years ago
vBaiCai / vc_tacotron
Voice Conversion using Tacotron.
☆11 · Dec 29, 2022 · Updated 3 years ago
KnurpsBram / AutoVC_WavenetVocoder_GriffinLim_experiments
Experiments on AutoVC and WaveNet vocoder, compared against the Griffin-Lim spectrogram inversion algorithm
☆11 · Jun 18, 2020 · Updated 6 years ago
nihal111 / voice-conversion
Machine Learning course project to convert a source voice into a target voice.
☆13 · May 26, 2018 · Updated 8 years ago
acetylSv / non-parallel-rhythm-flexible-VC
PyTorch implementation of: Rhythm-Flexible Voice Conversion without Parallel Data Using Cycle-GAN over Phoneme Posteriorgram Sequences
☆11 · Jul 18, 2019 · Updated 7 years ago
michaelmorr82 / Machine-Learning-Coursera-Andrew-Ng
MATLAB and Python solutions for the Machine Learning course on Coursera by Andrew Ng
☆11 · Jun 23, 2018 · Updated 8 years ago
a43992899 / DeID-VC
Code for Interspeech 2022 paper DeID-VC: Speaker De-identification via Zero-shot Pseudo Voice Conversion
☆13 · May 6, 2023 · Updated 3 years ago
Kikyo-16 / coco-mulla-repo
Official source code of coco-mulla
☆36 · Mar 21, 2024 · Updated 2 years ago