Jackson-Kang/VQVC-Pytorch



An unofficial implementation of Vector Quantization Voice Conversion (VQVC).

☆29

Alternatives and similar repositories for VQVC-Pytorch

Users that are interested in VQVC-Pytorch are comparing it to the libraries listed below.


Jackson-Kang / Prosody-augmentation-for-Text-to-speech
Simple tool for speech dataset augmentation for modeling various prosodies.
☆14 · Jan 14, 2021 · Updated 5 years ago
Jackson-Kang / MFARunner
A simple tool to easily use Montreal Forced Aligner. Also provides alignments (TextGrid) retrieved from ESD.
☆45 · May 25, 2023 · Updated 3 years ago
GrantL10 / My-Python-Codes-for-Acoustics
Basic Tools
☆13 · Dec 18, 2021 · Updated 4 years ago
hrnoh / f0-autovc
Pytorch implementation of "f0-consistent many-to-many non-parallel voice conversion via conditional autoencoder"
☆29 · Nov 6, 2020 · Updated 5 years ago
Jackson-Kang / Pytorch-Diffusion-Model-Tutorial
A simple tutorial of Diffusion Probabilistic Models
☆114 · Nov 30, 2024 · Updated last year
bshall / VectorQuantizedCPC
Vector-Quantized Contrastive Predictive Coding for Acoustic Unit Discovery and Voice Conversion
☆142 · Sep 1, 2020 · Updated 5 years ago
ericwudayi / SkipVQVC
An implementation of SkipVQVC with various settings.
☆75 · Jun 22, 2020 · Updated 6 years ago
Jackson-Kang / Korean-phoneme-dictionary-generator
Korean phoneme dictionary generator for training Montreal Forced Aligner (MFA)
☆13 · Feb 27, 2021 · Updated 5 years ago
vsimkus / vae-voice-conversion
Voice conversion (VC) investigation using three variants of VAE
☆59 · Oct 28, 2019 · Updated 6 years ago
HGU-DLLAB / Korean-FastSpeech2-Pytorch
Implementation of Korean FastSpeech2
☆215 · Jan 29, 2023 · Updated 3 years ago
Jackson-Kang / Awesome-DL-based-Text-to-speech-Papers-and-Resources
Various Text-to-speech (TTS) papers based on Deep-learning
☆14 · Feb 26, 2021 · Updated 5 years ago
Wendison / VQMIVC
Official implementation of VQMIVC: One-shot (any-to-any) Voice Conversion @ Interspeech 2021 + Online playing demo!
☆361 · Apr 27, 2022 · Updated 4 years ago
MingjieChen / LowResourceVC
Voice conversion training with 109 speakers with limited training samples
☆35 · Dec 21, 2020 · Updated 5 years ago
ex3ndr / supervoice-librilight-preprocessed
60k hours of phoneme-aligned audio from audio books
☆19 · Jul 27, 2024 · Updated last year
YoungSeng / SRD-VC
Speech Representation Disentanglement with Adversarial Mutual Information Learning for One-shot Voice Conversion (Interspeech 2022)
☆119 · Feb 7, 2024 · Updated 2 years ago
rishikksh20 / Phone-Level-Mixture-Density-Network-for-TTS
Rich Prosody Diversity Modelling with Phone-level Mixture Density Network
☆45 · Dec 1, 2021 · Updated 4 years ago
Joshua-1995 / LearnableUpsamplingLayer-Pytorch
Pytorch implementation of LearnableUpsamplingLayer (NaturalSpeech, Tan et al., 2022)
☆57 · Mar 12, 2024 · Updated 2 years ago
onejiin / CycleGAN-VC2
CycleGAN-VC2: Improved CycleGAN-based Non-parallel Voice Conversion
☆42 · Mar 2, 2020 · Updated 6 years ago
gayanechilingar / Change-Emotions
Nonparallel Emotional Speech Conversion with MUNIT. Introduction: This is a tensorflow implementation of paper(https://arxiv.org/pdf/1811…
☆14 · Oct 13, 2021 · Updated 4 years ago
carl-robinson / voice-emotion-seq2seq
Voice emotion conversion model for DS/ML master's thesis. F0 contour mapping in sequence-to-sequence RNN-LSTM architecture in Tensorflow.
☆27 · Oct 30, 2018 · Updated 7 years ago
Yangyangii / pytorch-practice
pytorch basic
☆24 · Jan 8, 2019 · Updated 7 years ago
CODEJIN / MLPSinger
☆24 · Mar 15, 2022 · Updated 4 years ago
itec-hust / MusicYOLO
MusicYOLO framework uses the object detection model, YOLOx, to locate notes in the spectrogram.
☆18 · Jan 29, 2022 · Updated 4 years ago
WICWIU / WICWIU
WICWIU(What I can Create is What I Understand)
☆106 · Jan 7, 2023 · Updated 3 years ago
ktho22 / vctts
pytorch implementation of "Emotional Voice Conversion using Multitask Learning with Text-to-Speech", Accepted to ICASSP 2020
☆30 · Jul 6, 2023 · Updated 3 years ago
cyhuang-tw / AdaIN-VC
An unofficial implementation of the paper "One-shot Voice Conversion by Separating Speaker and Content Representations with Instance Norm…
☆119 · May 27, 2021 · Updated 5 years ago
jlian2 / Robust-Voice-Style-Transfer
Demo for 2022 ICASSP
☆64 · Jun 14, 2022 · Updated 4 years ago
NingMiao / InteL-VAEs
Code for the paper "InteL-VAEs: Adding Inductive Biases to Variational Auto-Encoders via Intermediary Latents".
☆18 · Jun 25, 2021 · Updated 5 years ago
KunZhou9646 / Speaker-independent-emotional-voice-conversion-based-on-conditional-VAW-GAN-and-CWT
This is the implementation of our Interspeech 2020 paper "Converting anyone's emotion: towards speaker-independent emotional voice conver…
☆90 · Nov 13, 2020 · Updated 5 years ago
KunZhou9646 / Emovox
This is the implementation of the paper "Emotion Intensity and its Control for Emotional Voice Conversion".
☆95 · Feb 9, 2022 · Updated 4 years ago
tarepan / VoiceConversionLab
Collect Voice Conversion researches
☆97 · Updated this week
KimythAnly / AGAIN-VC
This is the official implementation of the paper AGAIN-VC: A One-shot Voice Conversion using Activation Guidance and Adaptive Instance No…
☆114 · Dec 7, 2020 · Updated 5 years ago
anonymous-pits / pits
PITS: Variational Pitch Inference for End-to-end Pitch-controllable TTS without External Pitch Predictor
☆280 · Jul 16, 2023 · Updated 3 years ago
v-nhandt21 / MusicVoiceConversion
Sing any popular song with your voice
☆11 · Jul 10, 2022 · Updated 4 years ago
cjerry1243 / TransferLearning-CLVC
Transfer Learning from Monolingual ASR to Transcription-free Cross-lingual Voice Conversion
☆40 · Oct 22, 2022 · Updated 3 years ago
Lukelluke / MCD-MEL-CEPSTRAL-DISTANCE-MCD-application
Mel cepstral distortion (MCD) computations in python. Use Merlin toolkit to convert .wav files to .gcm files. Work in all form of .wav fi…
☆22 · Sep 4, 2020 · Updated 5 years ago
ex3ndr / supervoice-hybrid
My hybrid TTS network that combines VALL-E, VoiceBox, SpeechFlow, Seamless and TortoiseTTS into one
☆26 · Aug 5, 2024 · Updated last year
walker-hyf / FCTalker
FCTalker: Fine and Coarse Grained Context Modeling for Expressive Conversational Speech Synthesis (Accepted by ISCSLP'2024)
☆26 · Feb 22, 2024 · Updated 2 years ago
revsic / torch-diffusion-wavegan
Parallel waveform generation with DiffusionGAN
☆17 · Mar 26, 2022 · Updated 4 years ago