An unofficial implementation of Vector Quantization Voice Conversion (VQVC).
☆29Apr 12, 2021Updated 4 years ago
Alternatives and similar repositories for VQVC-Pytorch
Users that are interested in VQVC-Pytorch are comparing it to the libraries listed below
Sorting:
- Simple tool for speech dataset augmentation for modeling various prosodies.☆14Jan 14, 2021Updated 5 years ago
- Pytorch implementation of "f0-consistent many-to-many non-parallel voice conversion via conditional autoencoder"☆29Nov 6, 2020Updated 5 years ago
- ☆24Mar 15, 2022Updated 3 years ago
- A simple tool to easily use Montreal Forced Aligner. Also provide alignment(TextGrid) retrieved from ESD.☆45May 25, 2023Updated 2 years ago
- Vector-Quantized Contrastive Predictive Coding for Acoustic Unit Discovery and Voice Conversion☆143Sep 1, 2020Updated 5 years ago
- Synthesized singing voice demos of WeSinger 2 paper.☆26Feb 20, 2023Updated 3 years ago
- Basic Tools☆13Dec 18, 2021Updated 4 years ago
- An implementation of SkipVQVC with various settings.☆75Jun 22, 2020Updated 5 years ago
- Rich Prosody Diversity Modelling with Phone-level Mixture Density Network☆45Dec 1, 2021Updated 4 years ago
- Voice emotion conversion model for DS/ML master's thesis. F0 contour mapping in sequence-to-sequence RNN-LSTM architecture in Tensorflow.☆27Oct 30, 2018Updated 7 years ago
- Codes for paper <InteL-VAEs: Adding Inductive Biases to VariationalAuto-Encoders via Intermediary Latents>.☆18Jun 25, 2021Updated 4 years ago
- Collect Voice Conversion researches☆96Updated this week
- Official implementation of VQMIVC: One-shot (any-to-any) Voice Conversion @ Interspeech 2021 + Online playing demo!☆360Apr 27, 2022Updated 3 years ago
- 60k hours of phoneme-aligned audio from audio books☆19Jul 27, 2024Updated last year
- Parallel waveform generation with DiffusionGAN☆17Mar 26, 2022Updated 3 years ago
- A simple tutorial of Diffusion Probabilistic Models☆107Nov 30, 2024Updated last year
- Demo for 2022 ICASSP☆64Jun 14, 2022Updated 3 years ago
- Speech Representation Disentanglement with Adversarial Mutual Information Learning for One-shot Voice Conversion (Interspeech 2022)☆119Feb 7, 2024Updated 2 years ago
- Voice conversion (VC) investigation using three variants of VAE☆59Oct 28, 2019Updated 6 years ago
- An unofficial implementation of the paper "One-shot Voice Conversion by Separating Speaker and Content Representations with Instance Norm…☆117May 27, 2021Updated 4 years ago
- Torch implementation of Whisper-guided DDPM based Voice Conversion☆49Mar 7, 2023Updated 2 years ago
- Simple torch.nn.module implementation of Alias-Free-GAN style filter and resample☆99Jul 26, 2022Updated 3 years ago
- My hybrid TTS network that combines, VALL-E, VoiceBox, SpeechFlow, Seamless and TortoiseTTS into one☆26Aug 5, 2024Updated last year
- FCTalker: Fine and Coarse Grained Context Modeling for Expressive Conversational Speech Synthesis (Accepted by ISCSLP'2024)☆26Feb 22, 2024Updated 2 years ago
- This is the implementation of the paper "Emotion Intensity and its Control for Emotional Voice Conversion".☆95Feb 9, 2022Updated 4 years ago
- NU-Wave: A Diffusion Probabilistic Model for Neural Audio Upsampling☆37May 25, 2021Updated 4 years ago
- CycleGAN-VC2: Improved CycleGAN-based Non-parallel Voice Conversion☆41Mar 2, 2020Updated 6 years ago
- Unofficial PyTorch implementation of Masked Autoencoders that Listen☆71Aug 8, 2022Updated 3 years ago
- PyTorch implementation of Retriever: Learning Content-Style Representation☆12Jan 27, 2023Updated 3 years ago
- ☆10Apr 8, 2024Updated last year
- Demos, pretrained models, and (WIP) code supporting Representation Mixing☆51Dec 18, 2018Updated 7 years ago
- Phoneme-Level BERT for Enhanced Prosody of Text-to-Speech with Grapheme Predictions☆268Jan 13, 2025Updated last year
- FINALLY: Fast and universal speech enhancement model delivering studio-quality audio for a wide range of recordings.☆25Dec 11, 2025Updated 2 months ago
- ☆22Jul 30, 2025Updated 7 months ago
- ☆45Dec 16, 2019Updated 6 years ago
- PyTorch Implementation of Meta-StyleSpeech : Multi-Speaker Adaptive Text-to-Speech Generation☆197Feb 10, 2022Updated 4 years ago
- Deep Convolutional TTS pytorch implementation☆27Jul 2, 2019Updated 6 years ago
- Unofficial Pytorch implementation of SNAC: Speaker-normalized affine coupling layer in flow-based architecture for zero-shot multi-speake…☆57Aug 7, 2023Updated 2 years ago
- JETS: Jointly Training FastSpeech2 and HiFi-GAN for End to End Text to Speech☆113Jun 6, 2022Updated 3 years ago