zsl24 / Mel-GAN-Voice-Conversion-Demo
A Demo for real-time voice conversion based on Mel-GAN
☆12Updated 3 years ago
Alternatives and similar repositories for Mel-GAN-Voice-Conversion-Demo:
Users that are interested in Mel-GAN-Voice-Conversion-Demo are comparing it to the libraries listed below
- transcribe guitar solo audio to midi-like tab.☆11Updated 2 years ago
- TriAAN-VC: Triple Adaptive Attention Normalization for Any-to-Any Voice Conversion☆144Updated last year
- Official implementation for the paper: A Unified One-Shot Prosody and Speaker Conversion System with Self-Supervised Discrete Speech Unit…☆78Updated 2 years ago
- ☆31Updated last year
- ☆98Updated 4 months ago
- ☆22Updated last year
- Conditional Variational Auto-Encoder with Jointly Training FastSpeech2(+Conformer) and HiFi-GAN for End to End Text to Speech☆46Updated 2 years ago
- A toy-like Text-to-Speech for Chinese/Mandarin synthesize, inspired by Tacotron & FastSpeech2 & RefineGAN.☆15Updated 2 years ago
- ☆44Updated last year
- Implementation of DCComix TTS: An End-to-End Expressive TTS with Discrete Code Collaborated with Mixer☆75Updated last year
- Singing Voice Speech modeling test☆35Updated 2 years ago
- ☆87Updated 2 years ago
- ☆25Updated 5 months ago
- A diffusion-based cross-lingual voice conversion model, as my bachelor's thesis☆44Updated last year
- Collect Voice Conversion researches☆91Updated this week
- Vocoder NSF-HiFiGAN (Moved into deepaudio)☆50Updated 2 years ago
- Unofficial implementation of NANSY++ in Pytorch Lightning☆51Updated 10 months ago
- Pytorch implementation of "f0-consistent many-to-many non-parallel voice conversion via conditional autoencoder"☆28Updated 4 years ago
- VAE modified from Descript Audio Codec, which replaces the RVQ with VAE☆61Updated 9 months ago
- Multi-speaker Speech Synthesis Using VITS(KO, JA, EN, ZH)☆73Updated 10 months ago
- Linear Prediction Coefficients estimation from mel-spectrogram implemented in Python based on Levinson-Durbin algorithm.☆68Updated 3 years ago
- ☆37Updated 9 months ago
- Implementation for paper "Disentangled Speech Representation Learning for One-Shot Cross-Lingual Voice Conversion Using ß-VAE"☆42Updated last year
- Please visit https://thuhcsi.github.io/SnakeGAN/☆36Updated last year
- This repository provides a multi-mode and multi-speaker expressive speech synthesis framework, including multi-attentive Tacotron, DurIAN…☆74Updated 2 years ago
- ☆39Updated last year
- ☆65Updated last year
- Autovocoder: Fast Waveform Generation from a Learned Speech Representation using Differentiable Digital Signal Processing☆68Updated 2 years ago