nafiuny / ICRCycleGAN-VC
Non-parallel voice conversion called ICRCycleGAN-VC based on CycleGAN and Inception-resNet module by Afiuny
☆13Updated 4 months ago
Related projects ⓘ
Alternatives and complementary repositories for ICRCycleGAN-VC
- ☆18Updated 5 months ago
- Implementation of Emo-StarGAN☆46Updated 10 months ago
- Official implementation of OSSGAN [CVPR 2022]☆22Updated 2 years ago
- Unofficial implementation of ResGrad: Residual Denoising Diffusion Probabilistic Models for Text to Speech☆15Updated 11 months ago
- Adversarial Training of Denoising Diffusion Model Using Dual Discriminators for High-Fidelity Multi-Speaker TTS☆35Updated last year
- Official repository for Polarity Sampling, CVPR 2022 ORAL☆12Updated 2 years ago
- An implementation of Charactr, Inc's "WavThruVec: Latent speech representation as intermediate features for neural speech synthesis"☆26Updated last year
- 2019_ML_Course Singing Voice Conversion Using Cycle-GAN:VC2☆15Updated 3 years ago
- A simple voice conversion tool☆15Updated 2 years ago
- StimulerVoiceX is a denoising and speech enhancement system. It uses deep learning techniques to remove noise from speech signals and imp…☆10Updated last year
- Finally, some decent sample sentences☆22Updated 11 months ago
- Supervoice Speaker Separation Network☆13Updated 5 months ago
- [TOMM 2024] Automatic Lyric Transcription and Automatic Music Transcription from Multimodal Singing☆16Updated 2 months ago
- ☆29Updated last month
- Codebase and project page for EDMSound☆29Updated 11 months ago
- **ICASSP 2022** 《Toward Degradation-Robust Voice Conversion》Using speech enhancement and end-to-end denoising training to improve degrada…☆23Updated 2 years ago
- This is an unofficial implementation of universal melgan according to https://arxiv.org/abs/2011.09631☆23Updated 2 years ago
- Speech enhancement in noisy and reverberant environments using deep neural networks☆15Updated last month
- ☆10Updated last year
- Official Implementation of "Inference and Denoise: Causal Inference-based Neural Speech Enhancement"☆27Updated last year
- TriNet: stabilizing self-supervised learning from complete or slow collapse on ASR.☆26Updated last year
- SRTNet☆24Updated last year
- ☆23Updated last year
- Implementation of Multi-Source Music Generation with Latent Diffusion.☆16Updated 2 months ago
- ☆48Updated last year
- Textless Speech-to-Music Retrieval Using Emotion Similarity [ICASSP23]☆17Updated last year
- Zero-Shot Emotion Style Transfer☆37Updated 7 months ago
- SC-CNN: Effective Speaker Conditioning Method for Zero-Shot Multi-Speaker Text-to-Speech Systems☆39Updated last year