kamepong / StarGAN-VC
☆24Updated 3 years ago
Alternatives and similar repositories for StarGAN-VC:
Users that are interested in StarGAN-VC are comparing it to the libraries listed below
- The Official Implementation of “Content-Dependent Fine-Grained Speaker Embedding for Zero-Shot Speaker Adaptation in Text-to-Speech Synth…☆85Updated 2 years ago
- This is the implementation of the paper "Emotion Intensity and its Control for Emotional Voice Conversion".☆88Updated 3 years ago
- Speech Representation Disentanglement with Adversarial Mutual Information Learning for One-shot Voice Conversion (Interspeech 2022)☆116Updated last year
- An implementation of Microsoft's "AdaSpeech: Adaptive Text to Speech for Custom Voice"☆97Updated 2 years ago
- CVC: Contrastive Learning for Non-parallel Voice Conversion (INTERSPEECH 2021, in PyTorch)☆57Updated 2 years ago
- This is the implementation of our Interspeech 2021 paper: Limited data emotional voice conversion leveraging text-to-speech: two-stage se…☆84Updated 2 years ago
- ☆29Updated 3 years ago
- ☆117Updated 2 years ago
- ☆65Updated last year
- Official implementation of SpeechSplit2☆132Updated 2 years ago
- Emotion Rendering for Conversational Speech Synthesis with Heterogeneous Graph-Based Context Modeling (Accepted by AAAI'2024)☆56Updated 10 months ago
- ☆72Updated 3 months ago
- ☆64Updated last year
- The official PyTorch implementation of "Inter-SubNet: Speech Enhancement with Subband Interaction", accepted by ICASSP 2023.☆96Updated last year
- Official Implement of Multi-Stage Multi-Codebook (MSMC) TTS☆163Updated last year
- TriAAN-VC: Triple Adaptive Attention Normalization for Any-to-Any Voice Conversion☆146Updated last year
- Official implementation of "Automatic Tuning of Loss Trade-offs without Hyper-parameter Search in End-to-End Zero-Shot Speech Synthesis",…☆79Updated last year
- INTERSPEECH 2023: "DPHuBERT: Joint Distillation and Pruning of Self-Supervised Speech Models"☆112Updated last year
- Using joint training speaker encoder with consistency loss to achieve cross-lingual voice conversion and expressive voice conversion☆144Updated last year
- Official implementation for the paper Fine-grained style control in transformer-based text-to-speech synthesis.☆88Updated 3 years ago
- Unofficial pytorch reproduction for the paper "Utilizing Neural Transducers for Two-Stage Text-to-Speech via Semantic Token Prediction" (…☆61Updated last year
- Code for vec2wav 2.0, a speech token vocoder for VC. Paper: https://arxiv.org/abs/2409.01995☆76Updated 4 months ago
- ☆65Updated 7 months ago
- MANNER: Multi-view Attention Network for Noise ERasure (Speech enhancement in time-domain)☆60Updated 2 years ago
- This is the code for controllable EVC framework for seen and unseen emotion generation.☆43Updated 3 years ago
- ☆55Updated last year
- Official implementation for the paper: A Unified One-Shot Prosody and Speaker Conversion System with Self-Supervised Discrete Speech Unit…☆78Updated 2 years ago
- Evaluation and Benchmarking of Speech Super-resolution Methods☆149Updated 2 years ago
- Joint CTC-S2S Phoneme-level ASR for Voice Conversion and TTS (Text-Mel Alignment)☆119Updated 2 years ago
- This is the implementation our Interspeech 2022 paper " Disentanglement of Emotional Style and Speaker Identity for Expressive Voice Conv…☆18Updated last year