gustavo-beck / wavebender-gan
☆22Updated 2 years ago
Alternatives and similar repositories for wavebender-gan:
Users who are interested in wavebender-gan are comparing it to the libraries listed below.
- This is the implementation of our Interspeech 2022 paper "Disentanglement of Emotional Style and Speaker Identity for Expressive Voice Conv…☆18Updated last year
- FCTalker: Fine and Coarse Grained Context Modeling for Expressive Conversational Speech Synthesis (Accepted by ISCSLP'2024)☆22Updated 10 months ago
- Learning and controlling the source-filter representation of speech with a variational autoencoder☆45Updated last year
- An implementation of Charactr, Inc's "WavThruVec: Latent speech representation as intermediate features for neural speech synthesis"☆28Updated last year
- Sequence alignment methods with helpers for PyTorch.☆24Updated 2 years ago
- ☆16Updated 2 years ago
- Training code and trained checkpoints for ASGAN.☆62Updated last year
- ESLTTS dataset☆16Updated 6 months ago
- A unified model for zero-shot singing voice conversion and synthesis☆21Updated 2 years ago
- PyTorch implementation of the ICASSP-24 paper: "Improving Audio Captioning Models with Fine-grained Audio Features, Text Embedding Superv…☆36Updated last year
- GPT for FACodec☆13Updated 9 months ago
- Official repository for NAST: Noise Aware Speech Tokenization for Speech Language Models (Interspeech 2024) https://arxiv.org/abs/2406.11…☆44Updated 6 months ago
- ☆15Updated 3 years ago
- ☆24Updated last year
- SLMGAN: Exploiting Speech Language Model Representations for Unsupervised Zero-Shot Voice Conversion in GANs☆15Updated last year
- [InterSpeech'2024] FluentEditor: Text-based Speech Editing by Considering Acoustic and Prosody Consistency☆49Updated 2 months ago
- Speech enhancement in noisy and reverberant environments using deep neural networks☆19Updated 3 months ago
- ☆13Updated last year
- A spoken version of the textual story cloze benchmark☆14Updated last year
- Speech Parameter Estimation Using Differentiable Speech Synthesizer☆44Updated last year
- NU-Wave: A Diffusion Probabilistic Model for Neural Audio Upsampling☆37Updated 3 years ago
- ☆30Updated 2 years ago
- ☆35Updated 4 months ago
- Source code and demo for INTERSPEECH 2023 paper: DuTa-VC: A Duration-aware Typical-to-atypical Voice Conversion Approach with Diffusion P…☆35Updated last year
- ☆25Updated last year
- **ICASSP 2022** 《Toward Degradation-Robust Voice Conversion》 Using speech enhancement and end-to-end denoising training to improve degrada…☆23Updated 2 years ago
- ☆24Updated 2 years ago
- Digital Speech Processing in PyTorch.☆14Updated 2 years ago