gustavo-beck / wavebender-gan
☆22Updated last year
Related projects: ⓘ
- FCTalker: Fine and Coarse Grained Context Modeling for Expressive Conversational Speech Synthesis (Accepted by ISCSLP'2024)☆21Updated 6 months ago
- Training code and trained checkpoints for ASGAN.☆60Updated 8 months ago
- ☆28Updated this week
- Official repository for NAST: Noise Aware Speech Tokenization for Speech Language Models (Interspeech 2024) https://arxiv.org/abs/2406.11…☆40Updated 2 months ago
- ☆13Updated 9 months ago
- Learning and controlling the source-filter representation of speech with a variational autoencoder☆45Updated last year
- Torch implementation of Whisper-guided DDPM based Voice Conversion☆49Updated last year
- ☆25Updated this week
- Viterbi decoding in PyTorch☆23Updated 3 weeks ago
- This is the implementation our Interspeech 2022 paper " Disentanglement of Emotional Style and Speaker Identity for Expressive Voice Conv…☆16Updated last year
- ☆18Updated 3 months ago
- NU-Wave: A Diffusion Probabilistic Model for Neural Audio Upsampling☆37Updated 3 years ago
- A spoken version of the textual story cloze benchmark☆12Updated last year
- The implementation of paper "SpeechTripleNet: End-to-End Disentangled Speech Representation Learning for Content, Timbre and Prosody"☆27Updated 9 months ago
- ☆15Updated 3 years ago
- [ACL 2024] This is the Pytorch code for our paper "StyleDubber: Towards Multi-Scale Style Learning for Movie Dubbing"☆32Updated last month
- An implementation of Charactr, Inc's "WavThruVec: Latent speech representation as intermediate features for neural speech synthesis"☆24Updated last year
- This repo contains the official PyTorch implementation of "Analyzing Discrete Self Supervised Speech Representation For Spoken Language M…☆17Updated last year
- ☆25Updated last year
- A unified model for zero-shot singing voice conversion and synthesis☆21Updated last year
- ☆24Updated 2 months ago
- Speech Parameter Estimation Using Differentiable Speech Synthesizer☆44Updated last year
- ☆24Updated 2 years ago
- Source code and demo for INTERPSEECH 2023 paper: DuTa-VC: A Duration-aware Typical-to-atypical Voice Conversion Approach with Diffusion P…☆33Updated 9 months ago
- ☆26Updated this week
- ☆21Updated last year
- GPT for FACodec☆13Updated 5 months ago
- Codebase and project page for EDMSound☆29Updated 9 months ago
- PyTorch implementation of the ICASSP-24 paper: "Improving Audio Captioning Models with Fine-grained Audio Features, Text Embedding Superv…☆29Updated 8 months ago
- 60k hours of phoneme-aligned audio from audio books☆18Updated last month