samsad35 / source-filter-vae
Learning and controlling the source-filter representation of speech with a variational autoencoder
☆45 Updated last year
Alternatives and similar repositories for source-filter-vae:
Users who are interested in source-filter-vae are comparing it to the libraries listed below
- ☆60 Updated last year
- Speech Parameter Estimation Using Differentiable Speech Synthesizer ☆44 Updated last year
- Source code and demo for INTERSPEECH 2023 paper: DuTa-VC: A Duration-aware Typical-to-atypical Voice Conversion Approach with Diffusion P… ☆36 Updated last year
- ☆21 Updated last year
- ☆24 Updated 2 years ago
- Pytorch implementation of LearnableUpsamplingLayer (NaturalSpeech, Tan et al., 2022) ☆54 Updated 11 months ago
- ☆29 Updated 3 months ago
- Implementation of SpatialCodec. ☆55 Updated last year
- This is the implementation of our Interspeech 2022 paper "Disentanglement of Emotional Style and Speaker Identity for Expressive Voice Conv… ☆18 Updated last year
- ☆87 Updated 2 years ago
- ☆59 Updated 4 months ago
- LVCNet: Efficient Condition-Dependent Modeling Network for Waveform Generation ☆80 Updated 4 years ago
- ☆48 Updated 2 years ago
- Official repo of ICASSP 2024 paper - Generative De-Quantization for Neural Speech Codec via Latent Diffusion. ☆49 Updated last month
- Training code and trained checkpoints for ASGAN. ☆62 Updated last year
- An ODE-based generative neural vocoder using Rectified Flow ☆60 Updated last year
- ☆64 Updated last year
- UnivNet: A Neural Vocoder with Multi-Resolution Spectrogram Discriminators for High-Fidelity Waveform Generation ☆74 Updated 3 years ago
- Repo for source code of EBEN: Extreme Bandwidth Extension Network ☆72 Updated last month
- Official Pytorch implementation of PULSE: Positive–Unlabelled Learning for audio Signal Enhancement (Best Paper Award at ICASSP 2023) ☆41 Updated last year
- An unofficial implementation of "UniCATS: A Unified Context-Aware Text-to-Speech Framework with Contextual VQ-Diffusion and Vocoding". ☆24 Updated last year
- ☆32 Updated last year
- Please visit https://thuhcsi.github.io/SnakeGAN/ ☆36 Updated last year
- ☆25 Updated 6 months ago
- HiFi++: a Unified Framework for Bandwidth Extension and Speech Enhancement (ICASSP 2023) ☆78 Updated last year
- ☆64 Updated 2 years ago
- ☆44 Updated last year
- Speech Human Evaluation Estimation Toolkit (SHEET) ☆52 Updated 3 months ago
- Ultra-low-bitrate Speech Codec for Speech Language Modeling Applications ☆79 Updated 2 months ago
- Viterbi decoding in PyTorch ☆27 Updated this week