JunwenBai / C-DSVAEView external linksLinks
Contrastively Disentangled Sequential Variational Audoencoder
☆48Oct 14, 2024Updated last year
Alternatives and similar repositories for C-DSVAE
Users that are interested in C-DSVAE are comparing it to the libraries listed below
Sorting:
- ☆18Jan 17, 2022Updated 4 years ago
- Official Code Implementation for 'A Simple Early Exiting Framework for Accelerated Sampling in Diffusion Models'☆20Jul 24, 2024Updated last year
- Implementation of multi-level Contrastive Predictive Coding (CPC) methods☆20Jan 12, 2023Updated 3 years ago
- Official implementation of OSSGAN [CVPR 2022]☆21May 2, 2022Updated 3 years ago
- ☆10Apr 8, 2024Updated last year
- Implementation of the paper "Variable Bitrate Residual Vector Quantization for Audio Coding"☆11Apr 10, 2025Updated 10 months ago
- ☆25Mar 12, 2022Updated 3 years ago
- BAD-VAE: A VAE framework for unsupervised disentanglement of sequential data☆12May 25, 2022Updated 3 years ago
- ☆33Jan 14, 2023Updated 3 years ago
- This is the repository for the work "BridgeVoC: Revitalizing Neural Vocoder from a Restoration Perspective".☆63Nov 5, 2025Updated 3 months ago
- A toolset for easy formant extraction and visualization from wav files and TTS models☆33Sep 2, 2022Updated 3 years ago
- ☆61Oct 28, 2024Updated last year
- My vocoder experiments☆31Jul 26, 2025Updated 6 months ago
- Vocoder NSF-HiFiGAN (Moved into deepaudio)☆56Dec 11, 2022Updated 3 years ago
- Demo for 2022 Interspeech☆29Jun 14, 2022Updated 3 years ago
- ☆16Apr 4, 2022Updated 3 years ago
- Demo for 2022 ICASSP☆64Jun 14, 2022Updated 3 years ago
- ☆121Oct 24, 2022Updated 3 years ago
- Training code and trained checkpoints for ASGAN.☆62Dec 27, 2023Updated 2 years ago
- Code for "Distribution-based Emotion Recognition in Conversation"☆19Feb 6, 2023Updated 3 years ago
- ☆23Jun 13, 2023Updated 2 years ago
- LibriVoc is a new open-source, large-scale dataset for vocoder artifact detection. LibriVoc is derived from the LibriTTS speech corpus, w…☆16Nov 6, 2025Updated 3 months ago
- ☆37May 8, 2021Updated 4 years ago
- Official Implementation for the paper: A Variational Framework for Improving Naturalness in Generative Spoken Language Models☆22Jun 18, 2025Updated 7 months ago
- Speech Parameter Estimation Using Differentiable Speech Synthesizer☆44May 9, 2023Updated 2 years ago
- Parallel waveform generation with DiffusionGAN☆17Mar 26, 2022Updated 3 years ago
- Fast Inference in Denoising Diffusion Models via MMD Finetuning☆18Dec 4, 2023Updated 2 years ago
- ☆18Jul 31, 2019Updated 6 years ago
- Variations of L1 SNR Loss function for training audio source separation machine learning models☆44Feb 4, 2026Updated last week
- ☆69Mar 31, 2021Updated 4 years ago
- Implementation for paper "Disentangled Speech Representation Learning for One-Shot Cross-Lingual Voice Conversion Using ß-VAE"☆44Apr 10, 2023Updated 2 years ago
- LVCNet: Efficient Condition-Dependent Modeling Network for Waveform Generation☆80Feb 24, 2021Updated 4 years ago
- Public Code for the paper MAE-AST: Masked Autoencoding Audio Spectrogram Transformer☆90Jun 9, 2022Updated 3 years ago
- ☆47Aug 31, 2024Updated last year
- ☆49Apr 1, 2025Updated 10 months ago
- ☆18Feb 9, 2020Updated 6 years ago
- Implementation of the DIVA model of speech acquisition and production using PyTorch☆22Jan 18, 2023Updated 3 years ago
- ☆21Apr 24, 2025Updated 9 months ago
- This is the official implementation of our multi-channel multi-speaker multi-spatial neural audio codec architecture.☆51Mar 17, 2025Updated 11 months ago