JiwanSeo / RAQ-VAE
Rate-Adaptive Quantization: A Multi-Rate Codebook Adaptation for Vector Quantization-based Generative Models
☆13Updated 5 months ago
Alternatives and similar repositories for RAQ-VAE
Users that are interested in RAQ-VAE are comparing it to the libraries listed below
- Implementation of the transformer proposed in "Building Blocks for a Complex-Valued Transformer Architecture"☆81Updated last year
- ☆11Updated last year
- PyTorch implementation of Swin Transformer for 1-dimensional data☆13Updated last year
- An official implementation of "Deep Joint Source-Channel Coding with Iterative Source Error Correction"☆20Updated 2 years ago
- DiffPhase: Generative Diffusion-based STFT Phase Retrieval☆15Updated last year
- Official implementation of Radio2Speech: High Quality Speech Recovery from Radio Frequency Signals☆12Updated 8 months ago
- Official Code Implementation for 'A Simple Early Exiting Framework for Accelerated Sampling in Diffusion Models'☆19Updated last year
- A spoken version of the textual story cloze benchmark☆18Updated last year
- Learning and controlling the source-filter representation of speech with a variational autoencoder☆45Updated 2 years ago
- TensorFlow implementation of "Finite Scalar Quantization: VQ-VAE Made Simple" (ICLR 2024)☆19Updated last year
- Official PyTorch implementation of "EdVAE: Mitigating Codebook Collapse with Evidential Discrete Variational Autoencoders"☆11Updated 10 months ago
- Unofficial implementation of ResGrad: Residual Denoising Diffusion Probabilistic Models for Text to Speech☆17Updated 5 months ago
- Source code for DM-Codec.☆45Updated last month
- Implementation of Multi-Source Music Generation with Latent Diffusion.☆24Updated 10 months ago
- Code for "End-to-End Optimized Speech Coding with Deep Neural Networks" (ICASSP 2018)☆24Updated 7 years ago
- ☆14Updated last year
- Speech enhancement in noisy and reverberant environments using deep neural networks☆21Updated last month
- ☆84Updated 2 years ago
- An ODE-based generative neural vocoder using Rectified Flow☆59Updated 2 years ago
- ☆20Updated 4 years ago
- Official PyTorch implementation for Maximum Likelihood Training of Implicit Nonlinear Diffusion Model (INDM) in NeurIPS 2022.☆40Updated last year
- This repo contains the official PyTorch implementation of AudioToken: Adaptation of Text-Conditioned Diffusion Models for Audio-to-Image …☆84Updated last year
- ☆25Updated 2 years ago
- Implementation of O-OFDMNet, a deep learning-based optical OFDM system☆11Updated 3 years ago
- Towards High-Quality and Efficient Speech Bandwidth Extension with Parallel Amplitude and Phase Prediction☆12Updated last year
- A lightweight audio codec based on a single quantizer☆64Updated 3 months ago
- ☆61Updated 8 months ago
- Solving Inverse Problems with Diffusion Optimal Control [NeurIPS 2024]☆12Updated 7 months ago
- Conformer block with Rotary Position Embedding, modified from lucidrains' implementation☆14Updated 10 months ago
- Deep Learning Model for Signal Data☆88Updated 5 years ago