JiwanSeo / RAQ-VAE
Rate-Adaptive Quantization: A Multi-Rate Codebook Adaptation for Vector Quantization-based Generative Models
☆10Updated 3 months ago
Alternatives and similar repositories for RAQ-VAE
Users that are interested in RAQ-VAE are comparing it to the libraries listed below
Sorting:
- TensorFlow implementation of "Finite Scalar Quantization: VQ-VAE Made Simple" (ICLR 2024)☆17Updated last year
- Official Code Implementation for 'A Simple Early Exiting Framework for Accelerated Sampling in Diffusion Models'☆19Updated 9 months ago
- ☆11Updated last year
- An official implementation of "Deep Joint Source-Channel Coding with Iterative Source Error Correction"☆19Updated 2 years ago
- Source code for DM-Codec.☆41Updated 6 months ago
- DiffPhase: Generative Diffusion-based STFT Phase Retrieval☆14Updated last year
- Source code of the paper titled "Digital Semantic Communications: An Alternating Multi-Phase Training Strategy with Mask Attack"☆14Updated 8 months ago
- ☆21Updated 4 months ago
- Public repository for the ICLR'23 paper "Few-shot domain adaptation for end-to-end communication"☆9Updated 2 years ago
- ☆60Updated 6 months ago
- Diffusion Models for Audio Semantic Communication☆14Updated last year
- A spoken version of the textual story cloze benchmark☆17Updated last year
- Implementation of the transformer proposed in "Building Blocks for a Complex-Valued Transformer Architecture"☆74Updated last year
- ☆23Updated 7 months ago
- ESLTTS dataset☆16Updated 3 months ago
- A lightweight audio codec based on a single quantizer☆58Updated last month
- The open source implementation of the cross attention mechanism from the paper: "JOINTLY TRAINING LARGE AUTOREGRESSIVE MULTIMODAL MODELS"☆28Updated last year
- Unofficial implementation of ResGrad: Residual Denoising Diffusion Probabilistic Models for Text to Speech☆16Updated 3 months ago
- Implementation of O-OFDMNet, a deep learning-based optical OFDM system☆11Updated 3 years ago
- Interface Design for Self-Supervised Speech Models, Accepted to Interspeech2024☆15Updated 5 months ago
- [Official Implementation] Acoustic Autoregressive Modeling 🔥☆67Updated 8 months ago
- Implementation of the paper "Variable Bitrate Residual Vector Quantization for Audio Coding"☆13Updated last month
- Official Pytorch Implementation of Our CVPR2023 Paper: "Not All Image Regions Matter: Masked Vector Quantization for Autoregressive Image…☆57Updated last year
- AudioBERT 📢 : Audio Knowledge Augmented Language Model (ICASSP 2025)☆41Updated 3 months ago
- The demo page for ALMTokenizer☆48Updated last month
- ☆20Updated 3 years ago
- Implementation of Multi-Source Music Generation with Latent Diffusion.☆24Updated 8 months ago
- ☆27Updated last year
- My attempts at applying Soundstream design on learned tokenization of text and then applying hierarchical attention to text generation☆86Updated 7 months ago
- Official PyTorch implementation of "EdVAE: Mitigating Codebook Collapse with Evidential Discrete Variational Autoencoders"☆11Updated 7 months ago