JiwanSeo / RAQ-VAELinks
Rate-Adaptive Quantization: A Multi-Rate Codebook Adaptation for Vector Quantization-based Generative Models
☆15Updated 3 months ago
Alternatives and similar repositories for RAQ-VAE
Users that are interested in RAQ-VAE are comparing it to the libraries listed below
Sorting:
- ☆14Updated 2 years ago
- An official implementation of "Deep Joint Source-Channel Coding with Iterative Source Error Correction"☆22Updated 2 years ago
- Latent Diffusion Model-Enabled Low-Latency Semantic Communication in the Presence of Semantic Ambiguities and Wireless Channel Noises☆17Updated last year
- Implementation of O-OFDMNet, a deep learning-based optical OFDM system☆11Updated 3 years ago
- Code for "Diffusion-Aided Joint Source Channel Coding For High Realism Wireless Image Transmission", IEEE TMLCN☆28Updated 4 months ago
- This is a pytorch implementation of digital semantic communication.☆18Updated 11 months ago
- The official deployment of MambaJSCC in pytorch☆17Updated 3 months ago
- Official Code Implementation for 'A Simple Early Exiting Framework for Accelerated Sampling in Diffusion Models'☆20Updated last year
- This is the code of the paper "SpectrumFM: A Foundation Model for Intelligent Spectrum Management"☆20Updated last month
- ☆23Updated 11 months ago
- TensorFlow implementation of "Finite Scalar Quantization: VQ-VAE Made Simple" (ICLR 2024)☆21Updated 2 years ago
- DiffPhase: Generative Diffusion-based STFT Phase Retrieval☆16Updated 2 years ago
- ☆28Updated last year
- ☆19Updated 4 years ago
- Implementation of the transformer proposed in "Building Blocks for a Complex-Valued Transformer Architecture"☆86Updated 2 years ago
- This repository contains the code for implementing the algorithms in the paper "Semantics-Guided Diffusion for Deep Joint Source-Channel …☆25Updated 8 months ago
- Official implementation of INTERSPECCH 2022 Radio2Speech: High Quality Speech Recovery from Radio Frequency Signals☆15Updated 2 months ago
- Towards Real-Time Practical Image Compression with Lightweight Attention☆11Updated last year
- A spoken version of the textual story cloze benchmark☆19Updated 2 years ago
- ☆25Updated 2 years ago
- ☆16Updated last year
- Conformer block with Rotary Position Embedding, modified from lucidrains' implement☆16Updated last year
- LAFMA: A Latent Flow Matching Model for Text-to-Audio Generation (INTERSPEECH 2024)☆42Updated last year
- Code for INTERSPEECH 2023 paper "mdctGAN: Taming transformer-based GAN for speech super-resolution with Modified DCT spectra"☆62Updated 2 years ago
- AudioBERT 📢 : Audio Knowledge Augmented Language Model (ICASSP 2025)☆41Updated 10 months ago
- ☆85Updated 2 years ago
- Source code of the paper titled "Digital Semantic Communications: An Alternating Multi-Phase Training Strategy with Mask Attack"☆14Updated 2 months ago
- [InterSpeech'2024] FluentEditor:Text-based Speech Editing by Considering Acoustic and Prosody Consistency☆59Updated last year
- The official code for the SALMon🍣 benchmark (ICASSP 2025 - Oral)☆47Updated 3 months ago
- A lightweight audio codec based on a single quantizer☆65Updated 3 months ago