JiwanSeo / RAQ-VAELinks
Rate-Adaptive Quantization: A Multi-Rate Codebook Adaptation for Vector Quantization-based Generative Models
☆13Updated 6 months ago
Alternatives and similar repositories for RAQ-VAE
Users that are interested in RAQ-VAE are comparing it to the libraries listed below
Sorting:
- TensorFlow implementation of "Finite Scalar Quantization: VQ-VAE Made Simple" (ICLR 2024)☆20Updated last year
- Official implementation of INTERSPECCH 2022 Radio2Speech: High Quality Speech Recovery from Radio Frequency Signals☆12Updated 9 months ago
- ☆11Updated last year
- An official implementation of "Deep Joint Source-Channel Coding with Iterative Source Error Correction"☆22Updated 2 years ago
- Official Code Implementation for 'A Simple Early Exiting Framework for Accelerated Sampling in Diffusion Models'☆19Updated last year
- DiffPhase: Generative Diffusion-based STFT Phase Retrieval☆16Updated last year
- Implementation of the transformer proposed in "Building Blocks for a Complex-Valued Transformer Architecture"☆81Updated last year
- Implementation of O-OFDMNet, a deep learning-based optical OFDM system☆11Updated 3 years ago
- https://arxiv.org/abs/2111.00195☆15Updated 3 years ago
- Pytorch implementation of stochastically quantized variational autoencoder (SQ-VAE)☆190Updated 3 years ago
- A spoken version of the textual story cloze benchmark☆18Updated 2 years ago
- [SpeechCom Journal] Learning and controlling the source-filter representation of speech with a variational autoencoder☆44Updated 2 years ago
- Source code of the paper titled "Digital Semantic Communications: An Alternating Multi-Phase Training Strategy with Mask Attack"☆14Updated last year
- ☆27Updated last year
- Code for "Diffusion-Aided Joint Source Channel Coding For High Realism Wireless Image Transmission", Arxiv☆23Updated last month
- ☆25Updated 2 years ago
- PyTorch implementation of Conformer: Convolution-augmented Transformer for Speech Recognition☆15Updated 4 years ago
- ☆31Updated last year
- The official deployment of MambaJSCC in pytorch☆13Updated 4 months ago
- Official Pytorch Implementation of Our CVPR2023 Paper: "Not All Image Regions Matter: Masked Vector Quantization for Autoregressive Image…☆61Updated 2 years ago
- [NeurIPS 2022] "Losses Can Be Blessings: Routing Self-Supervised Speech Representations Towards Efficient Multilingual and Multitask Spee…☆17Updated last year
- Conformer block with Rotary Position Embedding, modified from lucidrains' implement☆15Updated 11 months ago
- Code for "End-to-End Optimized Speech Coding with Deep Neural Networks" (ICASSP 2018)☆24Updated 7 years ago
- This repository contains the code for implementing the algorithms in the paper "Semantics-Guided Diffusion for Deep Joint Source-Channel …☆19Updated 4 months ago
- Speech enhancement in noisy and reverberant environments using deep neural networks☆22Updated 2 weeks ago
- Source code for DM-Codec.☆47Updated 2 months ago
- AudioBERT 📢 : Audio Knowledge Augmented Language Model (ICASSP 2025)☆41Updated 6 months ago
- ☆84Updated 2 years ago
- ☆15Updated last year
- (Interspeech 2023 & ICASSP 2024) Official repository for ARMHuBERT and STaRHuBERT☆40Updated 11 months ago