lyndonzheng/CVQ-VAE

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/lyndonzheng/CVQ-VAE)

lyndonzheng / CVQ-VAE

[ICCV 2023] Online Clustered Codebook

☆189

Alternatives and similar repositories for CVQ-VAE

Users that are interested in CVQ-VAE are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

magic-research / vector_quantization
View on GitHub
[NeurIPS 2024] Image Understanding Makes for A Good Tokenizer for Image Generation
☆21Dec 17, 2024Updated last year
youngsheen / SimVQ
View on GitHub
[ICCV 2025] SimVQ: Addressing Representation Collapse in Vector Quantized Models with One Linear Layer
☆328Dec 29, 2024Updated last year
minyoungg / vqtorch
View on GitHub
☆145Feb 27, 2024Updated 2 years ago
lucidrains / vector-quantize-pytorch
View on GitHub
Vector (and Scalar) Quantization, in Pytorch
☆3,992Jul 20, 2026Updated last week
innnky / descript-audio-vae
View on GitHub
VAE modified from Descript Audio Codec, which replaces the RVQ with VAE
☆92Apr 2, 2024Updated 2 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
ziplab / SN-Netv2
View on GitHub
[ECCV 2024] This is the official implementation of "Stitched ViTs are Flexible Vision Backbones".
☆29Jan 23, 2024Updated 2 years ago
lucadellalib / discrete-wavlm-codec
View on GitHub
A neural speech codec based on discrete WavLM representations
☆26Aug 28, 2024Updated last year
ai-forever / MoVQGAN
View on GitHub
MoVQGAN - model for the image encoding and reconstruction
☆266Oct 31, 2023Updated 2 years ago
yangdongchao / SimpleSpeech
View on GitHub
The open source code for SimpleSpeech series
☆147Oct 8, 2024Updated last year
zh460045050 / VQGAN-LC
View on GitHub
☆145Jun 28, 2024Updated 2 years ago
thuanz123 / enhancing-transformers
View on GitHub
An unofficial implementation of both ViT-VQGAN and RQ-VAE in Pytorch
☆324Apr 7, 2025Updated last year
sony / sqvae
View on GitHub
Pytorch implementation of stochastically quantized variational autoencoder (SQ-VAE)
☆196Jul 20, 2022Updated 4 years ago
modelscope / FunCodec
View on GitHub
FunCodec is a research-oriented toolkit for audio quantization and downstream applications, such as text-to-speech synthesis, music gener…
☆445Jan 25, 2024Updated 2 years ago
yoongi43 / VRVQ
View on GitHub
Implementation of the paper "Variable Bitrate Residual Vector Quantization for Audio Coding"
☆11Apr 10, 2025Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
qiuk2 / AAR
View on GitHub
[Official Implementation] Acoustic Autoregressive Modeling 🔥
☆74Aug 24, 2024Updated last year
exercise-book-yq / Supercodec
View on GitHub
☆51Mar 5, 2026Updated 4 months ago
zhaoyue-zephyrus / npq-vit
View on GitHub
[ICLR 2025] Binary Spherical Quantization + [CVPR 2026] Leech Spherical Quantization
☆222Dec 18, 2025Updated 7 months ago
yangdongchao / LLM-Codec
View on GitHub
The open source code for LLM-Codec
☆147Aug 18, 2024Updated last year
CrossmodalGroup / DynamicVectorQuantization
View on GitHub
Official Pytorch Implementation of Our CVPR2023 Paper: "Towards Accurate Image Coding: Improved Autoregressive Image Generation with Dyna…
☆194Jul 23, 2023Updated 3 years ago
adobe-research / ImageFolder
View on GitHub
☆20Dec 8, 2024Updated last year
CUC-MIPG / VQGAN-Compression
View on GitHub
Extreme Image Compression using Fine-tuned VQGAN Models (DCC 2024)
☆24Jan 14, 2025Updated last year
Andong-Li-speech / BridgeVoC
View on GitHub
This is the repository for the work "BridgeVoC: Revitalizing Neural Vocoder from a Restoration Perspective".
☆67Nov 5, 2025Updated 8 months ago
kakaobrain / rq-vae-transformer
View on GitHub
The official implementation of Autoregressive Image Generation using Residual Quantization (CVPR '22)
☆1,028Jan 3, 2024Updated 2 years ago
End-to-end encrypted cloud storage - Proton Drive • Ad
Special offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
SilentView / GigaTok
View on GitHub
[ICCV 2025] Official repo for "GigaTok: Scaling Visual Tokenizers to 3 Billion Parameters for Autoregressive Image Generation"
☆204Jan 7, 2026Updated 6 months ago
Ereboas / MagiCodec
View on GitHub
A single-layer, streaming codec model providing SOTA audio quality and discrete tokens designed for superior downstream modelability.
☆125Jun 4, 2025Updated last year
jiasenlu / vit-vqgan-jax
View on GitHub
Jax implementation of VIT-VQGAN
☆10Jan 25, 2024Updated 2 years ago
yangdongchao / ALMTokenizer2
View on GitHub
The open source code of ALMTokenizer2: Towards Low bit-rate and Semantic-rich Audio Tokenizer with Flow-based Scalar Diffusion Transforme…
☆45Sep 5, 2025Updated 10 months ago
wu-zhonghua / DAT
View on GitHub
☆18Oct 4, 2022Updated 3 years ago
b04901014 / UUVC
View on GitHub
Official implementation for the paper: A Unified One-Shot Prosody and Speaker Conversion System with Self-Supervised Discrete Speech Unit…
☆83Jan 7, 2023Updated 3 years ago
zhenye234 / FlashSpeech
View on GitHub
ACM MM 2024 FlashSpeech: Efficient Zero-Shot Speech Synthesis
☆156Sep 20, 2024Updated last year
richardbaihe / a3t
View on GitHub
Code for paper A3T: Alignment-Aware Acoustic and Text Pretraining for Speech Synthesis and Editing
☆89Sep 6, 2024Updated last year
zhai-lw / SQCodec
View on GitHub
A lightweight audio codec based on a single quantizer
☆72Aug 15, 2025Updated 11 months ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
lxa9867 / ImageFolder
View on GitHub
High-performance Image Tokenizers for VAR and AR
☆307Apr 25, 2025Updated last year
yerfor / SyntaSpeech
View on GitHub
SyntaSpeech: Syntax-aware Generative Adversarial Text-to-Speech; IJCAI 2022; Official code
☆201Sep 4, 2022Updated 3 years ago
kvfrans / jax-vqvae-vqgan
View on GitHub
JAX implementation of VQVAE/VQGAN autoencoders (+FSQ)
☆42Jun 6, 2024Updated 2 years ago
beiluo97 / HFLIC
View on GitHub
Official code for Human Friendly Perceptual Learned Image Compression with Reinforced Transform and Unofficial Implementation of papar "P…
☆24Aug 10, 2023Updated 2 years ago
yuan1615 / AdaVocoder
View on GitHub
Adaptive Vocoder for Custom Voice
☆61Sep 22, 2022Updated 3 years ago
huggingface / open-muse
View on GitHub
Open reproduction of MUSE for fast text2image generation.
☆358Jun 1, 2024Updated 2 years ago
haiciyang / LaDiffCodec
View on GitHub
ICASSP 2024 - Generative De-Quantization for Neural Speech Codec via Latent Diffusion.
☆56Nov 16, 2025Updated 8 months ago