Neur-IO/OptVQ

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Neur-IO/OptVQ)

Neur-IO / OptVQ

Towards training VQ-VAE models robustly!

☆94

Alternatives and similar repositories for OptVQ

Users that are interested in OptVQ are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

zh460045050 / VQGAN-LC
View on GitHub
☆145Jun 28, 2024Updated 2 years ago
huang-yh / Terra
View on GitHub
☆31Oct 17, 2025Updated 9 months ago
YuqingWang1029 / TokenBridge
View on GitHub
[ICCV2025] TokenBridge: Bridging Continuous and Discrete Tokens for Autoregressive Visual Generation. https://yuqingwang1029.github.io/To…
☆158Jul 24, 2025Updated 11 months ago
markweberdev / maskbit
View on GitHub
Implementation of the paper "MaskBit: Embedding-free Image Generation from Bit Tokens"
☆94Apr 10, 2025Updated last year
DAMO-NLP-SG / DiGIT
View on GitHub
[NeurIPS 2024] Stabilize the Latent Space for Image Autoregressive Modeling: A Unified Perspective
☆78Oct 31, 2024Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
TencentARC / SEED-Voken
View on GitHub
SEED-Voken: A Series of Powerful Visual Tokenizers
☆1,017Nov 25, 2025Updated 7 months ago
youngsheen / SimVQ
View on GitHub
[ICCV 2025] SimVQ: Addressing Representation Collapse in Vector Quantized Models with One Linear Layer
☆327Dec 29, 2024Updated last year
ChrisDong-THU / GaussianToken
View on GitHub
Official PyTorch implementation of GaussianToken: An Effective Image Tokenizer with 2D Gaussian Splatting
☆107Apr 3, 2025Updated last year
turingmotors / One-D-Piece
View on GitHub
[ICML 2025 Tokshop] One-D-Piece: Image Tokenizer Meets Quality-Controllable Compression
☆81Jul 30, 2025Updated 11 months ago
chenllliang / DnD-Transformer
View on GitHub
[ICLR 2025] Source code for paper "A Spark of Vision-Language Intelligence: 2-Dimensional Autoregressive Transformer for Efficient Finegr…
☆80Dec 10, 2024Updated last year
chen-wl20 / SceneCompleter
View on GitHub
SceneCompleter: Dense 3D Scene Completion for Generative Novel View Synthesis
☆36Jun 13, 2025Updated last year
VisionXLab / AdapTok
View on GitHub
[CVPR'26] AdapTok: Learning Adaptive and Temporally Causal Video Tokenization in a 1D Latent Space
☆28Mar 15, 2026Updated 4 months ago
HelmholtzAI-FZJ / flex_gen
View on GitHub
☆20Jan 10, 2025Updated last year
LTH14 / mar
View on GitHub
PyTorch implementation of MAR+DiffLoss https://arxiv.org/abs/2406.11838
☆1,942Feb 20, 2026Updated 5 months ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
wangck20 / V2M
View on GitHub
☆27Oct 15, 2024Updated last year
WalterSimoncini / no-train-all-gain
View on GitHub
Code for the paper "No Train, all Gain: Self-Supervised Gradients Improve Deep Frozen Representations"
☆11Oct 31, 2024Updated last year
bytedance / 1d-tokenizer
View on GitHub
This repo contains the code for 1D tokenizer and generator
☆1,166Mar 20, 2025Updated last year
lxa9867 / ImageFolder
View on GitHub
High-performance Image Tokenizers for VAR and AR
☆307Apr 25, 2025Updated last year
lisiyao21 / Half-Physics
View on GitHub
Code for paper "Half-Physics: Enabling Kinematic 3D Human Model with Physical Interactions". Coming soon.
☆34Jul 31, 2025Updated 11 months ago
zelaki / eqvae
View on GitHub
[ICML'25] EQ-VAE: Equivariance Regularized Latent Space for Improved Generative Image Modeling.
☆181Mar 18, 2026Updated 4 months ago
yongliu20 / Awesome-Unified-Understanding-and-Generation
View on GitHub
☆52Aug 22, 2025Updated 11 months ago
AndyYuan96 / YZLFusion
View on GitHub
☆18Oct 6, 2022Updated 3 years ago
Euphoria16 / TL-Align
View on GitHub
[ICCV 2023]The PyTorch implementation of TL-Align: Token-Label Alignment for Vision Transformers.
☆23Jul 16, 2023Updated 3 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
MCG-NJU / DDT
View on GitHub
[CVPR 2026] DDT: Decoupled Diffusion Transformer
☆404May 22, 2026Updated 2 months ago
hywang66 / LARP
View on GitHub
Official Pytorch implementation for LARP: Tokenizing Videos with a Learned Autoregressive Generative Prior (ICLR 2025 Oral).
☆107Feb 11, 2025Updated last year
chen-wl20 / GenWorld
View on GitHub
GenWorld: Towards Detecting AI-generated Real-world Simulation Videos
☆37Jun 13, 2025Updated last year
qihao067 / CrossFlow
View on GitHub
[CVPR2025] PyTorch-based reimplementation of CrossFlow, as proposed in 'Flowing from Words to Pixels: A Noise-Free Framework for Cross-Mo…
☆343Jun 8, 2025Updated last year
shiml20 / FlowTurbo
View on GitHub
[TPAMI 26/ NeurIPS 24] Official PyTorch Implementation of "FlowTurbo: Towards Real-time Flow-Based Image Generation with Velocity Refiner…
☆75Oct 21, 2025Updated 9 months ago
OliverRensu / xAR
View on GitHub
This repository includes the official implementation of our paper "Beyond Next-Token: Next-X Prediction for Autoregressive Visual Generat…
☆251Oct 12, 2025Updated 9 months ago
SilentView / GigaTok
View on GitHub
[ICCV 2025] Official repo for "GigaTok: Scaling Visual Tokenizers to 3 Billion Parameters for Autoregressive Image Generation"
☆204Jan 7, 2026Updated 6 months ago
colaudiolab / AudioSet-R
View on GitHub
Official implementation: "AudioSet-R: A Refined AudioSet with Multi-Stage LLM Label Reannotation"
☆19Oct 9, 2025Updated 9 months ago
Neur-IO / ReVQ
View on GitHub
Explore how to get a VQ-VAE models efficiently!
☆69Jul 24, 2025Updated 11 months ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
wzzheng / GaussianFormer
View on GitHub
Project Page for GaussianFormer
☆24May 30, 2024Updated 2 years ago
huang-yh / SpectralAR
View on GitHub
[ICCV 25]SpectralAR: Spectral Autoregressive Visual Generation
☆36Jun 13, 2025Updated last year
adobe-research / ImageFolder
View on GitHub
☆20Dec 8, 2024Updated last year
ShivamDuggal4 / adaptive-length-tokenizer
View on GitHub
Adaptive Length Image Tokenization via Recurrent Allocation | How many tokens is an image worth ?
☆146Feb 11, 2025Updated last year
OliverRensu / FlowAR
View on GitHub
“FlowAR: Scale-wise Autoregressive Image Generation Meets Flow Matching” FlowAR employs a simplest scale design and is compatible with an…
☆171May 1, 2025Updated last year
CarstenEpic / humos
View on GitHub
Humos paper repository
☆26Sep 6, 2025Updated 10 months ago
Mddct / simple-tts
View on GitHub
（WIP）long form speech generatoins
☆30Apr 2, 2025Updated last year