ali-vilab/alitok

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/ali-vilab/alitok)

ali-vilab / alitok

[ICLR2026] AliTok: Towards Sequence Modeling Alignment between Tokenizer and Autoregressive Model

☆56

Alternatives and similar repositories for alitok

Users that are interested in alitok are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

YuqingWang1029 / TokenBridge
View on GitHub
[ICCV2025] TokenBridge: Bridging Continuous and Discrete Tokens for Autoregressive Visual Generation. https://yuqingwang1029.github.io/To…
☆158Jul 24, 2025Updated last year
ali-vilab / FACM
View on GitHub
FACM: Flow-Anchored Consistency Models
☆147Aug 6, 2025Updated 11 months ago
markweberdev / maskbit
View on GitHub
Implementation of the paper "MaskBit: Embedding-free Image Generation from Bit Tokens"
☆94Apr 10, 2025Updated last year
ali-vilab / ViewPoint
View on GitHub
[NeurIPS 2025] ViewPoint: Panoramic Video Generation with Pretrained Diffusion Models
☆34Jul 1, 2025Updated last year
SilentView / GigaTok
View on GitHub
[ICCV 2025] Official repo for "GigaTok: Scaling Visual Tokenizers to 3 Billion Parameters for Autoregressive Image Generation"
☆204Jan 7, 2026Updated 6 months ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
LINs-lab / UCGM
View on GitHub
[Preprint] UCGM: Unified Continuous Generative Models
☆185May 27, 2025Updated last year
wjf5203 / TokBench
View on GitHub
Image and video Tokenizer/VAE selection guide, text and face reconstruction evaluation.
☆152Jun 11, 2026Updated last month
apple / ml-flextok
View on GitHub
FlexTok: Resampling Images into 1D Token Sequences of Flexible Length
☆322Jun 2, 2025Updated last year
ali-vilab / iv-vae
View on GitHub
☆34Mar 4, 2025Updated last year
b04901014 / vae-gslm
View on GitHub
Official Implementation for the paper: A Variational Framework for Improving Naturalness in Generative Spoken Language Models
☆24Jun 18, 2025Updated last year
yoongi43 / VRVQ
View on GitHub
Implementation of the paper "Variable Bitrate Residual Vector Quantization for Audio Coding"
☆11Apr 10, 2025Updated last year
Hhhhhhao / continuous_tokenizer
View on GitHub
☆321May 29, 2025Updated last year
thu-ml / GFT
View on GitHub
☆53Jun 13, 2025Updated last year
LINs-lab / GMem
View on GitHub
[Preprint] GMem: A Modular Approach for Ultra-Efficient Generative Models
☆43Mar 11, 2025Updated last year
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
fluxions-ai / stftvae
View on GitHub
Inference for the STFT-VAE continuous audio codec (24kHz, 3.125Hz latent)
☆43Jul 12, 2026Updated 2 weeks ago
MKJia / MGVQ
View on GitHub
[Arxiv'25] MGVQ: Could VQ-VAE Beat VAE? A Generalizable Tokenizer with Multi-group Quantization
☆55Sep 16, 2025Updated 10 months ago
mit-han-lab / lpd
View on GitHub
[ICLR 2026 Oral] Locality-aware Parallel Decoding for Efficient Autoregressive Image Generation
☆104May 8, 2026Updated 2 months ago
bytedance / 1d-tokenizer
View on GitHub
This repo contains the code for 1D tokenizer and generator
☆1,167Mar 20, 2025Updated last year
lxa9867 / ImageFolder
View on GitHub
High-performance Image Tokenizers for VAR and AR
☆307Apr 25, 2025Updated last year
ByteVisionLab / DetailFlow
View on GitHub
🔥 Official impl. of "DetailFlow: 1D Coarse-to-Fine Autoregressive Image Generation via Next-Detail Prediction"
☆170Jul 10, 2025Updated last year
OliverRensu / xAR
View on GitHub
This repository includes the official implementation of our paper "Beyond Next-Token: Next-X Prediction for Autoregressive Visual Generat…
☆251Oct 12, 2025Updated 9 months ago
P1ping / TokAN-Legacy
View on GitHub
☆27Jun 22, 2026Updated last month
YWolfeee / InfoTok
View on GitHub
Codebase for InfoTok: Adaptive Discrete Video Tokenizer via Information-Theoretic Compression
☆53Mar 18, 2026Updated 4 months ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
FoundationVision / UniTok
View on GitHub
[NeurIPS 2025 Spotlight] A Unified Tokenizer for Visual Generation and Understanding
☆529Nov 14, 2025Updated 8 months ago
KdaiP / DC-Speech-VAE
View on GitHub
5Hz Deep-Compression Speech VAE for AR-Diffusion and CALMs
☆57Nov 19, 2025Updated 8 months ago
huawei-lin / VTBench
View on GitHub
This repository provides the official implementation of VTBench, a benchmark designed to evaluate the performance of visual tokenizers (V…
☆35Jul 30, 2025Updated 11 months ago
ZhengrongYue / UniFlow
View on GitHub
Official Implementation of "UniFlow: A Unified Pixel Flow Tokenizer for Visual Understanding and Generation"
☆143Oct 17, 2025Updated 9 months ago
zhuangshaobin / WeTok
View on GitHub
[ICLR2026] WeTok: Powerful Discrete Tokenization for High-Fidelity Visual Reconstruction
☆70Sep 3, 2025Updated 10 months ago
Eps-Acoustic-Revolution-Lab / DUO_TOK
View on GitHub
Official repository for “Duo-Tok: Dual-Track Semantic Music Tokenizer for Vocal–Accompaniment Generation.”
☆32Nov 26, 2025Updated 8 months ago
TencentARC / SEED-Voken
View on GitHub
SEED-Voken: A Series of Powerful Visual Tokenizers
☆1,018Nov 25, 2025Updated 8 months ago
showlab / D-AR
View on GitHub
the official repo for "D-AR: Diffusion via Autoregressive Models"
☆138Jan 29, 2026Updated 5 months ago
tzco / Diffusion-wo-CFG
View on GitHub
Official Implementation for Diffusion Models Without Classifier-free Guidance
☆175Feb 18, 2025Updated last year
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
CVL-UESTC / MVAR
View on GitHub
ICLR 2026-MVAR: Visual Autoregressive Modeling with Scale and Spatial Markovian Conditioning
☆38Apr 17, 2026Updated 3 months ago
aqtq314 / VogenSVS
View on GitHub
☆15Apr 16, 2026Updated 3 months ago
HaozheZhao / MENTOR
View on GitHub
☆31Jul 16, 2025Updated last year
MKJia / DINO-Tok
View on GitHub
[Arxiv'25] DINO-Tok: Adapting DINO for Visual Tokenizers
☆40Apr 11, 2026Updated 3 months ago
SonyResearch / VRVQ
View on GitHub
Variable Bitrate Residual Vector Quantization for Audio Coding
☆54May 1, 2025Updated last year
X-Omni-Team / X-Omni
View on GitHub
Official inference code and LongText-Bench benchmark for our paper X-Omni (https://arxiv.org/pdf/2507.22058).
☆426Aug 26, 2025Updated 11 months ago
zh460045050 / VQGAN-LC
View on GitHub
☆145Jun 28, 2024Updated 2 years ago