FoundationVision/BitVAE

FoundationVision / BitVAE

official training and inference code of bitwise tokenizer

☆71

Alternatives and similar repositories for BitVAE

Users who are interested in BitVAE are comparing it to the libraries listed below.

NVlabs / HMAR
[CVPR 2025] HMAR: Efficient Hierarchical Masked Auto-Regressive Image Generation
☆63 · Jul 8, 2025 · Updated last year
FoundationVision / Infinity
[CVPR 2025 Oral] Infinity ∞: Scaling Bitwise AutoRegressive Modeling for High-Resolution Image Synthesis
☆1,579 · Apr 16, 2026 · Updated 3 months ago
FoundationVision / InfinityStar
[NeurIPS 2025 Oral] Infinity⭐️: Unified Spacetime AutoRegressive Modeling for Visual Generation
☆772 · Apr 16, 2026 · Updated 3 months ago
ByteVisionLab / DetailFlow
🔥 Official impl. of "DetailFlow: 1D Coarse-to-Fine Autoregressive Image Generation via Next-Detail Prediction"
☆170 · Jul 10, 2025 · Updated last year
showlab / TPDiff
TPDiff: Temporal Pyramid Video Diffusion Model
☆25 · Mar 13, 2025 · Updated last year
FoundationVision / UniTok
[NeurIPS 2025 Spotlight] A Unified Tokenizer for Visual Generation and Understanding
☆529 · Nov 14, 2025 · Updated 8 months ago
lxa9867 / ImageFolder
High-performance Image Tokenizers for VAR and AR
☆307 · Apr 25, 2025 · Updated last year
lxa9867 / ControlVAR
This is the official implementation for ControlVAR.
☆128 · Dec 10, 2024 · Updated last year
yinboc / dito
Official PyTorch Implementation of "Diffusion Autoencoders are Scalable Image Tokenizers"
☆168 · Jan 31, 2025 · Updated last year
IamCreateAI / CycleVAR
[ICCV 2025] CycleVAR: Repurposing Autoregressive Model for Unsupervised One-Step Image Translation
☆18 · Jul 7, 2025 · Updated last year
mmderakhshani / NeoBabel
Official implementation of the paper: "NeoBabel: A Multilingual Open Tower for Visual Generation"
☆25 · Aug 4, 2025 · Updated 11 months ago
NVlabs / DDO
[ICML 2025 Spotlight] Direct Discriminative Optimization: Reinforcing Diffusion/Autoregressive with GAN Discrimination
☆124 · Jan 27, 2026 · Updated 5 months ago
22109095 / SimOWT
This repository is an official implementation of the paper "A Simple Baseline for Open-World Tracking via Self-training".
☆10 · Jan 26, 2024 · Updated 2 years ago
ByteVisionLab / TokenFlow
[CVPR 2025] 🔥 Official impl. of "TokenFlow: Unified Image Tokenizer for Multimodal Understanding and Generation".
☆464 · Aug 8, 2025 · Updated 11 months ago
VisionXLab / AdapTok
[CVPR'26] AdapTok: Learning Adaptive and Temporally Causal Video Tokenization in a 1D Latent Space
☆28 · Mar 15, 2026 · Updated 4 months ago
X-Omni-Team / X-Omni
Official inference code and LongText-Bench benchmark for our paper X-Omni (https://arxiv.org/pdf/2507.22058).
☆426 · Aug 26, 2025 · Updated 10 months ago
shaochenze / EAR
☆42 · May 15, 2025 · Updated last year
JitengMu / EditAR
EditAR: Unified Conditional Generation with Autoregressive Models (CVPR 2025)
☆44 · Jun 13, 2025 · Updated last year
zelaki / eqvae
[ICML'25] EQ-VAE: Equivariance Regularized Latent Space for Improved Generative Image Modeling.
☆181 · Mar 18, 2026 · Updated 4 months ago
HiDream-ai / VAREdit
☆105 · Feb 4, 2026 · Updated 5 months ago
Neur-IO / ReVQ
Explore how to get VQ-VAE models efficiently!
☆69 · Jul 24, 2025 · Updated 11 months ago
wdrink / SimpleAR
PyTorch implementation for the paper titled "SimpleAR: Pushing the Frontier of Autoregressive Visual Generation"
☆431 · Jun 20, 2025 · Updated last year
tang-bd / fuse-dit
[CVPR 2025] Exploring the Deep Fusion of Large Language Models and Diffusion Transformers for Text-to-Image Synthesis
☆140 · May 16, 2025 · Updated last year
SilentView / GigaTok
[ICCV 2025] Official repo for "GigaTok: Scaling Visual Tokenizers to 3 Billion Parameters for Autoregressive Image Generation"
☆204 · Jan 7, 2026 · Updated 6 months ago
quyp2000 / VARSR
[ICML 2025] VARSR: Visual Autoregressive Modeling for Image Super Resolution
☆177 · May 1, 2025 · Updated last year
FoundationVision / vaex
🔥 stable, simple, state-of-the-art VQVAE toolkit & cookbook
☆108 · Jun 23, 2024 · Updated 2 years ago
ZitengWangNYU / Scale-RAE
Scaling Text-to-Image Diffusion Transformers with Representation Autoencoders
☆255 · Feb 13, 2026 · Updated 5 months ago
qingshi9974 / ECCV2024-AdpatICMH
[ECCV 2024] Image Compression for Machine and Human Vision With Spatial-Frequency Adaptation
☆55 · Mar 24, 2025 · Updated last year
NVlabs / QLIP
[arXiv: 2502.05178] QLIP: Text-Aligned Visual Tokenization Unifies Auto-Regressive Multimodal Understanding and Generation
☆97 · Mar 1, 2025 · Updated last year
yoongi43 / VRVQ
Implementation of the paper "Variable Bitrate Residual Vector Quantization for Audio Coding"
☆11 · Apr 10, 2025 · Updated last year
huawei-lin / VTBench
This repository provides the official implementation of VTBench, a benchmark designed to evaluate the performance of visual tokenizers (V…
☆35 · Jul 30, 2025 · Updated 11 months ago
tongdaxu / Making-rFID-Predictive-of-Diffusion-gFID
Predicting the generation FID of latent diffusion, with a variant of reconstruction FID of Variational Auto-encoder.
☆84 · Jun 15, 2026 · Updated last month
Tencent / HaploVLM
ICML 2025
☆63 · Aug 28, 2025 · Updated 10 months ago
mit-han-lab / hart
HART: Efficient Visual Generation with Hybrid Autoregressive Transformer
☆647 · Oct 16, 2024 · Updated last year
CUC-MIPG / VQGAN-Compression
Extreme Image Compression using Fine-tuned VQGAN Models (DCC 2024)
☆24 · Jan 14, 2025 · Updated last year
effl-lab / TACO
Official Implementation of "Neural Image Compression with Text-guided Encoding for both Pixel-level and Perceptual Fidelity (ICML 2024)"
☆44 · Aug 28, 2024 · Updated last year
Open-Model-Initiative / imagegen-speedrun
We bring the spirit of nanogpt-speedrun into the omni-modal world
☆15 · Jan 31, 2026 · Updated 5 months ago
stepfun-ai / NextStep-1
[🚀 ICLR 2026 Oral] NextStep-1: SOTA Autoregressive Image Generation with Continuous Tokens. A research project developed by the StepFun’s …
☆689 · Feb 27, 2026 · Updated 4 months ago
RayYuki / CodecBench
☆24 · Nov 16, 2025 · Updated 8 months ago