ShivamDuggal4/UNITE-tokenization-generation

ShivamDuggal4 / UNITE-tokenization-generation

Single-stage End-to-End Training for Tokenization and Generation

☆117

Alternatives and similar repositories for UNITE-tokenization-generation

Users interested in UNITE-tokenization-generation are comparing it to the repositories listed below.

nanovisionx / RAEv2
Official Implementation for RAEv2: Improved Baselines with Representation Autoencoders
☆308 · May 21, 2026 · Updated last month
Jiawei-Yang / FD-Loss
☆544 · May 1, 2026 · Updated 2 months ago
black-forest-labs / Self-Flow
[ICML'26] Code and website for Self-Flow: Self-Supervised Flow Matching for Scalable Multi-Modal Synthesis
☆534 · May 23, 2026 · Updated last month
zelaki / eqvae
[ICML'25] EQ-VAE: Equivariance Regularized Latent Space for Improved Generative Image Modeling.
☆181 · Mar 18, 2026 · Updated 4 months ago
ShivamDuggal4 / adaptive-length-tokenizer
Adaptive Length Image Tokenization via Recurrent Allocation | How many tokens is an image worth?
☆146 · Feb 11, 2025 · Updated last year
Jiawei-Yang / DeTok
Official PyTorch Implementation of "Latent Denoising Makes Good Visual Tokenizers"
☆195 · Feb 24, 2026 · Updated 4 months ago
ZitengWangNYU / Scale-RAE
Scaling Text-to-Image Diffusion Transformers with Representation Autoencoders
☆255 · Feb 13, 2026 · Updated 5 months ago
xingjian-bai / sparse-causal-diffusion
☆46 · Feb 20, 2026 · Updated 5 months ago
apple / ml-flextok
FlexTok: Resampling Images into 1D Token Sequences of Flexible Length
☆321 · Jun 2, 2025 · Updated last year
Zehong-Ma / PixelGen
Official repository for “PixelGen: Improving Pixel Diffusion with Perceptual Loss”
☆273 · May 12, 2026 · Updated 2 months ago
visual-gen / semanticist
(ICCV 2025) "Principal Components" Enable A New Language of Images
☆86 · Jun 4, 2026 · Updated last month
shiml20 / SVG
[ICLR 2026] Official PyTorch Implementation of "Latent Diffusion Model Without Variational Autoencoder".
☆457 · Dec 15, 2025 · Updated 7 months ago
facebookresearch / tuna-2
Official implementation of Tuna-2: Pixel Embeddings Beat Vision Encoders for Unified Understanding and Generation
☆738 · Updated this week
End2End-Diffusion / REPA-E
[ICCV 2025] Official implementation of the paper: REPA-E: Unlocking VAE for End-to-End Tuning of Latent Diffusion Transformers
☆511 · Dec 6, 2025 · Updated 7 months ago
ShivamDuggal4 / karl
Single-pass Adaptive Image Tokenization for Minimum Program Search | What's the Kolmogorov Complexity of an Image?
☆43 · Jul 26, 2025 · Updated 11 months ago
Lyy-iiis / pMF
Official Implementation of pMF https://arxiv.org/abs/2601.22158
☆268 · Feb 19, 2026 · Updated 5 months ago
CompVis / RepTok
[ICLR 2026] Adapting Self-Supervised Representations as a Latent Space for Efficient Generation
☆59 · Apr 24, 2026 · Updated 2 months ago
MCG-NJU / DDT
[CVPR 2026] DDT: Decoupled Diffusion Transformer
☆403 · May 22, 2026 · Updated last month
End2End-Diffusion / iREPA
[ICLR 2026] Official implementation for What matters for Representation Alignment: Global Information or Spatial Structure?
☆256 · Dec 15, 2025 · Updated 7 months ago
alibaba / OmniDoc-TokenBench
☆69 · May 14, 2026 · Updated 2 months ago
csslc / Self-Transcendence
[ECCV 2026] Official code repository for "Self-transcendence: Is External Feature Guidance Indispensable for Accelerating Diffusion Trans…
☆36 · Jul 3, 2026 · Updated 2 weeks ago
ByteDance-Seed / Adversarial-Flow-Models
☆84 · Apr 18, 2026 · Updated 3 months ago
amazon-far / deltatok
[CVPR 2026 Highlight] A Frame is Worth One Token: Efficient Generative World Modeling with Delta Tokens
☆208 · Updated this week
bytetriper / RAE
Official PyTorch Implementation of "Diffusion Transformers with Representation Autoencoders"
☆1,977 · Feb 25, 2026 · Updated 4 months ago
hustvl / LightningDiT
[CVPR 2025 Oral] Reconstruction vs. Generation: Taming Optimization Dilemma in Latent Diffusion Models
☆1,507 · Dec 16, 2025 · Updated 7 months ago
sii-research / GAE
Official code of Geometric Autoencoder for Diffusion Models.
☆20 · Mar 12, 2026 · Updated 4 months ago
zelaki / ReDi
[NeurIPS'25 Spotlight] Boosting Generative Image Modeling via Joint Image-Feature Synthesis
☆120 · Nov 3, 2025 · Updated 8 months ago
yinboc / dito
Official PyTorch Implementation of "Diffusion Autoencoders are Scalable Image Tokenizers"
☆168 · Jan 31, 2025 · Updated last year
tang-bd / v-grpo
[CVPR 2026 Findings] V-GRPO: Online Reinforcement Learning for Denoising Generative Models Is Easier than You Think
☆56 · Apr 28, 2026 · Updated 2 months ago
KlingAIResearch / SVG-T2I
[Arxiv 2025] Official PyTorch Implementation of "SVG-T2I: Scaling up Text-to-Image Latent Diffusion Model Without Variational Autoencoder…
☆152 · Dec 18, 2025 · Updated 7 months ago
LTH14 / JiT
PyTorch implementation of JiT https://arxiv.org/abs/2511.13720
☆2,459 · Dec 8, 2025 · Updated 7 months ago
MiniMax-AI / VTP
[ECCV 2026] Towards Scalable Pre-training of Visual Tokenizers for Generation
☆495 · Apr 15, 2026 · Updated 3 months ago
tongdaxu / Making-rFID-Predictive-of-Diffusion-gFID
Predicting the generation FID of latent diffusion, with a variant of reconstruction FID of Variational Auto-encoder.
☆84 · Jun 15, 2026 · Updated last month
vvvvvjdy / SRA
[ICLR 2026] Self-Representation Alignment for Diffusion Transformers (SRA)
☆144 · Jul 3, 2026 · Updated 2 weeks ago
Hope7Happiness / minit2i-torch
Official PyTorch re-implementation of MiniT2I.
☆285 · Jun 24, 2026 · Updated 3 weeks ago
End2End-Diffusion / diffusion-bench
Towards Holistic evaluation of Generative Diffusion Transformers!
☆98 · Jul 1, 2026 · Updated 2 weeks ago
SwayStar123 / SpeedrunDiT
SR-DiT Speedrunning ImageNet Diffusion
☆139 · Apr 6, 2026 · Updated 3 months ago
showlab / D-AR
The official repo for "D-AR: Diffusion via Autoregressive Models"
☆138 · Jan 29, 2026 · Updated 5 months ago
NVlabs / AnyFlow
Flow Map OPD for AnyStep Video Diffusion
☆394 · May 23, 2026 · Updated last month