apple / ml-flextokLinks

FlexTok: Resampling Images into 1D Token Sequences of Flexible Length

☆234

Alternatives and similar repositories for ml-flextok

Users that are interested in ml-flextok are comparing it to the libraries listed below

Sorting:

LINs-lab / UCGM
[Preprint] UCGM: Unified Continuous Generative Models
☆165Updated 2 months ago
MCG-NJU / DDT
DDT: Decoupled Diffusion Transformer
☆267Updated last month
qihao067 / CrossFlow
[CVPR2025] PyTorch-based reimplementation of CrossFlow, as proposed in 'Flowing from Words to Pixels: A Noise-Free Framework for Cross-Mo…
☆287Updated last month
OliverRensu / xAR
This repository includes the official implementation of our paper "Beyond Next-Token: Next-X Prediction for Autoregressive Visual Generat…
☆228Updated 3 months ago
yinboc / dito
Official PyTorch Implementation of "Diffusion Autoencoders are Scalable Image Tokenizers"
☆129Updated 6 months ago
Hhhhhhao / continuous_tokenizer
☆239Updated 2 months ago
zelaki / eqvae
[ICML'25] EQ-VAE: Equivariance Regularized Latent Space for Improved Generative Image Modeling.
☆139Updated last month
End2End-Diffusion / REPA-E
[ICCV 2025] Official implementation of the paper: REPA-E: Unlocking VAE for End-to-End Tuning of Latent Diffusion Transformers
☆313Updated 3 weeks ago
causalfusion / causalfusion
☆175Updated 7 months ago
tzco / Diffusion-wo-CFG
Official Implementation for Diffusion Models Without Classifier-free Guidance
☆141Updated 5 months ago
visual-gen / semanticist
(ICCV 2025) "Principal Components" Enable A New Language of Images
☆54Updated last week
SilentView / GigaTok
[ICCV 2025] Official repo for "GigaTok: Scaling Visual Tokenizers to 3 Billion Parameters for Autoregressive Image Generation"
☆176Updated last month
tang-bd / fuse-dit
[CVPR 2025] Exploring the Deep Fusion of Large Language Models and Diffusion Transformers for Text-to-Image Synthesis
☆113Updated 2 months ago
helblazer811 / ConceptAttention
ConceptAttention: A method for interpreting multi-modal diffusion transformers.
☆314Updated 3 months ago
OliverRensu / FlowAR
“FlowAR: Scale-wise Autoregressive Image Generation Meets Flow Matching” FlowAR employs a simplest scale design and is compatible with an…
☆141Updated 3 months ago
CompVis / discrete-interpolants
The official implementation of "[MASK] is All You Need"
☆122Updated last week
alexanderswerdlow / unidisc
UniDisc: A discrete diffusion model for joint multimodal generation, enabling controllable and efficient text-image synthesis, editing, a…
☆112Updated 4 months ago
ShivamDuggal4 / adaptive-length-tokenizer
Adaptive Length Image Tokenization via Recurrent Allocation | How many tokens is an image worth ?
☆126Updated 5 months ago
YuqingWang1029 / TokenBridge
[ICCV2025] TokenBridge: Bridging Continuous and Discrete Tokens for Autoregressive Visual Generation. https://yuqingwang1029.github.io/To…
☆131Updated last week
facebookresearch / metamorph
Code for MetaMorph Multimodal Understanding and Generation via Instruction Tuning
☆199Updated 3 months ago
YuqingWang1029 / PAR
[CVPR2025 Highlight] PAR: Parallelized Autoregressive Visual Generation. https://yuqingwang1029.github.io/PAR-project
☆169Updated 4 months ago
lukaslaobeyer / token-opt
Code for ICML 2025 Paper "Highly Compressed Tokenizer Can Generate Without Training"
☆148Updated last month
CompVis / tread
☆58Updated 3 weeks ago
forever208 / DCTdiff
[ICML 2025] Official code for the paper 'DCTdiff: Intriguing Properties of Image Generative Modeling in the DCT Space'
☆48Updated this week
microsoft / Reducio-VAE
☆202Updated 5 months ago
hp-l33 / AiM
Official PyTorch Implementation of "Scalable Autoregressive Image Generation with Mamba"
☆139Updated 6 months ago
yhli123 / Immiscible-Diffusion
Official Github Repo for Neurips 2024 Paper Immiscible Diffusion: Accelerating Diffusion Training with Noise Assignment
☆56Updated last month
hywang66 / LARP
Official Pytorch implementation for LARP: Tokenizing Videos with a Learned Autoregressive Generative Prior (ICLR 2025 Oral).
☆79Updated 5 months ago
zhuyu-cs / MeanFlow
Pytorch implementation of MeanFlow on ImageNet and CIFAR10
☆185Updated this week
ShoufaChen / PixelFlow
Pixel-Space Generative Models
☆261Updated 2 months ago