hp-l33 / AiM

Official PyTorch Implementation of "Scalable Autoregressive Image Generation with Mamba"

☆137

Alternatives and similar repositories for AiM

Users who are interested in AiM are comparing it to the repositories listed below.

feizc / Dimba
Transformer-Mamba Diffusion Models
☆110 Updated last year
LINs-lab / UCGM
[Preprint] UCGM: Unified Continuous Generative Models
☆161 Updated last month
YuqingWang1029 / PAR
[CVPR2025 Highlight] PAR: Parallelized Autoregressive Visual Generation. https://yuqingwang1029.github.io/PAR-project
☆166 Updated 3 months ago
MCG-NJU / DDT
DDT: Decoupled Diffusion Transformer
☆264 Updated last week
hp-l33 / ARPG
Autoregressive Image Generation with Randomized Parallel Decoding
☆68 Updated 3 months ago
zelaki / eqvae
[ICML'25] EQ-VAE: Equivariance Regularized Latent Space for Improved Generative Image Modeling.
☆126 Updated 2 weeks ago
PKU-YuanGroup / WF-VAE
[CVPR 2025🔥] Enhancing Video VAE by Wavelet-Driven Energy Flow for Latent Video Diffusion Model
☆156 Updated 2 months ago
feizc / DiS
Scalable Diffusion Models with State Space Backbone
☆155 Updated last year
End2End-Diffusion / REPA-E
[ICCV 2025] Official implementation of the paper: REPA-E: Unlocking VAE for End-to-End Tuning of Latent Diffusion Transformers
☆301 Updated 3 months ago
Gen-Verse / Diffusion-Sharpening
Diffusion-Sharpening: Fine-tuning Diffusion Models with Denoising Trajectory Sharpening
☆62 Updated last month
OliverRensu / xAR
This repository includes the official implementation of our paper "Beyond Next-Token: Next-X Prediction for Autoregressive Visual Generat…
☆221 Updated 2 months ago
alexanderswerdlow / unidisc
UniDisc: A discrete diffusion model for joint multimodal generation, enabling controllable and efficient text-image synthesis, editing, a…
☆110 Updated 3 months ago
tzco / Diffusion-wo-CFG
Official Implementation for Diffusion Models Without Classifier-free Guidance
☆137 Updated 4 months ago
qihao067 / CrossFlow
[CVPR2025] PyTorch-based reimplementation of CrossFlow, as proposed in 'Flowing from Words to Pixels: A Noise-Free Framework for Cross-Mo…
☆283 Updated last month
ShoufaChen / PixelFlow
Pixel-Space Generative Models
☆255 Updated 2 months ago
SilentView / GigaTok
[ICCV 2025] Official repo for "GigaTok: Scaling Visual Tokenizers to 3 Billion Parameters for Autoregressive Image Generation"
☆166 Updated 2 weeks ago
Litalby1 / make-it-count
Official implementation of "Make It Count: Text-to-Image Generation with an Accurate Number of Objects" (CVPR 2025)
☆78 Updated 4 months ago
lxa9867 / ControlVAR
This is the official implementation for ControlVAR.
☆116 Updated 7 months ago
Hhhhhhao / continuous_tokenizer
☆211 Updated last month
qihao067 / DiMR
[NeurIPS 24] Alleviating Distortion in Image Generation via Multi-Resolution Diffusion Models
☆40 Updated 9 months ago
CompVis / discrete-interpolants
The official implementation of "[MASK] is All You Need"
☆121 Updated 4 months ago
shiml20 / FlowTurbo
[NeurIPS 2024] Official PyTorch Implementation of "FlowTurbo: Towards Real-time Flow-Based Image Generation with Velocity Refiner"
☆71 Updated 9 months ago
krennic999 / STAR
STAR: Scale-wise Text-to-image generation via Auto-Regressive representations
☆143 Updated 4 months ago
causalfusion / causalfusion
☆174 Updated 6 months ago
xie-lab-ml / Golden-Noise-for-Diffusion-Models
[ICCV2025] The code of our work "Golden Noise for Diffusion Models: A Learning Framework".
☆158 Updated 3 weeks ago
zelaki / ReDi
Boosting Generative Image Modeling via Joint Image-Feature Synthesis
☆47 Updated 3 weeks ago
OliverRensu / MVAR
☆70 Updated 7 months ago
haoningwu3639 / MegaFusion
[WACV 2025] MegaFusion: Extend Diffusion Models towards Higher-resolution Image Generation without Further Tuning
☆92 Updated 3 months ago
SingleZombie / AFLDM
[CVPR 2025 Oral] Alias-free Latent Diffusion Models (official implementation)
☆91 Updated last month
feizc / Diffusion-RWKV
Scaling RWKV-Like Architectures for Diffusion Models
☆135 Updated last year