hp-l33 / AiMLinks
Official PyTorch Implementation of "Scalable Autoregressive Image Generation with Mamba"
β140Updated 8 months ago
Alternatives and similar repositories for AiM
Users that are interested in AiM are comparing it to the libraries listed below
Sorting:
- Transformer-Mamba Diffusion Modelsβ116Updated last year
- [CVPR 2025π₯] Enhancing Video VAE by Wavelet-Driven Energy Flow for Latent Video Diffusion Modelβ168Updated 4 months ago
- [CVPR2025 Highlight] PAR: Parallelized Autoregressive Visual Generation. https://yuqingwang1029.github.io/PAR-projectβ171Updated 6 months ago
- DDT: Decoupled Diffusion Transformerβ275Updated last month
- [Preprint] UCGM: Unified Continuous Generative Modelsβ168Updated 3 months ago
- [NeurIPS'25 Spotlight] Boosting Generative Image Modeling via Joint Image-Feature Synthesisβ64Updated this week
- Autoregressive Image Generation with Randomized Parallel Decodingβ73Updated 5 months ago
- Diffusion-Sharpening: Fine-tuning Diffusion Models with Denoising Trajectory Sharpeningβ65Updated 4 months ago
- Scalable Diffusion Models with State Space Backboneβ157Updated last year
- PixNerd: Pixel Neural Field Diffusionβ113Updated this week
- Pixel-Space Generative Modelsβ269Updated 4 months ago
- UniDisc: A discrete diffusion model for joint multimodal generation, enabling controllable and efficient text-image synthesis, editing, aβ¦β121Updated 5 months ago
- [CVPR2025] PyTorch-based reimplementation of CrossFlow, as proposed in 'Flowing from Words to Pixels: A Noise-Free Framework for Cross-Moβ¦β298Updated 3 months ago
- This repository includes the official implementation of our paper "Beyond Next-Token: Next-X Prediction for Autoregressive Visual Generatβ¦β234Updated 4 months ago
- Transition Modelsβ119Updated this week
- STAR: Scale-wise Text-to-image generation via Auto-Regressive representationsβ146Updated 7 months ago
- The official implementation of OmniFlow: Any-to-Any Generation with Multi-Modal Rectified Flowsβ87Updated last month
- [ICCV2025] The code of our work "Golden Noise for Diffusion Models: A Learning Framework".β176Updated last month
- (SRA) No Other Representation Component Is Needed: Diffusion Transformers Can Provide Representation Guidance by Themselvesβ87Updated last month
- β70Updated 10 months ago
- [WACV 2025] MegaFusion: Extend Diffusion Models towards Higher-resolution Image Generation without Further Tuningβ95Updated 5 months ago
- [CVPR 2025 Oral] Alias-free Latent Diffusion Models (official implementation)β97Updated 3 months ago
- This is the official implementation for ControlVAR.β121Updated 9 months ago
- [ICCV 2025] Official implementation of the paper: REPA-E: Unlocking VAE for End-to-End Tuning of Latent Diffusion Transformersβ341Updated 2 months ago
- [ICML'25] EQ-VAE: Equivariance Regularized Latent Space for Improved Generative Image Modeling.β152Updated 2 months ago
- Official Implementation for Diffusion Models Without Classifier-free Guidanceβ151Updated 7 months ago
- Distilling Diversity and Control in Diffusion Modelsβ45Updated 4 months ago
- [arxiv 25] Turbo-VAED: Fast and Stable Transfer of Video-VAEs to Mobile Devicesβ51Updated 2 weeks ago
- β38Updated 3 months ago
- [ICML 2025] Official code for the paper 'DCTdiff: Intriguing Properties of Image Generative Modeling in the DCT Space'β58Updated last month