OliverRensu / FlowAR
“FlowAR: Scale-wise Autoregressive Image Generation Meets Flow Matching”. FlowAR employs the simplest scale design and is compatible with any VAE.
☆88 Updated last month
Alternatives and similar repositories for FlowAR:
Users who are interested in FlowAR are comparing it to the repositories listed below:
- Implementation of the paper "MaskBit: Embedding-free Image Generation from Bit Tokens" ☆47 Updated 2 weeks ago
- This is the official implementation for ControlVAR. ☆94 Updated 2 months ago
- ☆132 Updated last week
- ☆138 Updated 2 months ago
- [ICLR25] High-performance Image Tokenizers for VAR and AR ☆196 Updated this week
- "SlimFlow: Training Smaller One-Step Diffusion Models with Rectified Flow", Yuanzhi Zhu, Xingchao Liu, Qiang Liu ☆45 Updated 2 months ago
- [NeurIPS 24] Alleviating Distortion in Image Generation via Multi-Resolution Diffusion Models ☆36 Updated 4 months ago
- Official Pytorch implementation for LARP: Tokenizing Videos with a Learned Autoregressive Generative Prior (ICLR 2025 Oral). ☆50 Updated last week
- CoDe: Collaborative Decoding Makes Visual Auto-Regressive Modeling Efficient ☆75 Updated 3 weeks ago
- Open implementation of "RandAR" ☆53 Updated last month
- [CVPR 2024] On the Content Bias in Fréchet Video Distance ☆102 Updated 4 months ago
- This is a repo to track the latest autoregressive visual generation papers. ☆137 Updated last week
- [NeurIPS 2024] Stabilize the Latent Space for Image Autoregressive Modeling: A Unified Perspective ☆59 Updated 3 months ago
- [NeurIPS 2024] The official code of "U-DiTs: Downsample Tokens in U-Shaped Diffusion Transformers" ☆181 Updated 4 months ago
- Inference-only implementation of "One-Step Diffusion Distillation through Score Implicit Matching" [NIPS 2024] ☆78 Updated 3 months ago
- Implements VAR+CLIP for text-to-image (T2I) generation ☆119 Updated 3 weeks ago
- [arXiv'25] Reconstruction vs. Generation: Taming Optimization Dilemma in Latent Diffusion Models ☆246 Updated last month
- STAR: Scale-wise Text-to-image generation via Auto-Regressive representations ☆135 Updated 8 months ago
- PyTorch code and model checkpoints for Score identity Distillation (SiD) and its adversarial version (SiDA) ☆101 Updated last week
- ☆114 Updated 7 months ago
- FQGAN: Factorized Visual Tokenization and Generation ☆42 Updated last month
- CAR: Controllable AutoRegressive Modeling for Visual Generation ☆101 Updated 2 months ago
- Towards training VQ-VAE models robustly! ☆52 Updated last month
- Denoising Diffusion Step-aware Models (ICLR2024) ☆56 Updated last year
- Scalable Diffusion Models with State Space Backbone ☆151 Updated 11 months ago
- [ICLR 2025][arXiv:2406.07548] Image and Video Tokenization with Binary Spherical Quantization ☆127 Updated 8 months ago
- ☆43 Updated 5 months ago
- Implementation of Accelerating Auto-regressive Text-to-Image Generation with Training-free Speculative Jacobi Decoding ☆28 Updated 3 months ago
- Fréchet Video Motion Distance: A Metric for Evaluating Motion Consistency in Videos ☆49 Updated 6 months ago