lxa9867 / ControlVAR
This is the official implementation for ControlVAR.
☆94Updated 2 months ago
Alternatives and similar repositories for ControlVAR:
Users that are interested in ControlVAR are comparing it to the libraries listed below
- CAR: Controllable AutoRegressive Modeling for Visual Generation☆99Updated 2 months ago
- Implements VAR+CLIP for text-to-image (T2I) generation☆119Updated 3 weeks ago
- “FlowAR: Scale-wise Autoregressive Image Generation Meets Flow Matching” FlowAR employs a simplest scale design and is compatible with an…☆86Updated last month
- STAR: Scale-wise Text-to-image generation via Auto-Regressive representations☆135Updated 7 months ago
- [ICLR25] High-performance Image Tokenizers for VAR and AR☆194Updated this week
- ☆123Updated this week
- Implementation of the paper "MaskBit: Embedding-free Image Generation from Bit Tokens"☆46Updated 2 weeks ago
- CoDe: Collaborative Decoding Makes Visual Auto-Regressive Modeling Efficient☆75Updated 3 weeks ago
- ICCV2023-Diffusion-Papers☆109Updated last year
- This is a repo to track the latest autoregressive visual generation papers.☆137Updated this week
- [CVPR 2024] On the Content Bias in Fréchet Video Distance☆102Updated 4 months ago
- a collection of awesome autoregressive visual generation models☆65Updated 3 weeks ago
- Training-Free Condition-Guided Text-to-Video Generation☆62Updated last year
- PyTorch implementation of InstructAny2Pix: Flexible Visual Editing via Multimodal Instruction Following☆28Updated 3 weeks ago
- [NeurIPS 2023] Free-Bloom: Zero-Shot Text-to-Video Generator with LLM Director and LDM Animator☆94Updated 10 months ago
- "SlimFlow: Training Smaller One-Step Diffusion Models with Rectified Flow", Yuanzhi Zhu, Xingchao Liu, Qiang Liu☆45Updated 2 months ago
- 🔥stable, simple, state-of-the-art VQVAE toolkit & cookbook☆74Updated 7 months ago
- [NeurIPS 24] Alleviating Distortion in Image Generation via Multi-Resolution Diffusion Models☆36Updated 4 months ago
- [ICLR2025]☆134Updated 2 weeks ago
- Official implementation of MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesis☆83Updated 7 months ago
- 🔥 Official impl. of "TokenFlow: Unified Image Tokenizer for Multimodal Understanding and Generation".☆249Updated last month
- [CVPR 2024] Official PyTorch implementation of FreeCustom: Tuning-Free Customized Image Generation for Multi-Concept Composition☆135Updated 2 weeks ago
- Official GitHub repository for the Text-Guided Video Editing (TGVE) competition of LOVEU Workshop @ CVPR'23.☆74Updated last year
- [NeurIPS 2024] The official code of "U-DiTs: Downsample Tokens in U-Shaped Diffusion Transformers"☆181Updated 4 months ago
- Dynamic Prompt Learning: Addressing Cross-Attention Leakage for Text-Based Image Editing (NeurIPS 2023)☆97Updated 9 months ago
- [NeurIPS 2024] Motion Consistency Model: Accelerating Video Diffusion with Disentangled Motion-Appearance Distillation☆58Updated 3 months ago
- [ICLR 2025] ControlAR: Controllable Image Generation with Autoregressive Models☆188Updated 3 weeks ago
- T2V-CompBench: A Comprehensive Benchmark for Compositional Text-to-video Generation☆64Updated this week
- Official Code for NeurIPS 2023 Paper: CycleNet: Rethinking Cycle Consistent in Text‑Guided Diffusion for Image Manipulation☆82Updated last year
- [NeurIPS 2024] Stabilize the Latent Space for Image Autoregressive Modeling: A Unified Perspective☆59Updated 3 months ago