lxa9867 / ControlVAR
This is the official implementation for ControlVAR.
☆88Updated last month
Alternatives and similar repositories for ControlVAR:
Users that are interested in ControlVAR are comparing it to the libraries listed below
- CAR: Controllable AutoRegressive Modeling for Visual Generation☆94Updated last month
- Implements VAR+CLIP for text-to-image (T2I) generation☆112Updated 2 weeks ago
- XQ-GAN🚀: An Open-source Image Tokenization Framework for Autoregressive Generation☆178Updated last month
- CoDe: Collaborative Decoding Makes Visual Auto-Regressive Modeling Efficient☆75Updated last month
- STAR: Scale-wise Text-to-image generation via Auto-Regressive representations☆133Updated 7 months ago
- [NeurIPS 2023] Free-Bloom: Zero-Shot Text-to-Video Generator with LLM Director and LDM Animator☆93Updated 10 months ago
- ☆26Updated 5 months ago
- a collection of awesome autoregressive visual generation models☆63Updated 2 weeks ago
- Training-Free Condition-Guided Text-to-Video Generation☆59Updated last year
- ICCV2023-Diffusion-Papers☆109Updated last year
- [NeurIPS 24] Alleviating Distortion in Image Generation via Multi-Resolution Diffusion Models☆36Updated 3 months ago
- [ICLR 2024] Official PyTorch/Diffusers implementation of "Object-aware Inversion and Reassembly for Image Editing"☆82Updated 4 months ago
- ☆128Updated last month
- “FlowAR: Scale-wise Autoregressive Image Generation Meets Flow Matching” FlowAR employs a simplest scale design and is compatible with an…☆72Updated 3 weeks ago
- [NeurIPS 2024] The official code of "U-DiTs: Downsample Tokens in U-Shaped Diffusion Transformers"☆176Updated 3 months ago
- [CVPR`2024, Oral] Attention Calibration for Disentangled Text-to-Image Personalization☆88Updated 9 months ago
- [NeurIPS 2024] Motion Consistency Model: Accelerating Video Diffusion with Disentangled Motion-Appearance Distillation☆58Updated 2 months ago
- This repository contains the official implementation of "FlowIE: Efficient Image Enhancement via Rectified Flow"☆91Updated this week
- Implementation of the paper "MaskBit: Embedding-free Image Generation from Bit Tokens"☆39Updated last month
- "SlimFlow: Training Smaller One-Step Diffusion Models with Rectified Flow", Yuanzhi Zhu, Xingchao Liu, Qiang Liu☆44Updated last month
- This is a repo to track the latest autoregressive visual generation papers.☆103Updated 2 weeks ago
- 🔥stable, simple, state-of-the-art VQVAE toolkit & cookbook☆73Updated 6 months ago
- ☆124Updated 3 months ago
- FQGAN: Factorized Visual Tokenization and Generation☆39Updated last week
- [CVPR 2024] On the Content Bias in Fréchet Video Distance☆101Updated 3 months ago
- Official implementation of MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesis☆83Updated 6 months ago
- [CVPR2024] Official PyTorch implementation of "Contrastive Denoising Score(CDS) for Text-guided Latent Diffusion Image Editing"☆103Updated 2 months ago
- Official GitHub repository for the Text-Guided Video Editing (TGVE) competition of LOVEU Workshop @ CVPR'23.☆73Updated last year
- official repo for "VideoScore: Building Automatic Metrics to Simulate Fine-grained Human Feedback for Video Generation" [EMNLP2024]☆67Updated last month
- [CVPR 2024] Official PyTorch implementation of FreeCustom: Tuning-Free Customized Image Generation for Multi-Concept Composition☆127Updated 4 months ago