ZNan-Chen / Awesome-Visual-Autoregressive-Model
Latest Advances on Autoregressive Visual Models.
☆21 Updated 2 weeks ago
Alternatives and similar repositories for Awesome-Visual-Autoregressive-Model:
Users who are interested in Awesome-Visual-Autoregressive-Model are comparing it to the repositories listed below:
- CAR: Controllable AutoRegressive Modeling for Visual Generation ☆107 Updated 4 months ago
- [CVPR 2024] Official PyTorch implementation of FreeCustom: Tuning-Free Customized Image Generation for Multi-Concept Composition ☆144 Updated 2 months ago
- World Simulator Assistant for Physics-Aware Text-to-Video Generation ☆13 Updated this week
- This is the official implementation for ControlVAR. ☆100 Updated 3 months ago
- Official code for K-LoRA (CVPR 2025) ☆88 Updated 3 weeks ago
- A collection of resources on personalized image generation. ☆95 Updated last week
- A collection of diffusion models based on FLUX/DiT for image/video generation, editing, reconstruction, inpainting, etc. ☆36 Updated this week
- UniCombine: Unified Multi-Conditional Combination with Diffusion Transformer ☆52 Updated last week
- Finetune your VAE on private datasets! ☆28 Updated 9 months ago
- Official implementation of "STAR: Scale-wise Text-to-image generation via Auto-Regressive representations" ☆21 Updated 3 weeks ago
- This is a repo to track the latest autoregressive visual generation papers. ☆193 Updated this week
- [CVPR 2025] Official code of "DiTCtrl: Exploring Attention Control in Multi-Modal Diffusion Transformer for Tuning-Free Multi-Prompt Long… ☆243 Updated 2 weeks ago
- T2V-CompBench: A Comprehensive Benchmark for Compositional Text-to-video Generation ☆72 Updated this week
- The official implementation of the paper titled "StableV2V: Stablizing Shape Consistency in Video-to-Video Editing". ☆148 Updated 3 months ago
- a collection of awesome autoregressive visual generation models ☆69 Updated 3 weeks ago
- A Collection of AIGC Research Groups ☆69 Updated 3 weeks ago
- [NeurIPS 2023] Free-Bloom: Zero-Shot Text-to-Video Generator with LLM Director and LDM Animator ☆95 Updated last year
- Official Implementation of VideoGen-of-Thought: Step-by-step generating multi-shot video with minimal manual intervention ☆33 Updated last week
- CCEdit: Creative and Controllable Video Editing via Diffusion Models ☆107 Updated 9 months ago
- [NeurIPS 2024] Official implementation of "BELM: Bidirectional Explicit Linear Multi-step Sampler for Exact Inversion in Diffusion Models… ☆123 Updated 4 months ago
- This repository contains the official implementation of "FlowIE: Efficient Image Enhancement via Rectified Flow" ☆100 Updated 2 months ago
- Official Repo for Paper "AnyEdit: Mastering Unified High-Quality Image Editing for Any Idea" ☆65 Updated this week
- Implements VAR+CLIP for text-to-image (T2I) generation ☆131 Updated 2 months ago
- Frequency Autoregressive Image Generation with Continuous Tokens ☆42 Updated 3 weeks ago
- STAR: Scale-wise Text-to-image generation via Auto-Regressive representations ☆138 Updated last month
- ☆77 Updated 10 months ago
- ☆27 Updated 4 months ago
- [ICLR 2025] OpenVid-1M: A Large-Scale High-Quality Dataset for Text-to-video Generation ☆262 Updated last month
- [ECCV2024] Source Prompt Disentangled Inversion for Boosting Image Editability with Diffusion Models ☆41 Updated 8 months ago
- VideoVAE+: Large Motion Video Autoencoding with Cross-modal Video VAE ☆306 Updated 2 months ago