lxa9867 / ControlVAR

This is the official implementation for ControlVAR.

☆117

Alternatives and similar repositories for ControlVAR

Users who are interested in ControlVAR are comparing it to the repositories listed below.

MiracleDance / CAR
CAR: Controllable AutoRegressive Modeling for Visual Generation
☆121 Updated 8 months ago
daixiangzi / VAR-CLIP
Implements VAR+CLIP for text-to-image (T2I) generation
☆145 Updated 6 months ago
Davinci-XLab / STAR-T2I
Official implementation of "STAR: Scale-wise Text-to-image generation via Auto-Regressive representations"
☆36 Updated 4 months ago
ziqipang / RandAR
[CVPR 2025 (Oral)] Open implementation of "RandAR"
☆182 Updated 3 weeks ago
OliverRensu / xAR
This repository includes the official implementation of our paper "Beyond Next-Token: Next-X Prediction for Autoregressive Visual Generat…
☆228 Updated 3 months ago
maxin-cn / Awesome-Autoregressive-Visual-Generation-Models
A collection of awesome autoregressive visual generation models
☆76 Updated 3 months ago
lxa9867 / ImageFolder
High-performance Image Tokenizers for VAR and AR
☆279 Updated 3 months ago
yuhuUSTC / FAR
Frequency Autoregressive Image Generation with Continuous Tokens
☆81 Updated last month
vvvvvjdy / SRA
(SRA) No Other Representation Component Is Needed: Diffusion Transformers Can Provide Representation Guidance by Themselves
☆77 Updated 2 weeks ago
krennic999 / STAR
STAR: Scale-wise Text-to-image generation via Auto-Regressive representations
☆145 Updated 5 months ago
KwaiVGI / DiffMoE
PyTorch implementation of DiffMoE, TC-DiT, EC-DiT and Dense DiT
☆121 Updated 3 months ago
hustvl / ControlAR
[ICLR 2025] ControlAR: Controllable Image Generation with Autoregressive Models
☆282 Updated 3 months ago
hp-l33 / ARPG
Autoregressive Image Generation with Randomized Parallel Decoding
☆70 Updated 4 months ago
OliverRensu / FlowAR
"FlowAR: Scale-wise Autoregressive Image Generation Meets Flow Matching". FlowAR employs the simplest scale design and is compatible with an…
☆141 Updated 3 months ago
End2End-Diffusion / REPA-E
[ICCV 2025] Official implementation of the paper: REPA-E: Unlocking VAE for End-to-End Tuning of Latent Diffusion Transformers
☆315 Updated 3 weeks ago
markweberdev / maskbit
Implementation of the paper "MaskBit: Embedding-free Image Generation from Bit Tokens"
☆83 Updated 3 months ago
showlab / FAR
Code for: "Long-Context Autoregressive Video Modeling with Next-Frame Prediction"
☆234 Updated 3 months ago
wusize / OpenUni
☆144 Updated last month
czg1225 / CoDe
[CVPR 2025] CoDe: Collaborative Decoding Makes Visual Auto-Regressive Modeling Efficient
☆104 Updated 4 months ago
wyf0912 / AREdit
Training-Free Text-Guided Image Editing Using Visual Autoregressive Model
☆54 Updated 3 months ago
PKU-YuanGroup / WISE
WISE: A World Knowledge-Informed Semantic Evaluation for Text-to-Image Generation
☆136 Updated last month
FoundationVision / vaex
🔥stable, simple, state-of-the-art VQVAE toolkit & cookbook
☆97 Updated last year
YuqingWang1029 / PAR
[CVPR 2025 Highlight] PAR: Parallelized Autoregressive Visual Generation. https://yuqingwang1029.github.io/PAR-project
☆169 Updated 4 months ago
zijieli-Jlee / Dual-Diffusion
Code for D-DiT
☆44 Updated 4 months ago
DCDmllm / AnyEdit
[CVPR 2025 Oral] Official Repo for Paper "AnyEdit: Mastering Unified High-Quality Image Editing for Any Idea"
☆174 Updated 4 months ago
PKU-YuanGroup / WF-VAE
[CVPR 2025🔥] Enhancing Video VAE by Wavelet-Driven Energy Flow for Latent Video Diffusion Model
☆165 Updated 2 months ago
wusize / Harmon
[ICCV 2025] Code Release of Harmonizing Visual Representations for Unified Multimodal Understanding and Generation
☆145 Updated 2 months ago
xie-lab-ml / Zigzag-Diffusion-Sampling
[ICLR 2025] The code of Z-Sampling, proposed in our paper "Zigzag Diffusion Sampling: Diffusion Models Can Self-Improve via Self-Reflectio…
☆81 Updated 5 months ago
SingleZombie / AFLDM
[CVPR 2025 Oral] Alias-free Latent Diffusion Models (official implementation)
☆94 Updated 2 months ago
NUS-HPC-AI-Lab / Dynamic-Diffusion-Transformer
☆84 Updated 4 months ago