JCZ404/Awesome-Visual-Autoregressive

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/JCZ404/Awesome-Visual-Autoregressive)

JCZ404 / Awesome-Visual-Autoregressive

Curated list of recent visual autoregressive (VAR) modeling works

☆30

Alternatives and similar repositories for Awesome-Visual-Autoregressive

Users that are interested in Awesome-Visual-Autoregressive are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Visual-AI / Dissect-OOD-OSR
View on GitHub
[IJCV 2024] Dissecting Out-of-Distribution Detection and Open-Set Recognition: A Critical Analysis of Methods and Benchmarks
☆15Aug 30, 2024Updated last year
zhanghm1995 / Awesome-VQGAN
View on GitHub
Collect papers and codes about VQGAN in various Computer Vision tasks
☆10Dec 20, 2022Updated 3 years ago
Visual-AI / HiLo
View on GitHub
[ICLR2025] HiLo: A Learning Framework for Generalized Category Discovery Robust to Domain Shifts
☆21Aug 1, 2025Updated 9 months ago
Visual-AI / GAMEBoT
View on GitHub
[ACL 2025] GAMEBoT: Transparent Assessment of LLM Reasoning in Games
☆32May 15, 2026Updated last week
woo0818 / SceneMI
View on GitHub
☆33Oct 17, 2025Updated 7 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
CurryYuan / X-Trans2Cap
View on GitHub
[CVPR 2022] X-Trans2Cap: Cross-Modal Knowledge Transfer using Transformer for 3D Dense Captioning
☆36Aug 26, 2022Updated 3 years ago
HVision-NKU / MaskDiffusion
View on GitHub
☆12Dec 7, 2024Updated last year
HVision-NKU / ControlSR
View on GitHub
☆13Apr 19, 2025Updated last year
Aaron617 / text2world
View on GitHub
[ACL 2025 Findings] Text2World: Benchmarking Large Language Models for Symbolic World Model Generation
☆29Feb 25, 2025Updated last year
KaiyueSun98 / T2I-Personalization-with-AR
View on GitHub
☆47Apr 20, 2025Updated last year
yongliu20 / Awesome-Unified-Understanding-and-Generation
View on GitHub
☆53Aug 22, 2025Updated 9 months ago
qinghuannn / ChainHOI
View on GitHub
☆15May 1, 2025Updated last year
harshbhatt7585 / StillMoving
View on GitHub
☆17Jul 30, 2024Updated last year
vvvvvjdy / D-OPSD
View on GitHub
Official Repo of "D-OPSD: On-Policy Self-Distillation for Continuously Tuning Step-Distilled Diffusion Models"
☆137May 13, 2026Updated last week
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
FoundationVision / UniTok
View on GitHub
[NeurIPS 2025 Spotlight] A Unified Tokenizer for Visual Generation and Understanding
☆523Nov 14, 2025Updated 6 months ago
JudgementH / RefAny3D
View on GitHub
[ICLR 2026] RefAny3D: 3D Asset-Referenced Diffusion Models for Image Generation
☆34Mar 10, 2026Updated 2 months ago
wangyePHD / OmniStyle
View on GitHub
OmniStyle: Filtering High Quality Style Transfer Data at Scale (CVPR 2025)
☆34Aug 9, 2025Updated 9 months ago
showlab / UniRL
View on GitHub
The code repository of UniRL
☆52May 30, 2025Updated 11 months ago
deepffff / SADis
View on GitHub
The code of the paper "Free-Lunch Color-Texture Disentanglement for Stylized Image Generation"
☆36Sep 18, 2025Updated 8 months ago
Abdennacer-Badaoui / D3PMs
View on GitHub
A small project that uses Discrete Denoising Diffusion Probabilistic Models (D3PMs), a generative model for discrete data that builds upo…
☆15Aug 10, 2024Updated last year
jiutiancv / JV-CV-T2V
View on GitHub
☆12Sep 24, 2024Updated last year
mini-sora / MiniSora-DiT
View on GitHub
minisora-DiT, a DiT reproduction based on XTuner from the open source community MiniSora
☆39Mar 25, 2024Updated 2 years ago
Visual-AI / RegionDrag
View on GitHub
[ECCV2024] RegionDrag: Fast Region-Based Image Editing with Diffusion Models
☆66Oct 9, 2024Updated last year
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
zhanghm1995 / Awesome-VAR
View on GitHub
A curated list of resources focused on Visual AutoRegressive Modeling, makes GPT-style AR models surpass diffusion transformers in image …
☆41Mar 2, 2025Updated last year
HawHello / Motion-Occupancy-Base
View on GitHub
Official repository for gathering data of Revisit Human-Scene Interaction via Space Occupancy (ECCV 2024).
☆30Sep 29, 2024Updated last year
VClinic / VClinic
View on GitHub
A portable and efficient infrastracture for value profilers. Doc: https://vclinic.readthedocs.io/en/latest/index.html
☆14Mar 4, 2026Updated 2 months ago
Visual-AI / PromptCCD
View on GitHub
[ECCV2024] PromptCCD: Learning Gaussian Mixture Prompt Pool for Continual Category Discovery
☆30Apr 3, 2025Updated last year
ziqipang / RandAR
View on GitHub
[CVPR 2025 (Oral)] Open implementation of "RandAR"
☆207Jul 14, 2025Updated 10 months ago
jylei16 / Imagine-e
View on GitHub
☆14Jan 22, 2025Updated last year
MMMGBench / MMMG
View on GitHub
MMMG: A Massive, Multidisciplinary, Multi-Tier Generation Benchmark for Text-to-Image Reasoning [NeurIPS 2025 Poster]
☆24Dec 10, 2025Updated 5 months ago
CookSleep / EasyTTS
View on GitHub
EasyTTS是一个便捷的工具，旨在方便地使用第三方API服务来调用OpenAI的文本转语音（TTS）功能。 EasyTTS允许用户输入文本，并选择不同的模型、音色、格式来生成音频文件。
☆10Nov 26, 2023Updated 2 years ago
aiwaves-cn / Dive-into-LLMs
View on GitHub
The official github repo for the open online courses: "Dive into LLMs".
☆10Mar 15, 2024Updated 2 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
lixiaoyu2000 / HAT
View on GitHub
Official Repo For AAAI 2026 Accepted Paper "Rethinking the Spatio-Temporal Alignment of End-to-End 3D Perception"
☆30Mar 25, 2026Updated last month
synsin0 / COME
View on GitHub
Adding Scene-Centric Forecasting Control to Occupancy World Model
☆42Aug 24, 2025Updated 9 months ago
PKU-YuanGroup / UniWorld
View on GitHub
UniWorld: High-Resolution Semantic Encoders for Unified Visual Understanding and Generation
☆876Dec 23, 2025Updated 5 months ago
guo-yu / express-scaffold
View on GitHub
Simple sexy scaffold for Express
☆44Jan 16, 2015Updated 11 years ago
t2ac32 / PSNR-HVS-M-for-python
View on GitHub
A python implementation of PSNR that takes the Human visual system into account.
☆13Apr 9, 2026Updated last month
VIPL-GENUN / JoPano
View on GitHub
JoPano: Unified Panorama Generation via Joint Modeling
☆24Mar 6, 2026Updated 2 months ago
dl-container-registry / furnari-flow
View on GitHub
Antonino Furnari's fork of Feichtenhofer's gpu_flow, with temporal dilation.
☆10Sep 18, 2020Updated 5 years ago