ZNan-Chen/Awesome-Visual-Autoregressive-Model

Latest Advances on Autoregressive Visual Models.📖

☆28

Alternatives and similar repositories for Awesome-Visual-Autoregressive-Model

Users interested in Awesome-Visual-Autoregressive-Model are comparing it to the libraries listed below.

NJU-PCALab / CoDi
CoDi: Subject-Consistent and Pose-Diverse Text-to-Image Generation
☆36 · Aug 1, 2025 · Updated 11 months ago
NJU-PCALab / L2P
L2P: Unlocking Latent Potential for Pixel Generation
☆39 · May 22, 2026 · Updated 2 months ago
NJU-PCALab / TextCrafter
TextCrafter: Accurately Rendering Multiple Texts in Complex Visual Scenes
☆97 · Nov 26, 2025 · Updated 8 months ago
NJU-PCALab / UltraHR-100k
This is the official repository of UltraHR-100K.
☆45 · Nov 21, 2025 · Updated 8 months ago
TencentYoutuResearch / T2I-L2P
Code for "L2P: Unlocking Latent Potential for Pixel Generation"
☆179 · Jul 11, 2026 · Updated 2 weeks ago
NJU-PCALab / InstanceCap
[CVPR 2025] InstanceCap: Improving Text-to-Video Generation via Instance-aware Structured Caption 🔍
☆45 · Jul 5, 2025 · Updated last year
LeyRio / MIG_Bench
The MIG benchmark of CVPR2024 MIGC
☆15 · Mar 3, 2024 · Updated 2 years ago
EDM-Research / VATr-pp
☆18 · Jul 9, 2024 · Updated 2 years ago
NJU-PCALab / MotionSight
[ICLR 2026] MotionSight's official code implementation.
☆48 · Apr 24, 2026 · Updated 3 months ago
maxin-cn / Awesome-Autoregressive-Visual-Generation-Models
a collection of awesome autoregressive visual generation models
☆82 · Apr 17, 2025 · Updated last year
River-Zhang / Awesome-FLUX-DiT
A collection of diffusion models based on FLUX/DiT for image/video generation, editing, reconstruction, inpainting, etc.
☆86 · Jun 20, 2025 · Updated last year
mlpc-ucsd / OverLayBench
(NeurIPS 2025 D&B Track) OverLayBench: A Benchmark for Layout-to-Image Generation with Dense Overlaps
☆27 · May 4, 2026 · Updated 2 months ago
jiahao-shao1 / notion-lifeos-skill
Notion LifeOS PARA system: agent skill for Claude Code, OpenClaw, Codex and more
☆23 · Mar 24, 2026 · Updated 4 months ago
aimagelab / HWD
☆27 · Mar 7, 2025 · Updated last year
JacobLinCool / sdxl-api
SDXL API provides a seamless interface for image generation and retrieval using Stable Diffusion XL integrated with Cloudflare AI Workers…
☆14 · Feb 29, 2024 · Updated 2 years ago
singularityrms / OLHWG
The implementation of Decoupling Layout from Glyph in Online Chinese Handwriting Generation (ICLR 2025)
☆25 · May 26, 2025 · Updated last year
vahlok-alunmid / ComfyUI-ExtendIPAdapterClipVision
An extension for ComfyUI that adds IPAdapter nodes for CLIP vision models with different input sizes.
☆18 · Feb 9, 2025 · Updated last year
baptiste-genest / NESOTS
Source code of the article "Non Euclidean Sliced Optimal Transport Sampling" published at Eurographics 2024, authors: Baptiste GENEST, Ni…
☆12 · Aug 28, 2024 · Updated last year
yuhuUSTC / FAR
Frequency Autoregressive Image Generation with Continuous Tokens
☆101 · Jun 9, 2025 · Updated last year
AiArt-Gao / HMEG
[CVPR'24] Handwritten Mathematical Expressions Generation (HMEG)
☆34 · Jun 3, 2024 · Updated 2 years ago
NJU-PCALab / ERR
[CVPR 2025] Official code of "From Zero to Detail: Deconstructing Ultra-High-Definition Image Restoration from Progressive Spectral Persp…
☆60 · Apr 16, 2026 · Updated 3 months ago
Twinkle-ce / DescriptiveEdit_code
[ICCV 2025] Official implementation for Describe, Don't Dictate: Semantic Image Editing with Natural Language Intent
☆15 · Nov 4, 2025 · Updated 8 months ago
lizhh268 / ShadowMaskFormer
[TAI 2025] Official implementation of TAI-accepted paper: ShadowMaskFormer: Mask Augmented Patch Embedding for Shadow Removal
☆15 · May 8, 2025 · Updated last year
banjiuyufen / Analysis-of-CASIA-OLHWDB-Dataset
In OLHWDB, you can find the ptts files; this code can help you get the information of the ptts
☆11 · Mar 8, 2022 · Updated 4 years ago
mo230761 / UniGeo
A framework for camera-controllable image editing using unified geometric guidance and video models.
☆65 · Jun 25, 2026 · Updated last month
yuanhongyi / zjucalc24
Repository of Calculus (A) I Course Materials for the Autumn-Winter Semester of the 2024-2025 Academic Year at Zhejiang University.
☆10 · Jun 2, 2026 · Updated last month
aimagelab / COGT
[ICLR 2025] Causal Graphical Models for Vision-Language Compositional Understanding
☆10 · Apr 15, 2025 · Updated last year
eren23 / neo-unify
Toy-scale unified multimodal model experiments: encoder-free understanding & generation with Mixture-of-Transformers on MLX/Apple Silico…
☆47 · Mar 8, 2026 · Updated 4 months ago
Guohanzhong / GMS
☆23 · May 7, 2024 · Updated 2 years ago
Ming-er / Audio-Free-P-Tuning
☆11 · Dec 28, 2023 · Updated 2 years ago
yurujiang2003 / sparta
NeurIPS 2025
☆15 · Feb 4, 2026 · Updated 5 months ago
walker-hyf / GPT-Talker
Generative Expressive Conversational Speech Synthesis (Accepted by MM'2024)
☆78 · Nov 1, 2024 · Updated last year
weichow23 / AnySD
Official model implementation and benchmark evaluation repository of <AnyEdit: Unified High-Quality Image Edit with Any Idea>
☆34 · Jul 18, 2025 · Updated last year
1073521013 / GlyphDraw
Text-To-Image Generation with Chinese Characters
☆23 · Jan 16, 2026 · Updated 6 months ago
CzzzzH / MLTD
[SIGGRAPH 2024] Temporally Stable Metropolis Light Transport Denoising using Recurrent Transformer Blocks
☆20 · Jul 31, 2024 · Updated last year
lxa9867 / Awesome-Autoregressive-Visual-Generation
This is a repo to track the latest autoregressive visual generation papers.
☆430 · Jun 25, 2025 · Updated last year
foreverlasting1202 / QuestA
☆22 · Jan 2, 2026 · Updated 6 months ago
Style3D / FashionR2R
☆32 · Oct 23, 2024 · Updated last year
nenhang / ContextGen
[ICLR 2026] ContextGen: Contextual Layout Anchoring for Identity-Consistent Multi-Instance Generation
☆85 · Apr 19, 2026 · Updated 3 months ago