Latest Advances on Autoregressive Visual Models.📖
☆28Mar 15, 2025Updated last year
Alternatives and similar repositories for Awesome-Visual-Autoregressive-Model
Users that are interested in Awesome-Visual-Autoregressive-Model are comparing it to the libraries listed below.
- CoDi: Subject-Consistent and Pose-Diverse Text-to-Image Generation☆37Aug 1, 2025Updated 9 months ago
- TextCrafter: Accurately Rendering Multiple Texts in Complex Visual Scenes☆96Nov 26, 2025Updated 5 months ago
- Official implementation for AAAI 2025 paper: SSAN: A Symbol Spatial-Aware Network for Handwritten Mathematical Expression Recognition☆16Jan 21, 2025Updated last year
- [AAAI 2025] Exploiting Multimodal Spatial-temporal Patterns for Video Object Tracking☆118May 18, 2025Updated 11 months ago
- [CVPR 2025] InstanceCap: Improving Text-to-Video Generation via Instance-aware Structured Caption 🔍☆46Jul 5, 2025Updated 10 months ago
- ☆18Jul 9, 2024Updated last year
- [ICLR 2026] ContextGen: Contextual Layout Anchoring for Identity-Consistent Multi-Instance Generation☆77Apr 19, 2026Updated 2 weeks ago
- [ICLR 2026] MotionSight's official code implementation.☆47Apr 24, 2026Updated 2 weeks ago
- A collection of diffusion models based on FLUX/DiT for image/video generation, editing, reconstruction, inpainting, etc.☆86Jun 20, 2025Updated 10 months ago
- The implementation of Decoupling Layout from Glyph in Online Chinese Handwriting Generation (ICLR 2025)☆24May 26, 2025Updated 11 months ago
- A collection of awesome autoregressive visual generation models☆80Apr 17, 2025Updated last year
- A framework for camera-controllable image editing using unified geometric guidance and video models.☆59Apr 28, 2026Updated last week
- Frequency Autoregressive Image Generation with Continuous Tokens☆98Jun 9, 2025Updated 11 months ago
- ☆27Mar 7, 2025Updated last year
- A curated list of resources focused on Visual AutoRegressive Modeling, makes GPT-style AR models surpass diffusion transformers in image …☆40Mar 2, 2025Updated last year
- [CVPR'26] UniGame code implementation☆19Apr 21, 2026Updated 2 weeks ago
- [CVPR 2025] Official code of "From Zero to Detail: Deconstructing Ultra-High-Definition Image Restoration from Progressive Spectral Persp…☆55Apr 16, 2026Updated 3 weeks ago
- Official implementation for ICDAR 2024 Oral paper "ICAL: Implicit Character-Aided Learning for Enhanced Handwritten Mathematical Expressi…☆29Aug 16, 2024Updated last year
- Explicit Context Reasoning with Supervision for Visual Tracking (ACM MM 25)☆18Jul 20, 2025Updated 9 months ago
- ☆16Nov 28, 2024Updated last year
- Source code of the article "Non Euclidean Sliced Optimal Transport Sampling" published at Eurographics 2024, authors: Baptiste GENEST, Ni…☆12Aug 28, 2024Updated last year
- A fully automated, intelligent photo-editing agent that autonomously plans multi-step aesthetic enhancements, smartly chooses diverse edi…☆44Mar 12, 2026Updated last month
- [ICCV 2025] Region-Aware Text-to-Image Generation via Hard Binding and Soft Refinement 🔥☆619Dec 12, 2025Updated 4 months ago
- (NeurIPS 2025 D&B Track) OverLayBench: A Benchmark for Layout-to-Image Generation with Dense Overlaps☆26Updated this week
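- Bagu Killer, built to kill rote interview questions! An LLM-based collector of "bagu" (rote-memorized interview questions) from big-tech interview write-ups: one click covers the whole pipeline from collection to filtering to write-up, with support for custom keywords, so no bagu question has anywhere to hide!☆81Apr 19, 2026Updated 3 weeks ago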
- ☆13Apr 5, 2020Updated 6 years ago
- Repository of Calculus (A) I Course Materials for the Autumn-Winter Semester of the 2024-2025 Academic Year at Zhejiang University.☆10Jan 25, 2026Updated 3 months ago
- [CVPR 2026 Highlight] VideoCoF: Unified Video Editing with Temporal Reasoner☆180Feb 22, 2026Updated 2 months ago
- FiDeSR: High-Fidelity and Detail-Preserving One-Step Diffusion Super-Resolution☆38Updated this week
- ☆39Feb 15, 2026Updated 2 months ago
- [Trans. on Graphics (ToG) 2024] Official code release for paper: 🎯"DARTS: Diffusion Approximated Residual Time Sampling for Time-of-flig…☆18Dec 24, 2024Updated last year
- [NeurIPS 2025] Seg2Any: Open-set Segmentation-Mask-to-Image Generation with Precise Shape and Semantic Control☆47Apr 1, 2026Updated last month
- This is a repo to track the latest autoregressive visual generation papers.☆431Jun 25, 2025Updated 10 months ago
- This is the official repository of UltraHR-100K.☆46Nov 21, 2025Updated 5 months ago
- ☆70Aug 13, 2025Updated 8 months ago
- The official repo for LIFT: Language-Image Alignment with Fixed Text Encoders☆42Jun 10, 2025Updated 10 months ago
- Official repository of Myna: Masking-Based Contrastive Learning of Musical Representations☆17Mar 31, 2025Updated last year
- ☆18May 15, 2025Updated 11 months ago
- [ICLR 2025] Causal Graphical Models for Vision-Language Compositional Understanding☆10Apr 15, 2025Updated last year