Latest Advances on Autoregressive Visual Models.π
β28Mar 15, 2025Updated last year
Alternatives and similar repositories for Awesome-Visual-Autoregressive-Model
Users that are interested in Awesome-Visual-Autoregressive-Model are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- CoDi:Subject-Consistent and Pose-Diverse Text-to-Image Generationβ37Aug 1, 2025Updated 7 months ago
- TextCrafter: Accurately Rendering Multiple Texts in Complex Visual Scenesβ95Nov 26, 2025Updated 4 months ago
- Official implementation for AAAI 2025 paper: SSAN: A Symbol Spatial-Aware Network for Handwritten Mathematical Expression Recognitionβ16Jan 21, 2025Updated last year
- [AAAI 2025] Exploiting Multimodal Spatial-temporal Patterns for Video Object Trackingβ116May 18, 2025Updated 10 months ago
- [CVPR 2025] InstanceCap: Improving Text-to-Video Generation via Instance-aware Structured Caption πβ46Jul 5, 2025Updated 8 months ago
- Managed hosting for WordPress and PHP on Cloudways β’ AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- [ICLR 2026] ContextGen: Contextual Layout Anchoring for Identity-Consistent Multi-Instance Generationβ71Feb 12, 2026Updated last month
- [ICLR 2026] MotionSight's official code implementation.β47Feb 13, 2026Updated last month
- A collection of diffusion models based on FLUX/DiT for image/video generation, editing, reconstruction, inpainting .etc.β86Jun 20, 2025Updated 9 months ago
- The implementation of Decoupling Layout from Glyph in Online Chinese Handwriting Generation (ICLR 2025)β24May 26, 2025Updated 10 months ago
- a collection of awesome autoregressive visual generation modelsβ80Apr 17, 2025Updated 11 months ago
- [CVPR'24] Handwritten Mathematical Expressions Generation (HMEG)β31Jun 3, 2024Updated last year
- [CVPR 2025] Official code of "From Zero to Detail: Deconstructing Ultra-High-Definition Image Restoration from Progressive Spectral Perspβ¦β49Apr 2, 2025Updated 11 months ago
- A curated list of resources focused on Visual AutoRegressive Modeling, makes GPT-style AR models surpass diffusion transformers in image β¦β40Mar 2, 2025Updated last year
- official code for unigameβ19Nov 26, 2025Updated 4 months ago
- Managed hosting for WordPress and PHP on Cloudways β’ AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- Explicit Context Reasoning with Supervision for Visual Tracking (ACM MM 25)β18Jul 20, 2025Updated 8 months ago
- Official implementation for ICDAR 2024 Oral paper "ICAL: Implicit Character-Aided Learning for Enhanced Handwritten Mathematical Expressiβ¦β28Aug 16, 2024Updated last year
- [ICCV 2025] Region-Aware Text-to-Image Generation via Hard Binding and Soft Refinement π₯β620Dec 12, 2025Updated 3 months ago
- (NeurIPS 2025 D&B Track) OverLayBench: A Benchmark for Layout-to-Image Generation with Dense Overlapsβ26Updated this week
- β13Apr 5, 2020Updated 5 years ago
- Repository of Calculus (A) I Course Materials for the Autumn-Winter Semester of the 2024-2025 Academic Year at Zhejiang University.β10Jan 25, 2026Updated 2 months ago
- β35Feb 15, 2026Updated last month
- [Trans. on Graphics (ToG) 2024] Official code release for paper: π―"DARTS: Diffusion Approximated Residual Time Sampling for Time-of-fligβ¦β17Dec 24, 2024Updated last year
- [NIPS 2025] Seg2Any: Open-set Segmentation-Mask-to-Image Generation with Precise Shape and Semantic Controlβ46Dec 2, 2025Updated 3 months ago
- Wordpress hosting with auto-scaling on Cloudways β’ AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- This is the official repository of UltraHR-100K.β46Nov 21, 2025Updated 4 months ago
- β69Aug 13, 2025Updated 7 months ago
- Official repository of Myna: Masking-Based Contrastive Learning of Musical Representationsβ17Mar 31, 2025Updated 11 months ago
- Code and resources for SIGGRAPH 2023 paper NeuSample: Importance Sampling for Neural Materialsβ17Jul 15, 2024Updated last year
- β18May 15, 2025Updated 10 months ago
- Video2Tasks: Split multi-task robot videos into single-task segments with auto-generated instruction labels for VLA (pi0, OpenVLA) trainiβ¦β49Feb 28, 2026Updated last month
- β11Dec 28, 2023Updated 2 years ago
- In OLHWDB ,you can find the ptts files, this code can help you get the information of the pttsβ11Mar 8, 2022Updated 4 years ago
- [SIGGRAPH 2024] Temporally Stable Metropolis Light Transport Denoising using Recurrent Transformer Blocksβ19Jul 31, 2024Updated last year
- Wordpress hosting with auto-scaling on Cloudways β’ AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- M5Product Main Page.β14Mar 12, 2022Updated 4 years ago
- Official model implementation and benchmark evaluation repository of <AnyEdit: Unified High-Quality Image Edit with Any Idea>β32Jul 18, 2025Updated 8 months ago
- ControlText: Unlocking Controllable Fonts in Multilingual Text Rendering without Font Annotationsβ34Apr 3, 2025Updated 11 months ago
- Generative Expressive Conversational Speech Synthesis (Accepted by MM'2024)β78Nov 1, 2024Updated last year
- Text-To-Image Generation with Chinese Charactersβ23Jan 16, 2026Updated 2 months ago
- High-Resolution Visual Reasoning via Multi-Turn Grounding-Based Reinforcement Learningβ53Jul 23, 2025Updated 8 months ago
- Official Repository of VisGym: Diverse, Customizable, Scalable Environments for Multimodal Agentsβ103Mar 10, 2026Updated 2 weeks ago