zhanghm1995 / Awesome-VAR
A curated list of resources focused on Visual AutoRegressive Modeling, makes GPT-style AR models surpass diffusion transformers in image generation.
☆31Updated 2 months ago
Alternatives and similar repositories for Awesome-VAR
Users that are interested in Awesome-VAR are comparing it to the libraries listed below
Sorting:
- PerLDiff: Controllable Street View Synthesis Using Perspective-Layout Diffusion Models☆44Updated 3 months ago
- [CVPR 2025 (Oral)] Open implementation of "RandAR"☆134Updated last month
- [CVPR 2025] A Unified Image-Dense Annotation Generation Model for Underwater Scenes☆22Updated last month
- Official PyTorch implementation of paper “InsViE-1M: Effective Instruction-based Video Editing with Elaborate Dataset Construction”☆15Updated last week
- Official code of DMA: Dense Multimodal Alignment for Open-Vocabulary 3D Scene Understanding, ECCV 2024☆29Updated 9 months ago
- This repository is dedicated to Track 2 of the W-CODA 2024 Workshop, "Multimodal Perception and Comprehension of Corner Cases in Autonomo…☆10Updated 11 months ago
- Official Github Repo for GEM☆52Updated 2 weeks ago
- Pytorch implementation of GaussianToken: An Effective Image Tokenizer with 2D Gaussian Splatting☆78Updated last month
- ☆38Updated 9 months ago
- Official PyTorch codes for "Open Vocabulary 3D Scene Understanding via Geometry Guided Self-Distillation", ECCV2024☆28Updated 9 months ago
- ☆29Updated 8 months ago
- [CVPR 2025 Highlight🔥] Official code repository for "Inst3D-LMM: Instance-Aware 3D Scene Understanding with Multi-modal Instruction Tuni…☆77Updated 3 weeks ago
- PyTorch implementation of DiffMoE, TC-DiT, EC-DiT and Dense DiT☆78Updated 3 weeks ago
- Paper: UniGS: Unified Language-Image-3D Pretraining with Gaussian Splatting☆16Updated 2 months ago
- [ICLR 2025] Semi-Supervised Vision-Centric 3D Occupancy World Model for Autonomous Driving☆32Updated 3 months ago
- Official implementation of paper "Pyramid Diffusion for Fine 3D Large Scene Generation" (ECCV 2024 Oral)☆125Updated last month
- [NeurIPS 2024] XMask3D: Cross-modal Mask Reasoning for Open Vocabulary 3D Semantic Segmentation☆32Updated 3 months ago
- Official Code for 'TAR3D: Creating High-Quality 3D Assets via Next-Part Prediction'☆56Updated 4 months ago
- Official PyTorch implementation of GeoDiffusion in ICLR 2024 (https://arxiv.org/abs/2306.04607)☆85Updated 4 months ago
- Curated list of recent visual autoregressive (VAR) modeling works☆30Updated last month
- [CVPR 2025] DiG: Scalable and Efficient Diffusion Models with Gated Linear Attention☆164Updated 2 months ago
- This is the official implementation for ControlVAR.☆107Updated 5 months ago
- Generate one 2K image on single 3090 GPU!☆31Updated last month
- [CVPR 2024] Customize your NeRF: Adaptive Source Driven 3D Scene Editing via Local-Global Iterative Training☆39Updated last year
- Project Page for GaussianFormer☆25Updated 11 months ago
- officical code for ECCV 2024 paper "Global-Local Collaborative Inference with LLM for Lidar-Based Open-Vocabulary Detection"☆14Updated 10 months ago
- [CVPR 2024] Exploiting Diffusion Prior for Generalizable Dense Prediction☆75Updated last year
- [CVPR 2025] GaussTR: Foundation Model-Aligned Gaussian Transformer for Self-Supervised 3D Spatial Understanding☆138Updated last month
- Code Release of Harmonizing Visual Representations for Unified Multimodal Understanding and Generation☆97Updated last month
- [ECCV 2024] Official implementation of "RangeLDM: Fast Realistic LiDAR Point Cloud Generation"☆32Updated 5 months ago