zhanghm1995 / Awesome-VAR
A curated list of resources focused on Visual AutoRegressive Modeling, makes GPT-style AR models surpass diffusion transformers in image generation.
☆23Updated this week
Alternatives and similar repositories for Awesome-VAR:
Users that are interested in Awesome-VAR are comparing it to the libraries listed below
- PerlDiff: Controllable Street View Synthesis Using Perspective-Layout Diffusion Models☆38Updated last month
- Project Page for GaussianFormer☆24Updated 8 months ago
- ☆76Updated last month
- [WACV 2025 Oral] Calib3D: Calibrating Model Preferences for Reliable 3D Scene Understanding☆47Updated 10 months ago
- [ECCV 2024] 4D Contrastive Superflows are Dense 3D Representation Learners☆42Updated last month
- ☆35Updated last month
- Official implementation of paper "Pyramid Diffusion for Fine 3D Large Scene Generation" (ECCV 2024 Oral)☆113Updated 4 months ago
- Stag-1: Towards Realistic 4D Driving Simulation with Video Generation Model☆75Updated 2 months ago
- [ECCV 2024] Official implementation of "RangeLDM: Fast Realistic LiDAR Point Cloud Generation"☆26Updated 2 months ago
- [NeurIPS 2024] XMask3D: Cross-modal Mask Reasoning for Open Vocabulary 3D Semantic Segmentation☆31Updated last month
- ☆31Updated 3 months ago
- ☆21Updated 10 months ago
- ☆37Updated 7 months ago
- officical code for ECCV 2024 paper "Global-Local Collaborative Inference with LLM for Lidar-Based Open-Vocabulary Detection"☆14Updated 7 months ago
- Multi-Space Alignments Towards Universal LiDAR Segmentation☆42Updated 3 months ago
- GaussTR: Foundation Model-Aligned Gaussian Transformer for Self-Supervised 3D Spatial Understanding☆49Updated last week
- [ECCV 2024] DriveDreamer: Towards Real-world-driven World Models for Autonomous Driving☆21Updated 2 months ago
- GaussianStego: A Generalizable Stenography Pipeline for Generative 3D Gaussians Splatting☆19Updated 7 months ago
- This is the official implementation of "LSK3DNet: Towards Effective and Efficient 3D Perception with Large Sparse Kernels" (Accepted at C…☆24Updated 8 months ago
- Official code implementation for the paper "X-Drive: Cross-modality Consistent Multi-Sensor Data Synthesis for Driving Scenarios"☆35Updated 3 months ago
- ☆44Updated 2 months ago
- official code of *DOME: Taming Diffusion Model into High-Fidelity Controllable Occupancy World Model*☆32Updated last month
- OpenAD: Open-World Autonomous Driving Benchmark for 3D Object Detection☆37Updated 2 months ago
- Driving in the Occupancy World: Vision-Centric 4D Occupancy Forecasting and Planning via World Models for Autonomous Driving (AAAI-25)☆20Updated last week
- ScatterFormer: Efficient Voxel Transformer with Scattered Linear Attention (ECCV 2024)☆75Updated 7 months ago
- Official Code Release of Delphi☆54Updated 8 months ago