zhanghm1995 / Awesome-VAR
A curated list of resources focused on Visual AutoRegressive Modeling, makes GPT-style AR models surpass diffusion transformers in image generation.
☆18Updated 2 weeks ago
Alternatives and similar repositories for Awesome-VAR:
Users that are interested in Awesome-VAR are comparing it to the libraries listed below
- Multi-Space Alignments Towards Universal LiDAR Segmentation☆41Updated 2 months ago
- [ECCV 2024] Official implementation of "RangeLDM: Fast Realistic LiDAR Point Cloud Generation"☆25Updated last month
- [NeurIPS 2024] XMask3D: Cross-modal Mask Reasoning for Open Vocabulary 3D Semantic Segmentation☆27Updated last month
- [ECCV 2024] 4D Contrastive Superflows are Dense 3D Representation Learners☆42Updated 2 weeks ago
- ☆25Updated 3 weeks ago
- ☆44Updated last month
- Fine-grained Image-to-LiDAR Contrastive Distillation with Visual Foundation Models (NeurIPS2024)☆29Updated last month
- ☆62Updated 3 weeks ago
- ☆30Updated last month
- Project Page for GaussianFormer☆24Updated 7 months ago
- The official implementation of "Not All Voxels Are Equal: Hardness-Aware Semantic Scene Completion with Self-Distillation" (CVPR 2024)☆28Updated 5 months ago
- ☆31Updated 5 months ago
- ☆46Updated 11 months ago
- [WACV 2025] Calib3D: Calibrating Model Preferences for Reliable 3D Scene Understanding☆46Updated 9 months ago
- [NeurIPS 2024 Spotlight] Context and Geometry Aware Voxel Transformer for Semantic Scene Completion☆60Updated 3 weeks ago
- Official code of "Segment any 3D Object with Language"☆39Updated 8 months ago
- LiDAR Data Synthesis with Denoising Diffusion Probabilistic Models (ICRA 2024)☆56Updated 6 months ago
- ScatterFormer: Efficient Voxel Transformer with Scattered Linear Attention (ECCV 2024)☆75Updated 5 months ago
- Stag-1: Towards Realistic 4D Driving Simulation with Video Generation Model☆67Updated last month
- DynamicCity: Large-Scale LiDAR Generation from Dynamic Scenes☆56Updated 2 months ago
- ☆28Updated 5 months ago
- Vispy-based NuScenes Visualization Toolkit☆14Updated last year
- [CVPR 2024] DifFlow3D: Toward Robust Uncertainty-Aware Scene Flow Estimation with Iterative Diffusion-Based Refinement☆49Updated 3 months ago
- [ECCV 2024] Monocular Occupancy Prediction for Scalable Indoor Scenes☆37Updated 3 months ago
- ☆49Updated last year
- officical code for ECCV 2024 paper "Global-Local Collaborative Inference with LLM for Lidar-Based Open-Vocabulary Detection"☆13Updated 6 months ago
- ☆69Updated last week
- [CVPR2024] Multiagent Multitraversal Multimodal Self-Driving: Open MARS Dataset☆46Updated 6 months ago
- Official code implementation for the paper "X-Drive: Cross-modality Consistent Multi-Sensor Data Synthesis for Driving Scenarios"☆28Updated 2 months ago
- PyTorch code and models for ScaLR image-to-lidar distillation method☆47Updated 6 months ago