hustvl / GaussTR
GaussTR: Foundation Model-Aligned Gaussian Transformer for Self-Supervised 3D Spatial Understanding
☆42Updated this week
Alternatives and similar repositories for GaussTR:
Users that are interested in GaussTR are comparing it to the libraries listed below
- ☆62Updated 3 weeks ago
- Stag-1: Towards Realistic 4D Driving Simulation with Video Generation Model☆67Updated last month
- [ECCV 2024] Occupancy as Set of Points☆86Updated 6 months ago
- ☆25Updated 3 weeks ago
- ☆29Updated last month
- Doe-1: Closed-Loop Autonomous Driving with Large World Model☆73Updated 3 weeks ago
- Project Page for GaussianFormer☆24Updated 7 months ago
- WoVoGen: World Volume-aware Diffusion for Controllable Multi-camera Driving Scene Generation☆92Updated 3 months ago
- Omni-Scene: Omni-Gaussian Representation for Ego-Centric Sparse-View Scene Reconstruction☆13Updated last month
- DynamicCity: Large-Scale LiDAR Generation from Dynamic Scenes☆56Updated 2 months ago
- ☆27Updated 4 months ago
- ☆90Updated last month
- [WACV 2025] Calib3D: Calibrating Model Preferences for Reliable 3D Scene Understanding☆46Updated 9 months ago
- Official implementation of paper "Pyramid Diffusion for Fine 3D Large Scene Generation" (ECCV 2024 Oral)☆111Updated 2 months ago
- [ECCV 2024] TOD3Cap: Towards 3D Dense Captioning in Outdoor Scenes☆106Updated this week
- [ECCV 2024] 4D Contrastive Superflows are Dense 3D Representation Learners☆42Updated 2 weeks ago
- ☆16Updated last year
- Official code of DMA: Dense Multimodal Alignment for Open-Vocabulary 3D Scene Understanding, ECCV 2024☆25Updated 5 months ago
- [CVPR 2024] 🏡Know Your Neighbors: Improving Single-View Reconstruction via Spatial Vision-Language Reasoning☆68Updated 9 months ago
- ☆69Updated last week
- [NeurIPS 2024] DrivingDojo Dataset: Advancing Interactive and Knowledge-Enriched Driving World Model☆58Updated last month
- Official Code Release of Delphi☆53Updated 7 months ago
- Official code of "Segment any 3D Object with Language"☆39Updated 8 months ago
- Open-Vocabulary SAM3D: Understand Any 3D Scene☆27Updated 4 months ago
- [NeurIPS 2024] Lexicon3D: Probing Visual Foundation Models for Complex 3D Scene Understanding☆67Updated last month
- Code for "DrivingWorld: Constructing World Model for Autonomous Driving via Video GPT"☆108Updated this week
- ☆44Updated last month
- MIM4D: Masked Modeling with Multi-View Video for Autonomous Driving Representation Learning☆62Updated 9 months ago