worldbench / awesome-spatial-intelligenceLinks
π Forging Spatial Intelligence: A Roadmap of Multi-Modal Data Pre-Training for Autonomous Systems
β135Updated last week
Alternatives and similar repositories for awesome-spatial-intelligence
Users that are interested in awesome-spatial-intelligence are comparing it to the libraries listed below
Sorting:
- Official Implementation of Puzzles: Unbounded Video-Depth Augmentation for Scalable, End-to-End 3D Reconstruction.β210Updated 4 months ago
- β321Updated 3 months ago
- [NeurIPS 2025 DB Track] 3EED: Ground Everything Everywhere in 3Dβ203Updated last month
- Think with 3D: Geometric Imagination Grounded Spatial Reasoning from Limited Viewsβ181Updated 2 months ago
- π WorldLens: Full-Spectrum Evaluations of Driving World Models in Real Worldβ178Updated 3 weeks ago
- [ICCV 2025] Perspective-Invariant 3D Object Detectionβ158Updated last month
- Official implemetation of "Enhancing Close-up Novel View Synthesis via Pseudo-labeling" [AAAI 2025]β15Updated 9 months ago
- WAM-Diff: A Masked Diffusion VLA Framework with MoE and Online Reinforcement Learning for Autonomous Drivingβ166Updated last week
- [NeurIPS 2025] More Than Generation: Unifying Generation and Depth Estimation via Text-to-Image Diffusion Modelsβ215Updated 3 months ago
- [ICCV2025 Highlight] Stereo Any Video: Temporally Consistent Stereo Matchingβ385Updated last week
- [AAAI 2026 Oral] LiDARCrafter: Dynamic 4D World Modeling from LiDAR Sequencesβ184Updated last month
- β93Updated 7 months ago
- Match-Stereo-Videos via Bidirectional Alignment (An update of BiDAStereo)β83Updated 2 months ago
- [ICLR 2025] Ctrl-U: Robust Conditional Image Generation via Uncertainty-aware Reward Modelingβ82Updated last week
- [CoRL2024] Let Occ Flow: Self-Supervised 3D Occupancy Flow Predictionβ130Updated 4 months ago
- This repository contains the code of the paper "IC-World: In-Context Generation for Shared World Modeling".β123Updated 3 weeks ago
- Wan2.1 with Controlnetβ182Updated 10 months ago
- Official implementation for "HA-VLN 2.0: An Open Benchmark and Leaderboard for Human-Aware Navigation in Discrete and Continuous Environmβ¦β380Updated last month
- This is the source code for the ECCV paper "MTFormer: Multi-Task Learning via Transformer and Cross-Task Reasoning"β199Updated 3 years ago
- [NeurIPS 2025] NAUTILUS: A Large Multimodal Model for Underwater Scene Understandingβ350Updated last month
- hybrid sfm with VIO Pose,RGB and depth dataβ52Updated 2 years ago
- π₯ The first open-sourced diffusion vision-langauge-action model.β160Updated last month
- SeeU: Seeing the Unseen World via 4D Dynamics-aware Generationβ34Updated 2 months ago
- https://www.kaggle.com/competitions/image-matching-challenge-2022β45Updated 2 years ago
- Official implementation of paper "Unified World Models: Memory-Augmented Planning and Foresight for Visual Navigation"β270Updated 3 months ago
- [AAAI 2026 π₯] Official implementation of "NeuralGS: Bridging Neural Fields and 3D Gaussian Splatting for Compact 3D Representation"β176Updated 5 months ago
- [ICRA 2026] A Unified Driving World Model for Future Generation and Perceptionβ136Updated this week
- [NeurIPS 25] TrackingWorld: World-centric Monocular 3D Tracking of Almost All Pixelsβ180Updated last month
- This is an open source project that can track and segment specific objects in video streams by manual clicks, box selections, or text proβ¦β148Updated last month
- β389Updated 6 months ago