worldbench / awesome-spatial-intelligence
Forging Spatial Intelligence: A Roadmap of Multi-Modal Data Pre-Training for Autonomous Systems
☆122, updated last week
Alternatives and similar repositories for awesome-spatial-intelligence
Users that are interested in awesome-spatial-intelligence are comparing it to the libraries listed below
- ☆308, updated 3 months ago
- Official Implementation of Puzzles: Unbounded Video-Depth Augmentation for Scalable, End-to-End 3D Reconstruction. ☆210, updated 3 months ago
- WorldLens: Full-Spectrum Evaluations of Driving World Models in Real World ☆170, updated 3 weeks ago
- [NeurIPS 2025] More Than Generation: Unifying Generation and Depth Estimation via Text-to-Image Diffusion Models ☆215, updated 2 months ago
- Think with 3D: Geometric Imagination Grounded Spatial Reasoning from Limited Views ☆176, updated last month
- [ICCV2025 Highlight] Stereo Any Video: Temporally Consistent Stereo Matching ☆384, updated last month
- [NeurIPS 2025 DB Track] 3EED: Ground Everything Everywhere in 3D ☆197, updated 2 weeks ago
- [ICLR 2025] Ctrl-U: Robust Conditional Image Generation via Uncertainty-aware Reward Modeling ☆82, updated this week
- This is the source code for the ECCV paper "MTFormer: Multi-Task Learning via Transformer and Cross-Task Reasoning" ☆199, updated 3 years ago
- Official implementation of "Enhancing Close-up Novel View Synthesis via Pseudo-labeling" [AAAI 2025] ☆15, updated 9 months ago
- Wan2.1 with Controlnet ☆179, updated 9 months ago
- Match-Stereo-Videos via Bidirectional Alignment (An update of BiDAStereo) ☆83, updated last month
- [AAAI 2026 Oral] LiDARCrafter: Dynamic 4D World Modeling from LiDAR Sequences ☆181, updated last month
- [ICCV 2025] Perspective-Invariant 3D Object Detection ☆152, updated 3 weeks ago
- ☆92, updated 6 months ago
- This repository contains the code of the paper "IC-World: In-Context Generation for Shared World Modeling". ☆90, updated 2 weeks ago
- WAM-Diff: A Masked Diffusion VLA Framework with MoE and Online Reinforcement Learning for Autonomous Driving ☆71, updated 3 weeks ago
- [NeurIPS 25] TrackingWorld: World-centric Monocular 3D Tracking of Almost All Pixels ☆170, updated 3 weeks ago
- [AAAI 2026] Official implementation of "NeuralGS: Bridging Neural Fields and 3D Gaussian Splatting for Compact 3D Representation" ☆175, updated 5 months ago
- [CoRL2024] Let Occ Flow: Self-Supervised 3D Occupancy Flow Prediction ☆128, updated 3 months ago
- Hybrid SfM with VIO pose, RGB, and depth data ☆52, updated 2 years ago
- [NeurIPS 2025] NAUTILUS: A Large Multimodal Model for Underwater Scene Understanding ☆351, updated 3 weeks ago
- Official implementation for "HA-VLN 2.0: An Open Benchmark and Leaderboard for Human-Aware Navigation in Discrete and Continuous Environm… ☆377, updated 3 weeks ago
- Official implementation of paper "Unified World Models: Memory-Augmented Planning and Foresight for Visual Navigation" ☆265, updated 2 months ago
- OmniNWM: Omniscient Navigation World Models for Autonomous Driving ☆268, updated 2 months ago
- [CVPR 2025] The code and model for our paper "Shadow Generation Using Diffusion Model with Geometry Prior", CVPR, 2025. ☆139, updated last month
- [NeurIPS'2025] Official repository for "LiveStar: Live Streaming Assistant for Real-World Online Video Understanding" ☆101, updated last month
- Data and sample evaluation codes for Multimodal Rewardbench 2 ☆123, updated 3 weeks ago
- Text-to-3D Generation by 2D Editing ☆112, updated 5 months ago
- [CVPR 2024 Highlight] DiVa360 dataset ☆94, updated 6 months ago