Insta360-Research-Team / panoramic-vision-survey
☆308 Updated 3 months ago
Alternatives and similar repositories for panoramic-vision-survey
Users who are interested in panoramic-vision-survey are comparing it to the repositories listed below.
- Forging Spatial Intelligence: A Roadmap of Multi-Modal Data Pre-Training for Autonomous Systems ☆128 Updated last week
- Think with 3D: Geometric Imagination Grounded Spatial Reasoning from Limited Views ☆176 Updated last month
- Official implementation of "Enhancing Close-up Novel View Synthesis via Pseudo-labeling" [AAAI 2025] ☆15 Updated 9 months ago
- [NeurIPS 2025] More Than Generation: Unifying Generation and Depth Estimation via Text-to-Image Diffusion Models ☆215 Updated 2 months ago
- [ICCV2025 Highlight] Stereo Any Video: Temporally Consistent Stereo Matching ☆385 Updated last month
- [NeurIPS 2025] NAUTILUS: A Large Multimodal Model for Underwater Scene Understanding ☆351 Updated last month
- [ICLR 2025] Ctrl-U: Robust Conditional Image Generation via Uncertainty-aware Reward Modeling ☆82 Updated this week
- Official Implementation of Puzzles: Unbounded Video-Depth Augmentation for Scalable, End-to-End 3D Reconstruction. ☆210 Updated 4 months ago
- Match-Stereo-Videos via Bidirectional Alignment (An update of BiDAStereo) ☆83 Updated last month
- Hybrid SfM with VIO pose, RGB, and depth data ☆52 Updated 2 years ago
- [CVPR 2024 Highlight] DiVa360 dataset ☆94 Updated 6 months ago
- Wan2.1 with Controlnet ☆180 Updated 9 months ago
- Text-to-3D Generation by 2D Editing ☆112 Updated 6 months ago
- This is the source code for the ECCV paper "MTFormer: Multi-Task Learning via Transformer and Cross-Task Reasoning" ☆199 Updated 3 years ago
- [CVPR 2025] The code and model for our paper "Shadow Generation Using Diffusion Model with Geometry Prior", CVPR, 2025. ☆140 Updated last month
- [AAAI 2026 🔥] Official implementation of "NeuralGS: Bridging Neural Fields and 3D Gaussian Splatting for Compact 3D Representation" ☆176 Updated 5 months ago
- DDN-SLAM: Real-time Dense Dynamic Neural Implicit SLAM (RA-L 2025) ☆202 Updated last month
- ☆543 Updated 2 months ago
- Official implementation for "HA-VLN 2.0: An Open Benchmark and Leaderboard for Human-Aware Navigation in Discrete and Continuous Environm… ☆378 Updated last month
- ☆385 Updated 6 months ago
- SCFlow2: Plug-and-Play Object Pose Refiner with Shape-Constraint Scene Flow, CVPR2025 ☆55 Updated 4 months ago
- [NeurIPS'2025] Official repository for "LiveStar: Live Streaming Assistant for Real-World Online Video Understanding" ☆103 Updated last month
- [ICCV 2025] LocalDyGS : Multi-view Global Dynamic Scene Modeling through Adaptive Local Feature Decoupling ☆109 Updated last month
- Official implementation of paper "Unified World Models: Memory-Augmented Planning and Foresight for Visual Navigation" ☆267 Updated 2 months ago
- Official code of TSAR-MVS (Pattern Recognition 2024) ☆68 Updated 3 months ago
- [ICLR 2025] This is the official implementation of Swift4d: Adaptive divide-and-conquer Gaussian Splatting for compact and efficient reconstructi… ☆144 Updated this week
- [ICCV’25] Official implementation of paper "Towards Performance Consistency in Multi-Level Model Collaboration" ☆43 Updated 2 months ago
- https://www.kaggle.com/competitions/image-matching-challenge-2022 ☆45 Updated 2 years ago
- ☆246 Updated last year
- [NeurIPS 2025 DB Track] 3EED: Ground Everything Everywhere in 3D ☆198 Updated 3 weeks ago