lanlan96 / BoxFusionLinks
[PG 2025] BoxFusion: Reconstruction-Free Open-Vocabulary 3D Object Detection via Real-Time Multi-View Box Fusion
☆56Updated 2 weeks ago
Alternatives and similar repositories for BoxFusion
Users that are interested in BoxFusion are comparing it to the libraries listed below
Sorting:
- [2025CVPR] FlowRAM: Grounding Flow Matching Policy with Region-Aware Mamba Framework for Robotic Manipulation☆51Updated 2 months ago
- [ICLR 2025] NextBestPath: Efficient 3D Mapping of Unseen Environments☆75Updated 2 months ago
- [TRO 2025] OmniMap: A General Mapping Framework Integrating Optics, Geometry, and Semantics☆136Updated last month
- Feed-Forward SceneDINO for Unsupervised Semantic Scene Completion (ICCV 2025)☆77Updated 4 months ago
- ☆43Updated 2 months ago
- 📚 2025 Scene Graph ArXiv Paper List — Updated Daily☆15Updated this week
- ☆53Updated 2 months ago
- [CoRL 2025] See, Point, Fly: A Learning-Free VLM Framework for Universal Unmanned Aerial Navigation☆102Updated last week
- ☆89Updated 11 months ago
- [CVPR2025] ProxyTransformation : Preshaping Point Cloud Manifold With Proxy Attention For 3D Visual Grounding☆48Updated 5 months ago
- [NeurIPS 2025] SPIRAL: Semantic-Aware Progressive LiDAR Scene Generation and Understanding☆43Updated 2 months ago
- [CoRL 2025] GC-VLN: Instruction as Graph Constraints for Training-free Vision-and-Language Navigation☆63Updated 4 months ago
- 🔥RayletDF in PyTorch (ICCV 2025 Highlight)☆37Updated 3 months ago
- 3D Diffusion Semantic Scenes☆49Updated 2 months ago
- Source code for [ECCV2024]O2V-Mapping: Online Open-Vocabulary Mapping with Neural Implicit Representation☆21Updated 10 months ago
- Official implementation of "Dynam3D: Dynamic Layered 3D Tokens Empower VLM for Vision-and-Language Navigation" (NeurIPS'25 Oral)☆75Updated last month
- [CoRL 2025] CogniPlan: Uncertainty-Guided Path Planning with Conditional Generative Layout Prediction - Public code and model☆42Updated last week
- [ICCV'25] 3D-MOOD: Lifting 2D to 3D for Monocular Open-Set Object Detection☆109Updated 3 months ago
- [RSS 2025] GauSS-MI: Gaussian Splatting Shannon Mutual Information for Active 3D Reconstruction☆82Updated 4 months ago
- Official implementation of "AirSim360: A Panoramic Simulation Platform within Drone View"☆84Updated last month
- Toolbox for the OpenLex3D benchmark☆33Updated 3 months ago
- [IEEE IROS'25] GSPR: Multimodal Place Recognition using 3D Gaussian Splatting for Autonomous Driving☆55Updated 3 months ago
- [CVPR 2025] PanoGS: Gaussian-based Panoptic Segmentation for 3D Open Vocabulary Scene Understanding☆95Updated 7 months ago
- [CVPR 2025] MAC-Ego3D: Multi-Agent Gaussian Consensus for Real-Time Collaborative Ego-Motion and Photorealistic 3D Reconstruction☆72Updated 9 months ago
- [ICRA25] A Novel Decomposed Feature-Oriented Framework for Open-Set Semantic Segmentation on LiDAR Data☆23Updated 6 months ago
- ☆27Updated 8 months ago
- [RAL-25] An online open-vocabulary mapping system that enables natural language querying to navigate dynamic scenes, with ROS support.☆151Updated last month
- [IROS 2025] Source code for "RayFronts: Open-Set Semantic Ray Frontiers for Online Scene Understanding and Exploration"☆112Updated last week
- SORT3D, an LLM-based object-centric grounding and indoor navigation system employing a spatial reasoning toolbox and state of the art 2D …☆87Updated 6 months ago
- Official implementation of IROS 2025 paper Pseudo Depth Meets Gaussian: A Feed-forward RGB SLAM Baseline☆50Updated 5 months ago