Xiaofeng-Han-Res / MF-RVLinks
A survey on Multimodal Fusion for Robot Vision
☆29Updated 3 weeks ago
Alternatives and similar repositories for MF-RV
Users that are interested in MF-RV are comparing it to the libraries listed below
Sorting:
- [ICLR 2025] SPA: 3D Spatial-Awareness Enables Effective Embodied Representation☆168Updated 4 months ago
- ☆89Updated 10 months ago
- AIR-Embodied: An Efficient Active 3DGS-based Interaction and Reconstruction Framework with Embodied Large Language Model☆19Updated 6 months ago
- ☆34Updated 3 months ago
- ✨✨【NeurIPS 2025】Official implementation of BridgeVLA☆154Updated last month
- ☆37Updated last month
- ☆200Updated 3 months ago
- [ICCV 2025] GLEAM: Learning Generalizable Exploration Policy for Active Mapping in Complex 3D Indoor Scene☆141Updated last month
- [RA-L 2024] GaussianGrasper: 3D Language Gaussian Splatting for Open-vocabulary Robotic Grasping☆162Updated last year
- Official implementation of "g3D-LF: Generalizable 3D-Language Feature Fields for Embodied Tasks" (CVPR'25).☆42Updated 3 months ago
- 3DAffordSplat: Efficient Affordance Reasoning with 3D Gaussians (ACM MM 25)☆57Updated 3 months ago
- [RA-L 2025] Dynamic Open-Vocabulary 3D Scene Graphs for Long-term Language-Guided Mobile Manipulation☆116Updated 6 months ago
- Official implementation of Human-Aware Vision-and-Language Navigation: Bridging Simulation to Reality with Dynamic Human Interactions (Ne…☆46Updated 10 months ago
- Open-source implementations on real robots☆34Updated 11 months ago
- ☆111Updated last month
- [NeurIPS 2025] DreamVLA: A Vision-Language-Action Model Dreamed with Comprehensive World Knowledge☆213Updated last month
- [CVPR'25] SeeGround: See and Ground for Zero-Shot Open-Vocabulary 3D Visual Grounding☆182Updated 6 months ago
- GraspSplats: Efficient Manipulation with 3D Feature Splatting☆137Updated 11 months ago
- This repository is the official implementation of our paper (From reactive to cognitive: brain-inspired spatial intelligence for embodied…☆63Updated 2 months ago
- [NeurIPS 2025 Spotlight] SoFar: Language-Grounded Orientation Bridges Spatial Reasoning and Object Manipulation☆200Updated 4 months ago
- [CVPR 2025]Lift3D Foundation Policy: Lifting 2D Large-Scale Pretrained Models for Robust 3D Robotic Manipulation☆166Updated 4 months ago
- Official implementation of ReconVLA: Reconstructive Vision-Language-Action Model as Effective Robot Perceiver.☆58Updated last month
- A curated list of large VLM-based VLA models for robotic manipulation.☆231Updated last month
- Official implementation of Sim-to-Real Transfer via 3D Feature Fields for Vision-and-Language Navigation (CoRL'24).☆73Updated 8 months ago
- [CVPR 2025] UniGoal: Towards Universal Zero-shot Goal-oriented Navigation☆258Updated last month
- [ECCV 2024] Omni6DPose: A Benchmark and Model for Universal 6D Object Pose Estimation and Tracking☆109Updated last year
- Evo-0: Vision-Language-Action Model with Implicit Spatial Understanding.☆40Updated 3 months ago
- [ICRA 2025] Official implementation of Open-Nav: Exploring Zero-Shot Vision-and-Language Navigation in Continuous Environment with Open-S…☆80Updated 5 months ago
- PyTorch implementation of paper: GaussNav: Gaussian Splatting for Visual Navigation☆161Updated 11 months ago
- Official repository of General Scene Adaptation for Vision-and-Language Navigation (ICLR'2025)☆52Updated 6 months ago