Xiaofeng-Han-Res / MF-RV
A survey on Multimodal Fusion for Robot Vision
☆34Updated 3 months ago
Alternatives and similar repositories for MF-RV
Users who are interested in MF-RV are comparing it to the repositories listed below:
- Official implementation of "g3D-LF: Generalizable 3D-Language Feature Fields for Embodied Tasks" (CVPR'25).☆46Updated 5 months ago
- ☆99Updated last year
- [CoRL 2025] Repository relating to "TrackVLA: Embodied Visual Tracking in the Wild"☆315Updated last month
- [RA-L 2025] Dynamic Open-Vocabulary 3D Scene Graphs for Long-term Language-Guided Mobile Manipulation☆132Updated 8 months ago
- [CVPR 2025] UniGoal: Towards Universal Zero-shot Goal-oriented Navigation☆289Updated 3 months ago
- This repository is the official implementation of our paper (From reactive to cognitive: brain-inspired spatial intelligence for embodied…☆69Updated 2 months ago
- ✨✨【NeurIPS 2025】Official implementation of BridgeVLA☆163Updated 3 months ago
- ☆35Updated 5 months ago
- ☆223Updated 5 months ago
- ☆130Updated 3 months ago
- [ECCV 2024] Omni6DPose: A Benchmark and Model for Universal 6D Object Pose Estimation and Tracking☆117Updated last year
- Official implementation of Human-Aware Vision-and-Language Navigation: Bridging Simulation to Reality with Dynamic Human Interactions (Ne…☆48Updated last year
- [ICLR 2025] SPA: 3D Spatial-Awareness Enables Effective Embodied Representation☆171Updated 6 months ago
- ☆20Updated 4 months ago
- A curated list of awesome Vision-and-Language Navigation(VLN) resources (continually updated)☆110Updated 10 months ago
- [CVPR 2024] The code for paper 'Towards Learning a Generalist Model for Embodied Navigation'☆58Updated last year
- [ICRA 2025] Official implementation of Open-Nav: Exploring Zero-Shot Vision-and-Language Navigation in Continuous Environment with Open-S…☆110Updated 7 months ago
- MLA: A Multisensory Language-Action Model for Multimodal Understanding and Forecasting in Robotic Manipulation☆56Updated 2 months ago
- PyTorch implementation of paper: GaussNav: Gaussian Splatting for Visual Navigation☆182Updated last year
- [CVPR2025] CityWalker: Learning Embodied Urban Navigation from Web-Scale Videos☆174Updated 3 months ago
- [IROS'25 Oral] WMNav: Integrating Vision-Language Models into World Models for Object Goal Navigation☆135Updated 2 months ago
- [CVPR 2025] RoomTour3D - Geometry-aware, cheap and automatic data from web videos for embodied navigation☆68Updated 9 months ago
- Code for CoRL 2025 "LaDiWM: A Latent Diffusion-based World Model for Predictive Manipulation"☆42Updated last month
- The official implementation of CogNav [ICCV 2025]☆55Updated 3 months ago
- [ICCV 2025] GLEAM: Learning Generalizable Exploration Policy for Active Mapping in Complex 3D Indoor Scene☆156Updated this week
- Official implementation of Sim-to-Real Transfer via 3D Feature Fields for Vision-and-Language Navigation (CoRL'24).☆75Updated 2 weeks ago
- ☆238Updated 3 weeks ago
- [CVPR24] Instance-Adaptive and Geometric-Aware Keypoint Learning for Category-Level 6D Object Pose Estimation☆83Updated last year
- ☆58Updated 2 weeks ago
- Generative Artificial Intelligence in Robotic Manipulation: A Survey☆84Updated 6 months ago