Xiaofeng-Han-Res / MF-RV
A survey on Multimodal Fusion for Robot Vision
☆23Updated 7 months ago
Alternatives and similar repositories for MF-RV
Users who are interested in MF-RV are comparing it to the repositories listed below:
- Generative Artificial Intelligence in Robotic Manipulation: A Survey☆78Updated 2 months ago
- ✨✨【NeurIPS 2025】Official implementation of BridgeVLA☆131Updated last week
- [CoRL 2025] Repository relating to "TrackVLA: Embodied Visual Tracking in the Wild"☆222Updated last month
- [RA-L 2025] Dynamic Open-Vocabulary 3D Scene Graphs for Long-term Language-Guided Mobile Manipulation☆104Updated 5 months ago
- ☆75Updated 8 months ago
- ☆93Updated 2 weeks ago
- [ECCV 2024] Omni6DPose: A Benchmark and Model for Universal 6D Object Pose Estimation and Tracking☆108Updated last year
- [NeurIPS 2025 Spotlight] SoFar: Language-Grounded Orientation Bridges Spatial Reasoning and Object Manipulation☆192Updated 2 months ago
- [RSS'25] This repository is the implementation of "NaVILA: Legged Robot Vision-Language-Action Model for Navigation"☆302Updated last month
- An all-in-one robot manipulation learning suite for training and evaluating policy models on various datasets and benchmarks.☆134Updated last week
- ☆51Updated last week
- [Actively Maintained🔥] A list of Embodied AI papers accepted by top conferences (ICLR, NeurIPS, ICML, RSS, CoRL, ICRA, IROS, CVPR, ICCV,…☆371Updated last month
- A collection of resources related to NVIDIA Isaac Sim☆87Updated last month
- GraspSplats: Efficient Manipulation with 3D Feature Splatting☆130Updated 10 months ago
- Dynamic 3D Gaussian Scene Graphs for Environment Adaptation☆48Updated 3 months ago
- Official implementation of "g3D-LF: Generalizable 3D-Language Feature Fields for Embodied Tasks" (CVPR'25).☆37Updated 2 months ago
- A curated list of awesome Vision-and-Language Navigation(VLN) resources (continually updated)☆102Updated 6 months ago
- [CVPR 2025] UniGoal: Towards Universal Zero-shot Goal-oriented Navigation☆226Updated last week
- InternRobotics' open platform for building generalized navigation foundation models.☆301Updated this week
- ☆76Updated last month
- [RSS 2025] Uni-NaVid: A Video-based Vision-Language-Action Model for Unifying Embodied Navigation Tasks.☆103Updated 4 months ago
- ☆39Updated last month
- [ICRA 2025] Official implementation of Open-Nav: Exploring Zero-Shot Vision-and-Language Navigation in Continuous Environment with Open-S…☆73Updated 3 months ago
- ☆274Updated this week
- [CVPR2025] CityWalker: Learning Embodied Urban Navigation from Web-Scale Videos☆136Updated last week
- AIR-Embodied: An Efficient Active 3DGS-based Interaction and Reconstruction Framework with Embodied Large Language Model☆19Updated 5 months ago
- [CVPR 2025] RoomTour3D - Geometry-aware, cheap and automatic data from web videos for embodied navigation☆56Updated 6 months ago
- [ECCV 2024] GenPose++: A generative category-level 6D object pose estimation and tracking approach proposed in Omni6DPose.☆87Updated last month
- [RSS 2025] Novel Demonstration Generation with Gaussian Splatting Enables Robust One-Shot Manipulation☆140Updated 3 months ago
- ICRA2025 Paper List☆276Updated 4 months ago