AIGeeksGroup / Nav-R1
Nav-R1: Reasoning and Navigation in Embodied Scenes
☆30 Updated this week
Alternatives and similar repositories for Nav-R1
Users who are interested in Nav-R1 are comparing it to the repositories listed below.
- Official Implementation of Learning Navigational Visual Representations with Semantic Map Supervision (ICCV2023) ☆26 Updated 2 years ago
- Embodied Instruction Following in Unknown Environments ☆17 Updated 5 months ago
- [ECCV24] Navigation Instruction Generation with BEV Perception and Large Language Models ☆30 Updated last year
- Visual Embodied Brain: Let Multimodal Large Language Models See, Think, and Control in Spaces ☆80 Updated 3 months ago
- [CVPR 2025 highlight] Generating 6DoF Object Manipulation Trajectories from Action Description in Egocentric Vision ☆26 Updated 3 weeks ago
- [CoRL 2024] VLM-Grounder: A VLM Agent for Zero-Shot 3D Visual Grounding ☆113 Updated 3 months ago
- Official implementation of NavMorph: A Self-Evolving World Model for Vision-and-Language Navigation in Continuous Environments (ICCV'25). ☆30 Updated 2 months ago
- This repository is the official implementation of our paper (From reactive to cognitive: brain-inspired spatial intelligence for embodied… ☆52 Updated 2 weeks ago
- [CVPR 2025 Highlight] Towards Autonomous Micromobility through Scalable Urban Simulation ☆110 Updated last month
- Unifying 2D and 3D Vision-Language Understanding ☆104 Updated last month
- Official code for "Embodied-R1: Reinforced Embodied Reasoning for General Robotic Manipulation" ☆66 Updated 3 weeks ago
- Official implementation of Sim-to-Real Transfer via 3D Feature Fields for Vision-and-Language Navigation (CoRL'24). ☆69 Updated 6 months ago
- [CVPR 2024] Visual Programming for Zero-shot Open-Vocabulary 3D Visual Grounding ☆55 Updated last year
- EMMOE: A Comprehensive Benchmark for Embodied Mobile Manipulation in Open Environments ☆19 Updated 4 months ago
- F1: A Vision Language Action Model Bridging Understanding and Generation to Actions ☆72 Updated last week
- Official implementation of: Bootstrapping Language-Guided Navigation Learning with Self-Refining Data Flywheel ☆27 Updated 3 months ago
- 4D Panoptic Scene Graph Generation (NeurIPS'23 Spotlight) ☆114 Updated 6 months ago
- Code for MultiPLY: A Multisensory Object-Centric Embodied Large Language Model in 3D World ☆130 Updated 10 months ago
- [CVPR 2025] Tra-MoE: Learning Trajectory Prediction Model from Multiple Domains for Adaptive Policy Conditioning ☆46 Updated 5 months ago
- [CVPR2025] ProxyTransformation: Preshaping Point Cloud Manifold With Proxy Attention For 3D Visual Grounding ☆43 Updated 2 weeks ago
- ☆24 Updated 3 months ago
- ☆60 Updated last month
- Source codes for the paper "MindJourney: Test-Time Scaling with World Models for Spatial Reasoning" ☆82 Updated last month
- ☆50 Updated 11 months ago
- EgoVid-5M: A Large-Scale Video-Action Dataset for Egocentric Video Generation ☆116 Updated last month
- Official implementation of Human-Aware Vision-and-Language Navigation: Bridging Simulation to Reality with Dynamic Human Interactions (Ne… ☆43 Updated 8 months ago
- ☆39 Updated 2 months ago
- [RAL 2024] OpenObj: Open-Vocabulary Object-Level Neural Radiance Fields with Fine-Grained Understanding ☆28 Updated 7 months ago
- ☆24 Updated 4 months ago
- [CVPR'24 Highlight] The official code and data for paper "EgoThink: Evaluating First-Person Perspective Thinking Capability of Vision-Lan… ☆61 Updated 5 months ago