allenai / spoc-robot-training
SPOC: Imitating Shortest Paths in Simulation Enables Effective Navigation and Manipulation in the Real World
☆146 · Nov 4, 2024 · Updated last year
Alternatives and similar repositories for spoc-robot-training
Users who are interested in spoc-robot-training are comparing it to the libraries listed below.
- Find What You Want: Learning Demand-conditioned Object Attribute Space for Demand-driven Navigation ☆62 · Jan 15, 2025 · Updated last year
- ☆124 · Jul 9, 2024 · Updated last year
- [IROS 24] Official repository of "Mind the Error! Detection and Localization of Instruction Errors in Vision-and-Language Navigation". We… ☆18 · Jan 8, 2025 · Updated last year
- Code of the paper "Correctable Landmark Discovery via Large Models for Vision-Language Navigation" (TPAMI 2024) ☆16 · Jun 7, 2024 · Updated last year
- ☆18 · Mar 12, 2025 · Updated 11 months ago
- ☆82 · Aug 20, 2025 · Updated 5 months ago
- [ICRA 25] FLaRe: Achieving Masterful and Adaptive Robot Policies with Large-Scale Reinforcement Learning Fine-Tuning ☆43 · Jan 5, 2025 · Updated last year
- Mobile manipulation research tools for roboticists ☆1,186 · Jun 8, 2024 · Updated last year
- PoliFormer: Scaling On-Policy RL with Transformers Results in Masterful Navigators ☆106 · Nov 21, 2024 · Updated last year
- The code of the paper "DivScene: Benchmarking LVLMs for Object Navigation with Diverse Scenes and Objects" ☆20 · May 2, 2025 · Updated 9 months ago
- The repository provides code associated with the paper VLFM: Vision-Language Frontier Maps for Zero-Shot Semantic Navigation (ICRA 2024) ☆677 · Nov 12, 2025 · Updated 3 months ago
- [RSS2024] Official implementation of "Hierarchical Open-Vocabulary 3D Scene Graphs for Language-Grounded Robot Navigation" ☆429 · Jan 19, 2026 · Updated 3 weeks ago
- AC-DiT: Adaptive Coordination Diffusion Transformer for Mobile Manipulation ☆29 · Jul 19, 2025 · Updated 6 months ago
- Official implementation of Learning from Unlabeled 3D Environments for Vision-and-Language Navigation (ECCV'22). ☆43 · Mar 16, 2023 · Updated 2 years ago
- ☆194 · Mar 29, 2025 · Updated 10 months ago
- [AAAI 2024] Official implementation of NavGPT: Explicit Reasoning in Vision-and-Language Navigation with Large Language Models ☆314 · Nov 7, 2023 · Updated 2 years ago
- Official implementation of Lookahead Exploration with Neural Radiance Representation for Continuous Vision-Language Navigation (CVPR'24 H… ☆105 · Apr 2, 2025 · Updated 10 months ago
- RL training scripts for learning an agent using ProcTHOR. ☆37 · Feb 18, 2025 · Updated 11 months ago
- [ECCV 2024] Official implementation of NavGPT-2: Unleashing Navigational Reasoning Capability for Large Vision-Language Models ☆238 · Sep 20, 2024 · Updated last year
- [TPAMI 2024] Official repo of "ETPNav: Evolving Topological Planning for Vision-Language Navigation in Continuous Environments" ☆416 · Apr 5, 2025 · Updated 10 months ago
- ☆19 · May 7, 2025 · Updated 9 months ago
- Official implementation of Why Only Text: Empowering Vision-and-Language Navigation with Multi-modal Prompts (IJCAI 2024) ☆15 · Oct 16, 2024 · Updated last year
- Fast-Slow Test-time Adaptation for Online Vision-and-Language Navigation ☆30 · Dec 5, 2025 · Updated 2 months ago
- Human-centered Delivery Benchmark ☆20 · Jul 24, 2024 · Updated last year
- [CVPR 2024] The code for paper 'Towards Learning a Generalist Model for Embodied Navigation' ☆61 · Apr 11, 2024 · Updated last year
- Vision-and-Language Navigation in Continuous Environments using Habitat ☆722 · Jan 7, 2025 · Updated last year
- Official implementation of Think Global, Act Local: Dual-scale GraphTransformer for Vision-and-Language Navigation (CVPR'22 Oral). ☆255 · Jun 27, 2023 · Updated 2 years ago
- Official implementation of KERM: Knowledge Enhanced Reasoning for Vision-and-Language Navigation (CVPR'23) ☆45 · Aug 6, 2024 · Updated last year
- Official repository for LeLaN training and inference code ☆131 · Sep 27, 2024 · Updated last year
- [ICRA 2026] Official implementation of the paper: "StreamVLN: Streaming Vision-and-Language Navigation via SlowFast Context Modeling" ☆403 · Nov 2, 2025 · Updated 3 months ago
- Official GitHub Repository for Paper "Bridging Zero-shot Object Navigation and Foundation Models through Pixel-Guided Navigation Skill", … ☆130 · Oct 30, 2024 · Updated last year
- [ICCV 23] Official repository for Language-enhanced RNR-Map: Querying Renderable Neural Radiance Field maps with natural language ☆17 · Dec 3, 2024 · Updated last year
- Code for MultiPLY: A Multisensory Object-Centric Embodied Large Language Model in 3D World ☆133 · Oct 24, 2024 · Updated last year
- Official implementation of RAM: Retrieval-Based Affordance Transfer for Generalizable Zero-Shot Robotic Manipulation ☆99 · Dec 30, 2024 · Updated last year
- [CVPR 2024] The code for paper 'Towards Learning a Generalist Model for Embodied Navigation' ☆228 · Jun 18, 2024 · Updated last year
- 🏘️ Scaling Embodied AI by Procedurally Generating Interactive 3D Houses ☆421 · Apr 7, 2023 · Updated 2 years ago
- ☆57 · Aug 18, 2025 · Updated 5 months ago
- [ICCV 2023] Official repo of "BEVBert: Multimodal Map Pre-training for Language-guided Navigation" ☆247 · Oct 31, 2023 · Updated 2 years ago
- Official implementation of GridMM: Grid Memory Map for Vision-and-Language Navigation (ICCV'23). ☆102 · Apr 18, 2024 · Updated last year