facebookresearch / nwm
Official code for the CVPR 2025 paper "Navigation World Models".
☆156Updated last month
Alternatives and similar repositories for nwm
Users who are interested in nwm are comparing it to the repositories listed below.
- [ICLR 2025] SPA: 3D Spatial-Awareness Enables Effective Embodied Representation☆154Updated last week
- [CVPR 2025] CityWalker: Learning Embodied Urban Navigation from Web-Scale Videos☆93Updated 2 months ago
- [ECCV 2024] ManiGaussian: Dynamic Gaussian Splatting for Multi-task Robotic Manipulation☆229Updated 2 months ago
- [CVPR'25] SeeGround: See and Ground for Zero-Shot Open-Vocabulary 3D Visual Grounding☆125Updated last month
- [CoRL 2024] VLM-Grounder: A VLM Agent for Zero-Shot 3D Visual Grounding☆106Updated last week
- [CVPR 2025] UniGoal: Towards Universal Zero-shot Goal-oriented Navigation☆136Updated last week
- ☆62Updated 4 months ago
- EgoVid-5M: A Large-Scale Video-Action Dataset for Egocentric Video Generation☆104Updated 6 months ago
- ☆60Updated this week
- [NeurIPS 2024 D&B] Point Cloud Matters: Rethinking the Impact of Different Observation Spaces on Robot Learning☆78Updated 7 months ago
- Unifying 2D and 3D Vision-Language Understanding☆82Updated last month
- [RAL 2024] OpenObj: Open-Vocabulary Object-Level Neural Radiance Fields with Fine-Grained Understanding☆27Updated 3 months ago
- PyTorch implementation of paper: GaussNav: Gaussian Splatting for Visual Navigation☆125Updated 6 months ago
- Official PyTorch Implementation of Unified Video Action Model (RSS 2025)☆199Updated 2 months ago
- GLEAM: Learning Generalizable Exploration Policy for Active Mapping in Complex 3D Indoor Scene☆31Updated this week
- SceneFun3D ToolKit☆136Updated last month
- ☆98Updated 10 months ago
- [RSS 2025] Novel Demonstration Generation with Gaussian Splatting Enables Robust One-Shot Manipulation☆90Updated this week
- [NeurIPS 2024] Lexicon3D: Probing Visual Foundation Models for Complex 3D Scene Understanding☆93Updated 4 months ago
- ☆56Updated 2 months ago
- [CVPR 2025] Lift3D Foundation Policy: Lifting 2D Large-Scale Pretrained Models for Robust 3D Robotic Manipulation☆142Updated 2 months ago
- Single-file implementation to advance vision-language-action (VLA) models with reinforcement learning.☆96Updated last week
- Official implementation of Sim-to-Real Transfer via 3D Feature Fields for Vision-and-Language Navigation (CoRL'24).☆65Updated 2 months ago
- [TMLR 2024] repository for VLN with foundation models☆122Updated 2 months ago
- [ICRA, 2025] SplatSim: Zero-Shot Sim2Real Transfer of RGB Manipulation Policies Using Gaussian Splatting☆105Updated 3 weeks ago
- [CVPR 2024] GenNBV: Generalizable Next-Best-View Policy for Active 3D Reconstruction☆64Updated this week
- Official implementation of the paper: "NavDP: Learning Sim-to-Real Navigation Diffusion Policy with Privileged Information Guidance"☆51Updated last week
- Official repository of General Scene Adaptation for Vision-and-Language Navigation (ICLR'2025)☆40Updated last month
- [CVPR 2025] Source codes for the paper "3D-Mem: 3D Scene Memory for Embodied Exploration and Reasoning"☆115Updated last month
- AIR-Embodied: An Efficient Active 3DGS-based Interaction and Reconstruction Framework with Embodied Large Language Model☆18Updated last month