facebookresearch / nwmLinks

Official code for the CVPR 2025 paper "Navigation World Models".

☆440

Alternatives and similar repositories for nwm

Users that are interested in nwm are comparing it to the libraries listed below

Sorting:

MTU3D / MTU3D
☆208Updated 3 months ago
InternRobotics / StreamVLN
Official implementation of the paper: "StreamVLN: Streaming Vision-and-Language Navigation via SlowFast Context Modeling"
☆300Updated 2 weeks ago
ai4ce / CityWalker
[CVPR2025] CityWalker: Learning Embodied Urban Navigation from Web-Scale Videos
☆151Updated 2 months ago
ZCMax / LLaVA-3D
[ICCV 2025] A Simple yet Effective Pathway to Empowering LLaVA to Understand and Interact with 3D World
☆348Updated last month
InternRobotics / InternNav
InternRobotics' open platform for building generalized navigation foundation models.
☆393Updated 2 weeks ago
HaoyiZhu / SPA
[ICLR 2025] SPA: 3D Spatial-Awareness Enables Effective Embodied Representation
☆169Updated 5 months ago
VITA-Group / VLM-3R
VLM-3R: Vision-Language Models Augmented with Instruction-Aligned 3D Reconstruction
☆295Updated 2 months ago
iris0329 / SeeGround
[CVPR'25] SeeGround: See and Ground for Zero-Shot Open-Vocabulary 3D Visual Grounding
☆183Updated 7 months ago
AnjieCheng / SpatialRGPT
[NeurIPS'24] This repository is the implementation of "SpatialRGPT: Grounded Spatial Reasoning in Vision Language Models"
☆281Updated 11 months ago
Zhangwenyao1 / DreamVLA
[NeurIPS 2025] DreamVLA: A Vision-Language-Action Model Dreamed with Comprehensive World Knowledge
☆224Updated 2 months ago
UMass-Embodied-AGI / 3D-Mem
[CVPR 2025] Source codes for the paper "3D-Mem: 3D Scene Memory for Embodied Exploration and Reasoning"
☆193Updated last month
diankun-wu / Spatial-MLLM
Official implementation of Spatial-MLLM: Boosting MLLM Capabilities in Visual-based Spatial Intelligence
☆386Updated 5 months ago
InternRobotics / VLM-Grounder
[CoRL 2024] VLM-Grounder: A VLM Agent for Zero-Shot 3D Visual Grounding
☆119Updated 6 months ago
metadriverse / metaurban
[ICLR 2025 Spotlight] MetaUrban: An Embodied AI Simulation Platform for Urban Micromobility
☆210Updated 3 weeks ago
alibaba-damo-academy / WorldVLA
WorldVLA: Towards Autoregressive Action World Model
☆539Updated last month
baaivision / UniVLA
Unified Vision-Language-Action Model
☆226Updated last month
GuanxingLu / ManiGaussian
[ECCV 2024] ManiGaussian: Dynamic Gaussian Splatting for Multi-task Robotic Manipulation
☆248Updated 7 months ago
Zhoues / RoboRefer
[NeurIPS 2025] Official implementation of "RoboRefer: Towards Spatial Referring with Reasoning in Vision-Language Models for Robotics"
☆195Updated 3 weeks ago
PKU-HMI-Lab / LIFT3D
[CVPR 2025]Lift3D Foundation Policy: Lifting 2D Large-Scale Pretrained Models for Robust 3D Robotic Manipulation
☆167Updated 5 months ago
zjwzcx / GLEAM
[ICCV 2025] GLEAM: Learning Generalizable Exploration Policy for Active Mapping in Complex 3D Indoor Scene
☆147Updated last month
ShuangLI59 / unified_video_action
Official PyTorch Implementation of Unified Video Action Model (RSS 2025)
☆291Updated 3 months ago
NVlabs / RoboSpatial
☆114Updated last month
qizekun / SoFar
[NeurIPS 2025 Spotlight] SoFar: Language-Grounded Orientation Bridges Spatial Reasoning and Object Manipulation
☆203Updated 4 months ago
LaVi-Lab / Video-3D-LLM
[CVPR 2025] The code for paper ''Video-3D LLM: Learning Position-Aware Video Representation for 3D Scene Understanding''.
☆177Updated 5 months ago
nvidia-cosmos / cosmos-transfer2.5
Cosmos-Transfer2.5, built on top of Cosmos-Predict2.5, produces high-quality world simulations conditioned on multiple spatial control in…
☆205Updated last week
Little-Podi / AdaWorld
[ICML'25] The PyTorch implementation of paper: "AdaWorld: Learning Adaptable World Models with Latent Actions".
☆175Updated 5 months ago
HaochenZ11 / VLA-3D
☆90Updated 10 months ago
pzhren / InfiniteWorld
☆78Updated 3 months ago
AIGeeksGroup / Nav-R1
Nav-R1: Reasoning and Navigation in Embodied Scenes
☆71Updated 3 weeks ago
xiaoxiao0406 / VQ-VLA
The offical repo for paper "VQ-VLA: Improving Vision-Language-Action Models via Scaling Vector-Quantized Action Tokenizers" (ICCV 2025)
☆94Updated last week