OpenRobotLab / StreamVLN

Official implementation of the paper: "StreamVLN: Streaming Vision-and-Language Navigation via SlowFast Context Modeling"

☆76

Alternatives and similar repositories for StreamVLN

Users who are interested in StreamVLN are comparing it to the repositories listed below.

HaoyiZhu / SPA
[ICLR 2025] SPA: 3D Spatial-Awareness Enables Effective Embodied Representation
☆160 Updated 3 weeks ago
qizekun / SoFar
SoFar: Language-Grounded Orientation Bridges Spatial Reasoning and Object Manipulation
☆175 Updated last week
facebookresearch / nwm
Official code for the CVPR 2025 paper "Navigation World Models".
☆297 Updated this week
HaoyiZhu / PointCloudMatters
[NeurIPS 2024 D&B] Point Cloud Matters: Rethinking the Impact of Different Observation Spaces on Robot Learning
☆82 Updated 8 months ago
PKU-HMI-Lab / LIFT3D
[CVPR 2025] Lift3D Foundation Policy: Lifting 2D Large-Scale Pretrained Models for Robust 3D Robotic Manipulation
☆149 Updated 3 weeks ago
vlc-robot / robot_sugar
Official implementation of "SUGAR: Pre-training 3D Visual Representations for Robotics" (CVPR'24).
☆40 Updated 3 weeks ago
Fanqi-Lin / OneTwoVLA
Official implementation of "OneTwoVLA: A Unified Vision-Language-Action Model with Adaptive Reasoning"
☆138 Updated last month
jmwang0117 / Video4Robot
List of papers on video-centric robot learning
☆21 Updated 7 months ago
liufanfanlff / RoboUniview
☆55 Updated 4 months ago
OpenRobotLab / RoboSplat
[RSS 2025] Novel Demonstration Generation with Gaussian Splatting Enables Robust One-Shot Manipulation
☆119 Updated last month
cshizhe / onav_rim
☆41 Updated last year
MrZihan / HNR-VLN
Official implementation of Lookahead Exploration with Neural Radiance Representation for Continuous Vision-Language Navigation (CVPR'24 H…
☆86 Updated 3 months ago
GuanxingLu / ManiGaussian
[ECCV 2024] ManiGaussian: Dynamic Gaussian Splatting for Multi-task Robotic Manipulation
☆232 Updated 3 months ago
MCG-NJU / Tra-MoE
[CVPR 2025] Tra-MoE: Learning Trajectory Prediction Model from Multiple Domains for Adaptive Policy Conditioning
☆37 Updated 3 months ago
OpenRobotLab / CronusVLA
[arXiv 2025] CronusVLA: Transferring Latent Motion Across Time for Multi-Frame Prediction in Manipulation
☆28 Updated 2 weeks ago
OpenRobotLab / VLM-Grounder
[CoRL 2024] VLM-Grounder: A VLM Agent for Zero-Shot 3D Visual Grounding
☆108 Updated last month
3dlg-hcvc / hssd
Code repository for the Habitat Synthetic Scenes Dataset (HSSD) paper.
☆94 Updated last year
pzhren / InfiniteWorld
☆63 Updated last month
GAI4Manipulation / AwesomeGAIManipulation
Generative Artificial Intelligence in Robotic Manipulation: A Survey
☆65 Updated last week
OpenRobotLab / MMSI-Bench
[arXiv 2025] MMSI-Bench: A Benchmark for Multi-Image Spatial Intelligence
☆42 Updated last week
MrZihan / g3D-LF
Official implementation of "g3D-LF: Generalizable 3D-Language Feature Fields for Embodied Tasks" (CVPR'25).
☆27 Updated this week
roomtour3d / roomtour3d-NaviLLM
[CVPR 2025] RoomTour3D - Geometry-aware, cheap and automatic data from web videos for embodied navigation
☆56 Updated 3 months ago
RoboDita / Dita
ICCV 2025
☆103 Updated last week
ShuangLI59 / unified_video_action
Official PyTorch Implementation of Unified Video Action Model (RSS 2025)
☆232 Updated 2 weeks ago
iris0329 / SeeGround
[CVPR'25] SeeGround: See and Ground for Zero-Shot Open-Vocabulary 3D Visual Grounding
☆152 Updated 2 months ago
xiaoxiao0406 / VQ-VLA
[ICCV 2025] VQ-VLA: Improving Vision-Language-Action Models via Scaling Vector-Quantized Action Tokenizers
☆48 Updated last week
MSR3D / MSR3D
[NeurIPS 2024] Official code repository for MSR3D paper
☆60 Updated 3 weeks ago
OpenDriveLab / MPI
[RSS 2024] Learning Manipulation by Predicting Interaction
☆110 Updated last week
HaochenZ11 / VLA-3D
☆67 Updated 6 months ago
Ram81 / goat-bench
☆101 Updated last year