OpenRobotLab / StreamVLN
Official implementation of the paper: "StreamVLN: Streaming Vision-and-Language Navigation via SlowFast Context Modeling"
☆76Updated this week
Alternatives and similar repositories for StreamVLN
Users who are interested in StreamVLN are comparing it to the repositories listed below.
- [ICLR 2025] SPA: 3D Spatial-Awareness Enables Effective Embodied Representation☆160Updated 3 weeks ago
- SoFar: Language-Grounded Orientation Bridges Spatial Reasoning and Object Manipulation☆175Updated last week
- Official code for the CVPR 2025 paper "Navigation World Models".☆297Updated this week
- [NeurIPS 2024 D&B] Point Cloud Matters: Rethinking the Impact of Different Observation Spaces on Robot Learning☆82Updated 8 months ago
- [CVPR 2025] Lift3D Foundation Policy: Lifting 2D Large-Scale Pretrained Models for Robust 3D Robotic Manipulation☆149Updated 3 weeks ago
- Official implementation of "SUGAR: Pre-training 3D Visual Representations for Robotics" (CVPR'24).☆40Updated 3 weeks ago
- Official implementation of "OneTwoVLA: A Unified Vision-Language-Action Model with Adaptive Reasoning"☆138Updated last month
- List of papers on video-centric robot learning☆21Updated 7 months ago
- ☆55Updated 4 months ago
- [RSS 2025] Novel Demonstration Generation with Gaussian Splatting Enables Robust One-Shot Manipulation☆119Updated last month
- ☆41Updated last year
- Official implementation of Lookahead Exploration with Neural Radiance Representation for Continuous Vision-Language Navigation (CVPR'24 H…☆86Updated 3 months ago
- [ECCV 2024] ManiGaussian: Dynamic Gaussian Splatting for Multi-task Robotic Manipulation☆232Updated 3 months ago
- [CVPR 2025] Tra-MoE: Learning Trajectory Prediction Model from Multiple Domains for Adaptive Policy Conditioning☆37Updated 3 months ago
- [arXiv 2025] CronusVLA: Transferring Latent Motion Across Time for Multi-Frame Prediction in Manipulation☆28Updated 2 weeks ago
- [CoRL 2024] VLM-Grounder: A VLM Agent for Zero-Shot 3D Visual Grounding☆108Updated last month
- Code repository for the Habitat Synthetic Scenes Dataset (HSSD) paper.☆94Updated last year
- ☆63Updated last month
- Generative Artificial Intelligence in Robotic Manipulation: A Survey☆65Updated last week
- [arXiv 2025] MMSI-Bench: A Benchmark for Multi-Image Spatial Intelligence☆42Updated last week
- Official implementation of "g3D-LF: Generalizable 3D-Language Feature Fields for Embodied Tasks" (CVPR'25).☆27Updated this week
- [CVPR 2025] RoomTour3D - Geometry-aware, cheap and automatic data from web videos for embodied navigation☆56Updated 3 months ago
- ICCV2025☆103Updated last week
- Official PyTorch Implementation of Unified Video Action Model (RSS 2025)☆232Updated 2 weeks ago
- [CVPR'25] SeeGround: See and Ground for Zero-Shot Open-Vocabulary 3D Visual Grounding☆152Updated 2 months ago
- [ICCV 2025] VQ-VLA: Improving Vision-Language-Action Models via Scaling Vector-Quantized Action Tokenizers☆48Updated last week
- [NeurIPS 2024] Official code repository for MSR3D paper☆60Updated 3 weeks ago
- [RSS 2024] Learning Manipulation by Predicting Interaction☆110Updated last week
- ☆67Updated 6 months ago
- ☆101Updated last year