F1y1113 / HA-VLNLinks
Official implementation for "HA-VLN 2.0: An Open Benchmark and Leaderboard for Human-Aware Navigation in Discrete and Continuous Environments with Dynamic Multi-Human Interactions".
β378Updated last month
Alternatives and similar repositories for HA-VLN
Users that are interested in HA-VLN are comparing it to the libraries listed below
Sorting:
- π₯ The first open-sourced diffusion vision-langauge-action model.β153Updated 2 weeks ago
- Official implementation of paper "Unified World Models: Memory-Augmented Planning and Foresight for Visual Navigation"β267Updated 2 months ago
- DeepThinkVLA: Enhancing Reasoning Capability of Vision-Language-Action Modelsβ472Updated last month
- β314Updated 3 months ago
- [NeurIPS 2025] NAUTILUS: A Large Multimodal Model for Underwater Scene Understandingβ350Updated last month
- β246Updated last year
- β543Updated 2 months ago
- GigaBrain-0: A World Model-Powered Vision-Language-Action Modelβ1,853Updated last month
- π Forging Spatial Intelligence: A Roadmap of Multi-Modal Data Pre-Training for Autonomous Systemsβ128Updated 2 weeks ago
- GigaWorld-0: World Models as Data Engine to Empower Embodied AIβ1,226Updated last month
- Autonomous Implicit Indoor Scene Reconstruction with Frontier Exploration οΌICRA 2024οΌβ50Updated last year
- Think with 3D: Geometric Imagination Grounded Spatial Reasoning from Limited Viewsβ180Updated last month
- Official code of Motus: A Unified Latent Action World Modelβ580Updated 2 weeks ago
- Efficient View Path Planning for Autonomous Implicit Reconstruction οΌICRA 2023οΌβ52Updated last year
- β92Updated 6 months ago
- This is the source code for the ECCV paper "MTFormer: Multi-Task Learning via Transformer and Cross-Task Reasoning"β199Updated 3 years ago
- WAM-Diff: A Masked Diffusion VLA Framework with MoE and Online Reinforcement Learning for Autonomous Drivingβ98Updated last month
- [NeurIPS 2025 DB Track] 3EED: Ground Everything Everywhere in 3Dβ198Updated 3 weeks ago
- This repository contains the source code for our paper: "PrefMMT: Modeling Human Preferences in Preference-based Reinforcement Learning wβ¦β50Updated 10 months ago
- RealMirror, a comprehensive, open-source embodied AI VLA platform.β427Updated 2 weeks ago
- SCFlow2: Plug-and-Play Object Pose Refiner with Shape-Constraint Scene Flow, CVPR2025β55Updated 5 months ago
- Embodied Co-Design for Rapidly Evolving Agents: Taxonomy, Frontiers, and Challengesβ295Updated last week
- Mem4Nav: Boosting Vision-and-Language Navigation in Urban Environments with a Hierarchical Spatial-Cognition Long-Short Memory Systemβ99Updated 5 months ago
- [ICML 2025 Poster] Official PyTorch Implementation of "Habitizing Diffusion Planning for Efficient and Effective Decision Making"β35Updated 7 months ago
- DDN-SLAM: Real-time Dense Dynamic Neural Implicit SLAM (RA-L 2025)β202Updated last month
- [IROS2025] OpenGS-Fusion: Open-Vocabulary Dense Mapping with Hybrid 3D Gaussian Splatting for Refined Object-Level Understandingβ76Updated 5 months ago
- [AAAI 2026 Oral] Cook and Clean Together: Teaching Embodied Agents for Parallel Task Executionβ356Updated last month
- [NeurIPS 2025] More Than Generation: Unifying Generation and Depth Estimation via Text-to-Image Diffusion Modelsβ215Updated 2 months ago
- GigaTrain: An Efficient and Scalable Training Framework for AI Modelsβ834Updated last month
- hybrid sfm with VIO Pose,RGB and depth dataβ52Updated 2 years ago