F1y1113 / HA-VLN
Official implementation for "HA-VLN 2.0: An Open Benchmark and Leaderboard for Human-Aware Navigation in Discrete and Continuous Environments with Dynamic Multi-Human Interactions".
☆380 Updated last month
Alternatives and similar repositories for HA-VLN
Users who are interested in HA-VLN are comparing it to the repositories listed below
- 🔥 The first open-sourced diffusion vision-language-action model. ☆160 Updated last month
- Official implementation of paper "Unified World Models: Memory-Augmented Planning and Foresight for Visual Navigation" ☆270 Updated 3 months ago
- DeepThinkVLA: Enhancing Reasoning Capability of Vision-Language-Action Models ☆482 Updated 3 weeks ago
- ☆545 Updated 3 months ago
- ☆246 Updated last year
- WAM-Diff: A Masked Diffusion VLA Framework with MoE and Online Reinforcement Learning for Autonomous Driving ☆166 Updated last week
- GigaBrain-0: A World Model-Powered Vision-Language-Action Model ☆2,246 Updated this week
- ☆324 Updated 3 months ago
- [NeurIPS 2025] NAUTILUS: A Large Multimodal Model for Underwater Scene Understanding ☆350 Updated last month
- GigaWorld-0: World Models as Data Engine to Empower Embodied AI ☆1,471 Updated 2 months ago
- Forging Spatial Intelligence: A Roadmap of Multi-Modal Data Pre-Training for Autonomous Systems ☆135 Updated last week
- ☆93 Updated 7 months ago
- Autonomous Implicit Indoor Scene Reconstruction with Frontier Exploration （ICRA 2024） ☆50 Updated last year
- Think with 3D: Geometric Imagination Grounded Spatial Reasoning from Limited Views ☆185 Updated 2 months ago
- This repository contains the source code for our paper: "PrefMMT: Modeling Human Preferences in Preference-based Reinforcement Learning w… ☆51 Updated 11 months ago
- WAM-Flow: Parallel Coarse-to-Fine Motion Planning via Discrete Flow Matching for Autonomous Driving ☆171 Updated last week
- [NeurIPS 2025 DB Track] 3EED: Ground Everything Everywhere in 3D ☆203 Updated last month
- Embodied Co-Design for Rapidly Evolving Agents: Taxonomy, Frontiers, and Challenges ☆296 Updated last week
- Mem4Nav: Boosting Vision-and-Language Navigation in Urban Environments with a Hierarchical Spatial-Cognition Long-Short Memory System ☆100 Updated 6 months ago
- [ICML 2025 Poster] Official PyTorch Implementation of "Habitizing Diffusion Planning for Efficient and Effective Decision Making" ☆35 Updated 8 months ago
- This is the source code for the ECCV paper "MTFormer: Multi-Task Learning via Transformer and Cross-Task Reasoning" ☆199 Updated 3 years ago
- WorldLens: Full-Spectrum Evaluations of Driving World Models in Real World ☆180 Updated 3 weeks ago
- SCFlow2: Plug-and-Play Object Pose Refiner with Shape-Constraint Scene Flow, CVPR2025 ☆55 Updated 5 months ago
- Efficient View Path Planning for Autonomous Implicit Reconstruction （ICRA 2023） ☆52 Updated last year
- [AAAI 2026 Oral] Cook and Clean Together: Teaching Embodied Agents for Parallel Task Execution ☆356 Updated 2 months ago
- CoNav: Collaborative Cross-Modal Reasoning for Embodied Navigation ☆17 Updated 8 months ago
- DDN-SLAM: Real-time Dense Dynamic Neural Implicit SLAM (RA-L 2025) ☆203 Updated 2 months ago
- Awesome collection of resources and papers on Diffusion Models for Robotic Manipulation. ☆759 Updated 5 months ago
- [MM 2025] EventVAD: Training-Free Event-Aware Video Anomaly Detection ☆518 Updated 7 months ago
- GigaTrain: An Efficient and Scalable Training Framework for AI Models ☆1,048 Updated 2 months ago