F1y1113 / HA-VLNLinks
Official implementation for "HA-VLN: A Benchmark for Human-Aware Navigation in Discrete-Continuous Environments with Dynamic Multi-Human Interactions, Real-World Validation, and an Open Leaderboard".
β368Updated last month
Alternatives and similar repositories for HA-VLN
Users that are interested in HA-VLN are comparing it to the libraries listed below
Sorting:
- π₯ The first open-sourced diffusion vision-langauge-action model.β134Updated this week
- GigaBrain-0: A World Model-Powered Vision-Language-Action Modelβ388Updated 2 weeks ago
- DeepThinkVLA: Enhancing Reasoning Capability of Vision-Language-Action Modelsβ359Updated this week
- β293Updated last month
- β545Updated last month
- Official implementation of paper "Unified World Models: Memory-Augmented Planning and Foresight for Visual Navigation"β257Updated last month
- β248Updated 11 months ago
- SCFlow2: Plug-and-Play Object Pose Refiner with Shape-Constraint Scene Flow, CVPR2025β53Updated 3 months ago
- [NeurIPS 2025] NAUTILUS: A Large Multimodal Model for Underwater Scene Understandingβ341Updated last month
- GigaTrain: An Efficient and Scalable Training Framework for AI Modelsβ252Updated last week
- GigaWorld-0: World Models as Data Engine to Empower Embodied AIβ605Updated last week
- This repository contains the source code for our paper: "PrefMMT: Modeling Human Preferences in Preference-based Reinforcement Learning wβ¦β50Updated 9 months ago
- [NeurIPS 2025 DB Track] 3EED: Ground Everything Everywhere in 3Dβ191Updated this week
- Efficient View Path Planning for Autonomous Implicit Reconstruction οΌICRA 2023οΌβ52Updated last year
- Autonomous Implicit Indoor Scene Reconstruction with Frontier Exploration οΌICRA 2024οΌβ50Updated last year
- β94Updated 5 months ago
- This is the source code for the ECCV paper "MTFormer: Multi-Task Learning via Transformer and Cross-Task Reasoning"β200Updated 3 years ago
- RealMirror, a comprehensive, open-source embodied AI VLA platform.β100Updated last week
- [EMNLP2025]Official implementation: Agent-style vision question answer in Autonomous Driving!β133Updated 2 months ago
- DDN-SLAM: Real-time Dense Dynamic Neural Implicit SLAM (RA-L 2025)β199Updated 9 months ago
- hybrid sfm with VIO Pose,RGB and depth dataβ52Updated 2 years ago
- Mem4Nav: Boosting Vision-and-Language Navigation in Urban Environments with a Hierarchical Spatial-Cognition Long-Short Memory Systemβ101Updated 4 months ago
- GigaModels: A Comprehensive Repository and Platform for Multi-modal, Generative, and Perceptual Modelsβ159Updated last week
- [IROS2025] OpenGS-Fusion: Open-Vocabulary Dense Mapping with Hybrid 3D Gaussian Splatting for Refined Object-Level Understandingβ73Updated 4 months ago
- [MM 2025] EventVAD: Training-Free Event-Aware Video Anomaly Detectionβ510Updated 5 months ago
- GigaDatasets: A Unified and Lightweight Framework for Data Processing, Curation, and Visualizationβ88Updated last month
- [AAAI 2026 Oral] LiDARCrafter: Dynamic 4D World Modeling from LiDAR Sequencesβ176Updated this week
- [AAAI 2026 Oral] Cook and Clean Together: Teaching Embodied Agents for Parallel Task Executionβ354Updated last week
- [NeurIPS 2025] More Than Generation: Unifying Generation and Depth Estimation via Text-to-Image Diffusion Modelsβ214Updated last month
- β223Updated last month