oceanhao / CoNavLinks
CoNav : Collaborative Cross-Modal Reasoning for Embodied Navigation
☆17Updated 8 months ago
Alternatives and similar repositories for CoNav
Users that are interested in CoNav are comparing it to the libraries listed below
Sorting:
- Official implementation for "HA-VLN 2.0: An Open Benchmark and Leaderboard for Human-Aware Navigation in Discrete and Continuous Environm…☆378Updated last month
- [EMNLP2025]Official implementation: Agent-style vision question answer in Autonomous Driving!☆135Updated 4 months ago
- [ICML 2025 Poster] Official PyTorch Implementation of "Habitizing Diffusion Planning for Efficient and Effective Decision Making"☆35Updated 8 months ago
- 🔥 The first open-sourced diffusion vision-langauge-action model.☆159Updated 3 weeks ago
- DeepThinkVLA: Enhancing Reasoning Capability of Vision-Language-Action Models☆480Updated last week
- ☆224Updated 2 months ago
- Official code of Motus: A Unified Latent Action World Model☆597Updated 3 weeks ago
- This repository serves as a central navigator for the various components of my Final Year Project (FYP).☆24Updated last month
- ☆545Updated 3 months ago
- Logic-in-frames: Dynamic keyframe search via visual semantic-logical verification for long video understanding☆58Updated 2 months ago
- GigaWorld-0: World Models as Data Engine to Empower Embodied AI☆1,328Updated last month
- A Python tool to crawl historical arXiv papers from specified categories, filter them using a custom LLM prompt via Alibaba Cloud's DashS…☆20Updated 6 months ago
- Official implementation of paper "Unified World Models: Memory-Augmented Planning and Foresight for Visual Navigation"☆268Updated 3 months ago
- [NeurIPS 2025] NAUTILUS: A Large Multimodal Model for Underwater Scene Understanding☆350Updated last month
- [AAAI 2026 Oral] FreeAskWorld is an interactive simulation framework that integrates large language models (LLMs) for high-level plannin…☆208Updated 3 weeks ago
- GigaBrain-0: A World Model-Powered Vision-Language-Action Model☆2,064Updated 2 months ago
- WAM-Diff: A Masked Diffusion VLA Framework with MoE and Online Reinforcement Learning for Autonomous Driving☆158Updated last month
- ☆93Updated 6 months ago
- Awesome collection of resources and papers on Diffusion Models for Robotic Manipulation.☆754Updated 5 months ago
- [ICRA2024] The official implementation of Robot Trajectron☆111Updated last month
- 🌐 WorldLens: Full-Spectrum Evaluations of Driving World Models in Real World☆177Updated last week
- [IROS2025] OpenGS-Fusion: Open-Vocabulary Dense Mapping with Hybrid 3D Gaussian Splatting for Refined Object-Level Understanding☆75Updated 5 months ago
- [CoRL2024] Let Occ Flow: Self-Supervised 3D Occupancy Flow Prediction☆130Updated 3 months ago
- [TRO 2024] Grasp, See and Place: Efficient Unknown Object Rearrangement with Policy Structure Prior☆72Updated 10 months ago
- ☆18Updated 6 months ago
- WAM-Flow: Parallel Coarse-to-Fine Motion Planning via Discrete Flow Matching for Autonomous Driving☆159Updated last month
- ☆246Updated last year
- OmniNWM: Omniscient Navigation World Models for Autonomous Driving☆269Updated 3 months ago
- Think with 3D: Geometric Imagination Grounded Spatial Reasoning from Limited Views☆181Updated last month
- RealMirror, a comprehensive, open-source embodied AI VLA platform.☆489Updated 3 weeks ago