oceanhao / CoNav
CoNav : Collaborative Cross-Modal Reasoning for Embodied Navigation
☆17Updated 7 months ago
Alternatives and similar repositories for CoNav
Users that are interested in CoNav are comparing it to the libraries listed below
- Official implementation for "HA-VLN 2.0: An Open Benchmark and Leaderboard for Human-Aware Navigation in Discrete and Continuous Environm…☆377Updated 3 weeks ago
- [ICML 2025 Poster] Official PyTorch Implementation of "Habitizing Diffusion Planning for Efficient and Effective Decision Making"☆35Updated 7 months ago
- [EMNLP 2025] Official implementation: Agent-style vision question answer in Autonomous Driving!☆134Updated 3 months ago
- Official code of Motus: A Unified Latent Action World Model☆541Updated this week
- 🔥 The first open-sourced diffusion vision-language-action model.☆149Updated last week
- GigaBrain-0: A World Model-Powered Vision-Language-Action Model☆1,243Updated last month
- DeepThinkVLA: Enhancing Reasoning Capability of Vision-Language-Action Models☆466Updated 3 weeks ago
- GigaWorld-0: World Models as Data Engine to Empower Embodied AI☆1,021Updated last month
- RealMirror, a comprehensive, open-source embodied AI VLA platform.☆265Updated 3 weeks ago
- Awesome collection of resources and papers on Diffusion Models for Robotic Manipulation.☆748Updated 4 months ago
- ☆224Updated 2 months ago
- Official implementation of paper "Unified World Models: Memory-Augmented Planning and Foresight for Visual Navigation"☆263Updated 2 months ago
- This repository serves as a central navigator for the various components of my Final Year Project (FYP).☆23Updated 3 weeks ago
- Logic-in-frames: Dynamic keyframe search via visual semantic-logical verification for long video understanding☆57Updated last month
- ☆545Updated 2 months ago
- A Python tool to crawl historical arXiv papers from specified categories, filter them using a custom LLM prompt via Alibaba Cloud's DashS…☆20Updated 5 months ago
- WAM-Flow: Parallel Coarse-to-Fine Motion Planning via Discrete Flow Matching for Autonomous Driving☆72Updated last week
- VLA-Adapter: An Effective Paradigm for Tiny-Scale Vision-Language-Action Model☆1,875Updated last month
- ☆247Updated 11 months ago
- [IROS2025] OpenGS-Fusion: Open-Vocabulary Dense Mapping with Hybrid 3D Gaussian Splatting for Refined Object-Level Understanding☆75Updated 5 months ago
- [NeurIPS 2025] NAUTILUS: A Large Multimodal Model for Underwater Scene Understanding☆347Updated 3 weeks ago
- ☆94Updated 6 months ago
- [MM 2025] EventVAD: Training-Free Event-Aware Video Anomaly Detection☆518Updated 5 months ago
- [AAAI 2026 Oral] FreeAskWorld is an interactive simulation framework that integrates large language models (LLMs) for high-level plannin…☆194Updated 2 weeks ago
- GigaTrain: An Efficient and Scalable Training Framework for AI Models☆679Updated last month
- ☆18Updated 6 months ago
- [TRO 2024] Grasp, See and Place: Efficient Unknown Object Rearrangement with Policy Structure Prior☆72Updated 9 months ago
- ☆55Updated last month
- A multi-agent debate framework supporting AI-vs-AI and Human-vs-AI modes with customizable models, personas, and role-specific prompts.☆63Updated last month
- OmniNWM: Omniscient Navigation World Models for Autonomous Driving☆265Updated 2 months ago