oceanhao / CoNavLinks
CoNav : Collaborative Cross-Modal Reasoning for Embodied Navigation
☆17Updated 6 months ago
Alternatives and similar repositories for CoNav
Users that are interested in CoNav are comparing it to the libraries listed below
Sorting:
- Official implementation for "HA-VLN: A Benchmark for Human-Aware Navigation in Discrete-Continuous Environments with Dynamic Multi-Human …☆368Updated last month
- 🔥 The first open-sourced diffusion vision-langauge-action model.☆134Updated this week
- [EMNLP2025]Official implementation: Agent-style vision question answer in Autonomous Driving!☆133Updated 2 months ago
- GigaBrain-0: A World Model-Powered Vision-Language-Action Model☆603Updated 2 weeks ago
- [ICML 2025 Poster] Official PyTorch Implementation of "Habitizing Diffusion Planning for Efficient and Effective Decision Making"☆35Updated 6 months ago
- Awesome collection of resources and papers on Diffusion Models for Robotic Manipulation.☆738Updated 3 months ago
- GigaWorld-0: World Models as Data Engine to Empower Embodied AI☆605Updated last week
- ☆223Updated last month
- DeepThinkVLA: Enhancing Reasoning Capability of Vision-Language-Action Models☆450Updated this week
- VLA-Adapter: An Effective Paradigm for Tiny-Scale Vision-Language-Action Model☆1,784Updated 3 weeks ago
- Official implementation of paper "Unified World Models: Memory-Augmented Planning and Foresight for Visual Navigation"☆259Updated last month
- This repository serves as a central navigator for the various components of my Final Year Project (FYP).☆22Updated 6 months ago
- ☆545Updated last month
- ☆94Updated 5 months ago
- Logic-in-frames: Dynamic keyframe search via visual semantic-logical verification for long video understanding☆56Updated 3 weeks ago
- RealMirror, a comprehensive, open-source embodied AI VLA platform.☆104Updated this week
- [AAAI 2026 Oral] FreeAskWorld is an interactive simulation framework that integrates large language models (LLMs) for high-level plannin…☆140Updated last week
- OmniNWM: Omniscient Navigation World Models for Autonomous Driving☆260Updated last month
- [IROS2025] OpenGS-Fusion: Open-Vocabulary Dense Mapping with Hybrid 3D Gaussian Splatting for Refined Object-Level Understanding☆74Updated 4 months ago
- A Python tool to crawl historical arXiv papers from specified categories, filter them using a custom LLM prompt via Alibaba Cloud's DashS…☆20Updated 5 months ago
- [TRO 2024] Grasp, See and Place: Efficient Unknown Object Rearrangement with Policy Structure Prior☆72Updated 8 months ago
- [NeurIPS 2025] NAUTILUS: A Large Multimodal Model for Underwater Scene Understanding☆341Updated last month
- ☆248Updated 11 months ago
- [CoRL2024] Let Occ Flow: Self-Supervised 3D Occupancy Flow Prediction☆126Updated 2 months ago
- GigaTrain: An Efficient and Scalable Training Framework for AI Models☆252Updated last week
- ☆34Updated last year
- Embodied Co-Design for Rapidly Evolving Agents: Taxonomy, Frontiers, and Challenges☆255Updated last week
- [NeurIPS2024] MVGamba: Unify 3D Content Generation as State Space Sequence Modeling☆65Updated last year
- ☆55Updated 2 weeks ago
- 🐾 PawHaven — An open-source, enterprise-ready full-stack project powered by React, NestJS, and pnpm, featuring a Monorepo architecture t…☆86Updated this week