oceanhao / CoNavLinks
CoNav : Collaborative Cross-Modal Reasoning for Embodied Navigation
☆17Updated 8 months ago
Alternatives and similar repositories for CoNav
Users that are interested in CoNav are comparing it to the libraries listed below
Sorting:
- Official implementation for "HA-VLN 2.0: An Open Benchmark and Leaderboard for Human-Aware Navigation in Discrete and Continuous Environm…☆378Updated last month
- 🔥 The first open-sourced diffusion vision-langauge-action model.☆160Updated 3 weeks ago
- [EMNLP2025]Official implementation: Agent-style vision question answer in Autonomous Driving!☆136Updated 4 months ago
- DeepThinkVLA: Enhancing Reasoning Capability of Vision-Language-Action Models☆482Updated 2 weeks ago
- [ICML 2025 Poster] Official PyTorch Implementation of "Habitizing Diffusion Planning for Efficient and Effective Decision Making"☆35Updated 8 months ago
- Official implementation of paper "Unified World Models: Memory-Augmented Planning and Foresight for Visual Navigation"☆270Updated 3 months ago
- Awesome collection of resources and papers on Diffusion Models for Robotic Manipulation.☆757Updated 5 months ago
- ☆223Updated 3 months ago
- ☆545Updated 3 months ago
- GigaWorld-0: World Models as Data Engine to Empower Embodied AI☆1,439Updated 2 months ago
- Official code of Motus: A Unified Latent Action World Model☆616Updated last month
- WAM-Diff: A Masked Diffusion VLA Framework with MoE and Online Reinforcement Learning for Autonomous Driving☆166Updated this week
- ☆93Updated 6 months ago
- GigaBrain-0: A World Model-Powered Vision-Language-Action Model☆2,236Updated this week
- This repository serves as a central navigator for the various components of my Final Year Project (FYP).☆24Updated last month
- ☆246Updated last year
- WAM-Flow: Parallel Coarse-to-Fine Motion Planning via Discrete Flow Matching for Autonomous Driving☆159Updated last month
- 🌐 WorldLens: Full-Spectrum Evaluations of Driving World Models in Real World☆178Updated 2 weeks ago
- [AAAI 2026 Oral] FreeAskWorld is an interactive simulation framework that integrates large language models (LLMs) for high-level plannin…☆216Updated 3 weeks ago
- Logic-in-frames: Dynamic keyframe search via visual semantic-logical verification for long video understanding☆58Updated 2 months ago
- A Python tool to crawl historical arXiv papers from specified categories, filter them using a custom LLM prompt via Alibaba Cloud's DashS…☆20Updated 6 months ago
- [TRO 2024] Grasp, See and Place: Efficient Unknown Object Rearrangement with Policy Structure Prior☆73Updated 10 months ago
- ☆19Updated 9 months ago
- [ICRA2024] The official implementation of Robot Trajectron☆111Updated last month
- [NeurIPS 2025] NAUTILUS: A Large Multimodal Model for Underwater Scene Understanding☆350Updated last month
- SCFlow2: Plug-and-Play Object Pose Refiner with Shape-Constraint Scene Flow, CVPR2025☆55Updated 5 months ago
- [CoRL2024] Let Occ Flow: Self-Supervised 3D Occupancy Flow Prediction☆130Updated 3 months ago
- The accepted paper for cvpr2025.☆55Updated last month
- This repository contains the source code for our paper: "PrefMMT: Modeling Human Preferences in Preference-based Reinforcement Learning w…☆51Updated 11 months ago
- This is the source code for the ECCV paper "MTFormer: Multi-Task Learning via Transformer and Cross-Task Reasoning"☆199Updated 3 years ago