DAVIAN-Robotics / ACGLinks
Code for "ACG: Action Coherence Guidance for Flow-based VLA Models" (ICRA 2026)
☆59Updated last week
Alternatives and similar repositories for ACG
Users that are interested in ACG are comparing it to the libraries listed below
Sorting:
- code for CoRL2025 "LaDiWM: A Latent Diffusion-based World Model for Predictive Manipulation"☆44Updated 2 months ago
- ☆30Updated last year
- ☆36Updated last month
- ☆27Updated last month
- ☆14Updated 8 months ago
- [RA-L] Lost & Found dynamically tracks object poses from egocentric videos while updating a scene graph, enabling richer semantic 3D unde…☆54Updated 4 months ago
- ☆103Updated 3 weeks ago
- Describe Anything, Anywhere, at Any Moment (DAAAM), a novel approach to real-time, large-scale, spatio-temporal memory☆144Updated 2 months ago
- Object-Relative Navigation: ObjectReact [CoRL'25] | TANGO [ICRA'25] | RoboHop [ICRA'24]☆36Updated 4 months ago
- Evo-0: Vision-Language-Action Model with Implicit Spatial Understanding.☆53Updated 2 months ago
- [CoRL 2024] Tag Map: A Text-Based Map for Spatial Reasoning and Navigation with Large Language Models☆32Updated last year
- This repository is the official implementation of our paper (From reactive to cognitive: brain-inspired spatial intelligence for embodied…☆76Updated 3 months ago
- [CVPR 2025] VidBot: Learning Generalizable 3D Actions from In-the-Wild 2D Human Videos for Zero-Shot Robotic Manipulation☆43Updated 7 months ago
- HiF-VLA: An efficient, bidirectional spatiotemporal expansion Vision-Language-Action Model☆46Updated last month
- [IROS 2025] DynamicPose: Real-time and Robust 6D Object Pose Tracking for Fast-Moving Cameras and Objects☆63Updated 4 months ago
- N2M: Bridging Navigation and Manipulation by Learning Initial Pose Preference from Rollout☆28Updated 2 months ago
- Sim2real robot manipulation utilizing GS modeling☆14Updated 11 months ago
- Legged Open-Vocabulary Object Navigator☆81Updated 3 months ago
- Implementation of Radiance Fields for Robotic Teleoperation☆59Updated 8 months ago
- Official repository for LeLaN training and inference code☆130Updated last year
- [ICML 2025] Official implementation of Spherical Diffusion Policy: A SE(3) Equivariant Visuomotor Policy with Spherical Fourier Represent…☆38Updated 7 months ago
- [ICCV 2025] IGL-Nav: Incremental 3D Gaussian Localization for Image-goal Navigation☆63Updated 6 months ago
- X-Sim: Cross-Embodiment Learning via Real-to-Sim-to-Real☆56Updated last month
- [RA-L 2025] Dynamic Open-Vocabulary 3D Scene Graphs for Long-term Language-Guided Mobile Manipulation☆137Updated 9 months ago
- ☆30Updated 3 weeks ago
- [ICCV 2025] GLEAM: Learning Generalizable Exploration Policy for Active Mapping in Complex 3D Indoor Scene☆167Updated 3 weeks ago
- [CVPR 2025 highlight] Generating 6DoF Object Manipulation Trajectories from Action Description in Egocentric Vision☆33Updated 2 months ago
- source code and trained models for DeFM (Depth Foundation Model)☆85Updated this week
- [RAL-25] An online open-vocabulary mapping system that enables natural language querying to navigate dynamic scenes, with ROS support.☆151Updated last month
- Official implementation of "Dynam3D: Dynamic Layered 3D Tokens Empower VLM for Vision-and-Language Navigation" (NeurIPS'25 Oral)☆75Updated last month