☆17 · Mar 2, 2026 · Updated 3 weeks ago
Alternatives and similar repositories for IRef-VLA
Users that are interested in IRef-VLA are comparing it to the libraries listed below.
- ☆107 · Mar 2, 2026 · Updated 3 weeks ago
- SORT3D, an LLM-based object-centric grounding and indoor navigation system employing a spatial reasoning toolbox and state-of-the-art 2D … ☆91 · Mar 2, 2026 · Updated 3 weeks ago
- [IROS 24] Official repository of "Mind the Error! Detection and Localization of Instruction Errors in Vision-and-Language Navigation". We… ☆18 · Jan 8, 2025 · Updated last year
- Source code for the paper “DAgger Diffusion Navigation: DAgger Boosted Diffusion Policy for Vision-Language Navigation”. ☆30 · Aug 13, 2025 · Updated 7 months ago
- [ICCV 2025] MoMa-Kitchen: A 100K+ Benchmark for Affordance-Grounded Last-Mile Navigation in Mobile Manipulation ☆51 · Oct 14, 2025 · Updated 5 months ago
- CoReVLA: A Dual-Stage End-to-End Autonomous Driving Framework for Long-Tail Scenarios via Collect-and-Refine ☆33 · Feb 2, 2026 · Updated last month
- Learning Dynamic Movement Primitives in Julia ☆16 · Aug 28, 2024 · Updated last year
- ☆10 · Aug 16, 2024 · Updated last year
- MobileVLA-R1: Reinforcing Vision-Language-Action for Mobile Robots ☆84 · Dec 5, 2025 · Updated 3 months ago
- AC-DiT: Adaptive Coordination Diffusion Transformer for Mobile Manipulation ☆37 · Feb 23, 2026 · Updated last month
- CurricuVLM: Towards Safe Autonomous Driving via Personalized Safety-Critical Curriculum Learning with Vision-Language Models ☆19 · Feb 28, 2026 · Updated 3 weeks ago
- [ECCV 2024] Official implementation of "Modelling Competitive Behaviors in Autonomous Driving Under Generative World Model" ☆16 · Jul 15, 2025 · Updated 8 months ago
- Official repo for From Intention to Execution: Probing the Generalization Boundaries of Vision-Language-Action Models ☆32 · Nov 2, 2025 · Updated 4 months ago
- Algorithms for the mobility of decentralized mobile ground swarm robots, as modules that can be combined into complex real-world use-case… ☆11 · May 17, 2021 · Updated 4 years ago
- ☆14 · Apr 12, 2023 · Updated 2 years ago
- ☆14 · Nov 9, 2025 · Updated 4 months ago
- Deep-RL-based safety landing using RGB camera on rough terrains. Exam Project for the ETH course "Perception and Learning for Robotics". ☆13 · Nov 2, 2021 · Updated 4 years ago
- 🎵 An app to sync Spotify playback between users (written before Spotify implemented this feature) ☆10 · Jul 12, 2023 · Updated 2 years ago
- ☆18 · Oct 22, 2024 · Updated last year
- ☆12 · Aug 22, 2024 · Updated last year
- Official Implementation of "Low-Frequency First: Eliminating Floating Artifacts in 3D Gaussian Splatting" (CW2025 Best Paper Honorable Me… ☆24 · Oct 19, 2025 · Updated 5 months ago
- Code for the paper "Adversarial Reinforced Instruction Attacker for Robust Vision-Language Navigation" (TPAMI 2021) ☆10 · Jul 15, 2022 · Updated 3 years ago
- ☆15 · Jun 14, 2025 · Updated 9 months ago
- Project page for the paper "Drive in Corridors: Enhancing the Safety of End-to-end Autonomous Driving via Corridor Learning and Planning" ☆28 · May 8, 2025 · Updated 10 months ago
- ☆15 · Jun 9, 2020 · Updated 5 years ago
- Computer Vision Tutorial ☆16 · Jun 22, 2022 · Updated 3 years ago
- ☆10 · Dec 6, 2019 · Updated 6 years ago
- Official PyTorch implementation for ICML 2025 paper: UP-VLA. ☆57 · Jan 20, 2026 · Updated 2 months ago
- ☆26 · Jun 20, 2025 · Updated 9 months ago
- PointNet and GCNs for pointcloud classification. ☆26 · Jul 19, 2020 · Updated 5 years ago
- ☆10 · Nov 16, 2023 · Updated 2 years ago
- Scaling structural learning with NO-BEARS ☆14 · Dec 30, 2019 · Updated 6 years ago
- 🎧 Keep listening to music with your friends even from a social distance ☆16 · Mar 20, 2021 · Updated 5 years ago
- [ICCV 2025] RoBridge: A Hierarchical Architecture Bridging Cognition and Execution for General Robotic Manipulation ☆36 · Jul 21, 2025 · Updated 8 months ago
- Official implementation of NeurIPS 2022 paper "Learning Active Camera for Multi-Object Navigation" ☆10 · Apr 23, 2023 · Updated 2 years ago
- ☆11 · Jul 16, 2024 · Updated last year
- 3D scene graph generator implemented in PyTorch. ☆83 · Aug 11, 2019 · Updated 6 years ago
- ☆14 · Sep 3, 2023 · Updated 2 years ago
- Code release for Contrastive Gaussian Clustering (CGC), a method for zero-shot 3D scene segmentation. ☆14 · Aug 8, 2024 · Updated last year