Koorye / InspireLinks
Official implemetation of the paper "InSpire: Vision-Language-Action Models with Intrinsic Spatial Reasoning"
☆34Updated last week
Alternatives and similar repositories for Inspire
Users that are interested in Inspire are comparing it to the libraries listed below
Sorting:
- ☆55Updated 4 months ago
- ICCV2025☆105Updated this week
- Code for FLIP: Flow-Centric Generative Planning for General-Purpose Manipulation Tasks☆67Updated 7 months ago
- [CVPR 2025] Tra-MoE: Learning Trajectory Prediction Model from Multiple Domains for Adaptive Policy Conditioning☆39Updated 3 months ago
- Code & data for "RoboGround: Robotic Manipulation with Grounded Vision-Language Priors" (CVPR 2025)☆21Updated last month
- 🦾 A Dual-System VLA with System2 Thinking☆66Updated last week
- [ICML 2025] OTTER: A Vision-Language-Action Model with Text-Aware Visual Feature Extraction☆88Updated 3 months ago
- ☆24Updated last week
- Official implementation of "OneTwoVLA: A Unified Vision-Language-Action Model with Adaptive Reasoning"☆147Updated last month
- [CoRL2024] Official repo of `A3VLM: Actionable Articulation-Aware Vision Language Model`☆114Updated 9 months ago
- DreamVLA: A Vision-Language-Action Model Dreamed with Comprehensive World Knowledge☆71Updated this week
- Fast-in-Slow: A Dual-System Foundation Model Unifying Fast Manipulation within Slow Reasoning☆53Updated last week
- List of papers on video-centric robot learning☆21Updated 8 months ago
- [ICCV 2025] Official implementation of SAME: Learning Generic Language-Guided Visual Navigation with State-Adaptive Mixture of Experts☆21Updated 7 months ago
- SoFar: Language-Grounded Orientation Bridges Spatial Reasoning and Object Manipulation☆176Updated 2 weeks ago
- ☆49Updated 7 months ago
- Emma-X: An Embodied Multimodal Action Model with Grounded Chain of Thought and Look-ahead Spatial Reasoning☆68Updated 2 months ago
- ☆75Updated 10 months ago
- [ICCV 2025] Latent Motion Token as the Bridging Language for Robot Manipulation☆110Updated 2 months ago
- [ICCV 2025] VQ-VLA: Improving Vision-Language-Action Models via Scaling Vector-Quantized Action Tokenizers☆51Updated 2 weeks ago
- Implementation of our ICCV 2023 paper DREAMWALKER: Mental Planning for Continuous Vision-Language Navigation☆19Updated last year
- Official implementation of "RoboRefer: Towards Spatial Referring with Reasoning in Vision-Language Models for Robotics"☆96Updated last week
- [CVPR 2025] Official implementation of "GenManip: LLM-driven Simulation for Generalizable Instruction-Following Manipulation"☆45Updated 3 weeks ago
- HandsOnVLM: Vision-Language Models for Hand-Object Interaction Prediction☆32Updated 6 months ago
- [NeurIPS 2024 D&B] Point Cloud Matters: Rethinking the Impact of Different Observation Spaces on Robot Learning☆82Updated 9 months ago
- LOCATE: Localize and Transfer Object Parts for Weakly Supervised Affordance Grounding (CVPR 2023)☆39Updated 2 years ago
- [CVPR 2025]Lift3D Foundation Policy: Lifting 2D Large-Scale Pretrained Models for Robust 3D Robotic Manipulation☆151Updated 3 weeks ago
- [CVPR 25] G3Flow: Generative 3D Semantic Flow for Pose-aware and Generalizable Object Manipulation☆77Updated last month
- Official implementation of "SUGAR: Pre-training 3D Visual Representations for Robotics" (CVPR'24).☆40Updated 3 weeks ago
- ☆28Updated 2 months ago