shengliangd / StereoVLALinks
StereoVLA is powered by stereo vision and supports flexible deployment with high tolerance to camera pose variations.
☆25Updated last week
Alternatives and similar repositories for StereoVLA
Users that are interested in StereoVLA are comparing it to the libraries listed below
Sorting:
- [CVPR 2025 highlight] Generating 6DoF Object Manipulation Trajectories from Action Description in Egocentric Vision☆32Updated last month
- ☆34Updated 3 weeks ago
- Official implementation of CEED-VLA: Consistency Vision-Language-Action Model with Early-Exit Decoding.☆46Updated 3 months ago
- The official implementation of Mantis: A Versatile Vision-Language-Action Model with Disentangled Visual Foresight☆72Updated last month
- Code for "ACG: Action Coherence Guidance for Flow-based VLA Models"☆44Updated 2 months ago
- ☆113Updated this week
- A toolbox for real-to-sim reconstruction and robotic simulation☆183Updated this week
- ☆14Updated 7 months ago
- Code for "High-Fidelity Simulated Data Generation for Real-World Zero-Shot Robotic Manipulation Learning with Gaussian Splatting"☆46Updated this week
- EO: Open-source Unified Embodied Foundation Model Series☆34Updated 2 weeks ago
- [NeurIPS 2025] Source codes for the paper "MindJourney: Test-Time Scaling with World Models for Spatial Reasoning"☆121Updated 2 months ago
- [ICCV 2025] GLEAM: Learning Generalizable Exploration Policy for Active Mapping in Complex 3D Indoor Scene☆156Updated this week
- ☆22Updated this week
- EvoWorld: Evolving Panoramic World Generation with Explicit 3D Memory☆57Updated 2 months ago
- HiF-VLA: An efficient, bidirectional spatiotemporal expansion Vision-Language-Action Model☆37Updated 3 weeks ago
- Official Implementation of Paper: WMPO: World Model-based Policy Optimization for Vision-Language-Action Models☆96Updated this week
- ☆123Updated last month
- [CVPR 2025] VidBot: Learning Generalizable 3D Actions from In-the-Wild 2D Human Videos for Zero-Shot Robotic Manipulation☆42Updated 6 months ago
- ReSemAct: Advancing Fine-Grained Robotic Manipulation via Semantic Structuring and Affordance Refinement☆15Updated this week
- Sim2real robot manipulation utilizing GS modeling☆13Updated 10 months ago
- Code Repository for ControlVLA, CoRL2025.☆81Updated 2 months ago
- Code implementation of the paper "World-in-World: World Models in a Closed-Loop World"☆121Updated 2 weeks ago
- A unified robotic manipulation learning framework☆21Updated 4 months ago
- Pi0-VLA Repository of "MotionTrans: Human VR Data Enable Motion-Level Learning for Robotic Manipulation Policies"☆24Updated 3 months ago
- Codebase for paper "Geometry-aware 4D Video Generation for Robot Manipulation"☆67Updated 3 weeks ago
- Official implementation for BitVLA: 1-bit Vision-Language-Action Models for Robotics Manipulation☆100Updated 5 months ago
- ☆30Updated 4 months ago
- [CVPR 2025 Highlight] Towards Autonomous Micromobility through Scalable Urban Simulation☆153Updated last month
- Code for the robot-assisted feeding project at EmPRISE Lab☆24Updated 2 weeks ago
- Evo-0: Vision-Language-Action Model with Implicit Spatial Understanding.☆51Updated last month