silence143 / EMMOE
EMMOE: A Comprehensive Benchmark for Embodied Mobile Manipulation in Open Environments
☆23Updated 6 months ago
Alternatives and similar repositories for EMMOE
Users who are interested in EMMOE are comparing it to the repositories listed below
- HandsOnVLM: Vision-Language Models for Hand-Object Interaction Prediction☆42Updated 2 months ago
- [CoRL 2025] Robot Learning from Any Images☆34Updated last month
- VLA-RFT: Vision-Language-Action Models with Reinforcement Fine-Tuning☆103Updated 2 months ago
- Official Implementation for “CordViP: Correspondence-based Visuomotor Policy for Dexterous Manipulation in Real-World” (RSS 2025).☆38Updated 2 weeks ago
- [NeurIPS 2024 D&B] Point Cloud Matters: Rethinking the Impact of Different Observation Spaces on Robot Learning☆89Updated last year
- Implementation of Prompting with the Future: Open-World Model Predictive Control with Interactive Digital Twins. [RSS 2025]☆45Updated last month
- Official Implementation of Paper: WMPO: World Model-based Policy Optimization for Vision-Language-Action Models☆75Updated 2 weeks ago
- DSPv2: Improved Dense Policy for Effective and Generalizable Whole-body Mobile Manipulation☆27Updated this week
- Official PyTorch implementation for ICML 2025 paper: UP-VLA.☆51Updated 6 months ago
- Code Repository for ControlVLA, CoRL2025.☆77Updated last month
- [CVPR 2025] VidBot: Learning Generalizable 3D Actions from In-the-Wild 2D Human Videos for Zero-Shot Robotic Manipulation☆40Updated 5 months ago
- View-Invariant Policy Learning via Zero-Shot Novel View Synthesis (CoRL 2024)☆25Updated 2 months ago
- [ICLR 2025 Spotlight] Grounding Video Models to Actions through Goal Conditioned Exploration☆57Updated 7 months ago
- MAPLE infuses dexterous manipulation priors from egocentric videos into vision encoders, making their features well-suited for downstream…☆28Updated 8 months ago
- Official implementation of Spatial-Forcing: Implicit Spatial Representation Alignment for Vision-language-action Model☆144Updated last week
- [NeurIPS 2025] Source codes for the paper "MindJourney: Test-Time Scaling with World Models for Spatial Reasoning"☆108Updated last month
- Geometry-aware 4D Video Generation for Robot Manipulation☆66Updated 3 months ago
- [CVPR 2025] Tra-MoE: Learning Trajectory Prediction Model from Multiple Domains for Adaptive Policy Conditioning☆51Updated 8 months ago
- [CVPR 25] G3Flow: Generative 3D Semantic Flow for Pose-aware and Generalizable Object Manipulation☆90Updated 6 months ago
- Official implementation of "SUGAR: Pre-training 3D Visual Representations for Robotics" (CVPR'24).☆45Updated 5 months ago
- PR2 is a humanoid robot testbed designed for both entry-level students and professional users with supports in bipedal locomotion, multi-…☆25Updated this week
- Code & data for "RoboGround: Robotic Manipulation with Grounded Vision-Language Priors" (CVPR 2025)☆32Updated 6 months ago
- [ICCV 2025] RAGNet: Large-scale Reasoning-based Affordance Segmentation Benchmark towards General Grasping☆30Updated 3 weeks ago
- Code for "AffordanceLLM: Grounding Affordance from Vision Language Models"☆15Updated last year
- Code for FLIP: Flow-Centric Generative Planning for General-Purpose Manipulation Tasks☆80Updated last year
- Implementation of our ICCV 2023 paper DREAMWALKER: Mental Planning for Continuous Vision-Language Navigation☆20Updated 2 years ago