silence143 / EMMOELinks
EMMOE: A Comprehensive Benchmark for Embodied Mobile Manipulation in Open Environments
☆15Updated 3 weeks ago
Alternatives and similar repositories for EMMOE
Users that are interested in EMMOE are comparing it to the libraries listed below
Sorting:
- Official implementation of "OneTwoVLA: A Unified Vision-Language-Action Model with Adaptive Reasoning"☆73Updated last week
- [NeurIPS 2024 D&B] Point Cloud Matters: Rethinking the Impact of Different Observation Spaces on Robot Learning☆78Updated 7 months ago
- Code for FLIP: Flow-Centric Generative Planning for General-Purpose Manipulation Tasks☆66Updated 5 months ago
- ☆72Updated 9 months ago
- [CVPR 2024] Situational Awareness Matters in 3D Vision Language Reasoning☆38Updated 5 months ago
- Dreamitate: Real-World Visuomotor Policy Learning via Video Generation (CoRL 2024)☆44Updated 11 months ago
- [ICLR 2025 Spotlight] Grounding Video Models to Actions through Goal Conditioned Exploration☆48Updated last month
- [arXiv 2025] GLEAM: Learning Generalizable Exploration Policy for Active Mapping in Complex 3D Indoor Scene☆48Updated this week
- ☆49Updated 8 months ago
- HandsOnVLM: Vision-Language Models for Hand-Object Interaction Prediction☆31Updated 5 months ago
- Official Implementation of Learning Navigational Visual Representations with Semantic Map Supervision (ICCV2023)☆25Updated last year
- [RAL 2024] OpenObj: Open-Vocabulary Object-Level Neural Radiance Fields with Fine-Grained Understanding☆27Updated 3 months ago
- Official PyTorch implementation of Doduo: Dense Visual Correspondence from Unsupervised Semantic-Aware Flow☆44Updated last year
- Code for Ditto in the House: Building Articulation Models of Indoor Scenes through Interactive Perception☆17Updated last year
- Official implementation of "SUGAR: Pre-training 3D Visual Representations for Robotics" (CVPR'24).☆39Updated 10 months ago
- [ICRA2023] Grounding Language with Visual Affordances over Unstructured Data☆43Updated last year
- EgoVid-5M: A Large-Scale Video-Action Dataset for Egocentric Video Generation☆104Updated 6 months ago
- List of papers on video-centric robot learning☆20Updated 6 months ago
- [CVPR 25] G3Flow: Generative 3D Semantic Flow for Pose-aware and Generalizable Object Manipulation☆68Updated 2 months ago
- Implementation of our ICCV 2023 paper DREAMWALKER: Mental Planning for Continuous Vision-Language Navigation☆19Updated last year
- code for the paper Predicting Point Tracks from Internet Videos enables Diverse Zero-Shot Manipulation☆87Updated 10 months ago
- [CVPR 2024] Official repository for "Tactile-Augmented Radiance Fields".☆61Updated 3 months ago
- FLaRe: Achieving Masterful and Adaptive Robot Policies with Large-Scale Reinforcement Learning Fine-Tuning☆21Updated 5 months ago
- [CVPR 2025] Tra-MoE: Learning Trajectory Prediction Model from Multiple Domains for Adaptive Policy Conditioning☆34Updated 2 months ago
- Official implementation of SAME: Learning Generic Language-Guided Visual Navigation with State-Adaptive Mixture of Experts☆18Updated 5 months ago
- Code Release of "3D Concept Grounding on Neural Fields (NeurIPS2022)"☆15Updated 2 years ago
- Official implementation of "Re3Sim: Generating High-Fidelity Simulation Data via 3D-Photorealistic Real-to-Sim for Robotic Manipulation"☆99Updated 2 months ago
- LOCATE: Localize and Transfer Object Parts for Weakly Supervised Affordance Grounding (CVPR 2023)☆38Updated 2 years ago
- IROS 2024 | PreAfford: Universal Affordance-Based Pre-grasping for Diverse Objects and Scenes☆11Updated 8 months ago
- Code & data for "RoboGround: Robotic Manipulation with Grounded Vision-Language Priors" (CVPR 2025)☆16Updated 2 weeks ago