wentaoyuan / RoboPoint
A Vision-Language Model for Spatial Affordance Prediction in Robotics
☆51 Updated 3 weeks ago
Related projects
Alternatives and complementary repositories for RoboPoint
- [CoRL 2024] RoboEXP: Action-Conditioned Scene Graph via Interactive Exploration for Robotic Manipulation ☆80 Updated last month
- Code for the paper Predicting Point Tracks from Internet Videos enables Diverse Zero-Shot Manipulation ☆60 Updated 3 months ago
- Official implementation of RAM: Retrieval-Based Affordance Transfer for Generalizable Zero-Shot Robotic Manipulation ☆46 Updated 2 weeks ago
- [CoRL 2024] Official repo of `A3VLM: Actionable Articulation-Aware Vision Language Model` ☆88 Updated last month
- ☆36 Updated 3 weeks ago
- ☆28 Updated last week
- ☆31 Updated 10 months ago
- Cross-Embodiment Robot Learning Codebase ☆35 Updated 6 months ago
- MOKA: Open-World Robotic Manipulation through Mark-based Visual Prompting (RSS 2024) ☆43 Updated 3 months ago
- Hand-object interaction Pretraining From Videos ☆60 Updated 2 weeks ago
- [CoRL 24 Oral] D^3Fields: Dynamic 3D Descriptor Fields for Zero-Shot Generalizable Rearrangement ☆121 Updated last week
- [CoRL 2024] Im2Flow2Act: Flow as the Cross-domain Manipulation Interface ☆50 Updated 3 weeks ago
- ☆33 Updated 2 months ago
- ☆57 Updated 3 weeks ago
- Official implementation of GROOT, CoRL 2023 ☆51 Updated last year
- [ECCV 2024] 🎉 Official repository of "Robo-ABC: Affordance Generalization Beyond Categories via Semantic Correspondence for Robot Manipu… ☆44 Updated 2 weeks ago
- [arXiv 2024] Generalizable Humanoid Manipulation with Improved 3D Diffusion Policies. Part 2: Humanoid Teleoperation ☆61 Updated 3 weeks ago
- [CoRL 2024] ClutterGen: A Cluttered Scene Generator for Robot Learning ☆28 Updated last month
- A unified architecture for multimodal multi-task robotic policy learning. ☆121 Updated 9 months ago
- Main augmentation script for real-world robot dataset. ☆31 Updated last year
- [IROS24 Oral] ManipVQA: Injecting Robotic Affordance and Physically Grounded Information into Multi-Modal Large Language Models ☆72 Updated 2 months ago
- Code for BAKU: An Efficient Transformer for Multi-Task Policy Learning ☆74 Updated 4 months ago
- [ICCV 2023] Official code repository for ARNOLD benchmark ☆139 Updated 7 months ago
- SPOC: Imitating Shortest Paths in Simulation Enables Effective Navigation and Manipulation in the Real World ☆91 Updated last week
- Data collection part for ARCap ☆34 Updated this week
- ☆88 Updated last year
- Code release for paper "Autonomous Improvement of Instruction Following Skills via Foundation Models" | CoRL 2024 ☆49 Updated 3 weeks ago
- Official Site for ManiFoundation Model ☆40 Updated 5 months ago
- Dreamitate: Real-World Visuomotor Policy Learning via Video Generation (CoRL 2024) ☆41 Updated 4 months ago
- [RSS 2024] Code for "Multimodal Diffusion Transformer: Learning Versatile Behavior from Multimodal Goals" for CALVIN experiments with pre… ☆63 Updated 3 weeks ago