A Vision-Language Model for Spatial Affordance Prediction in Robotics
β213Jul 17, 2025Updated 7 months ago
Alternatives and similar repositories for RoboPoint
Users that are interested in RoboPoint are comparing it to the libraries listed below
Sorting:
- [ICLR 2025π] This is the official implementation of paper "Robots Pre-Train Robots: Manipulation-Centric Robotic Representation from Larβ¦β91Jan 22, 2025Updated last year
- [CoRL 24] GenDP: 3D Semantic Fields for Category-Level Generalizable Diffusion Policyβ106Oct 24, 2024Updated last year
- Official implementation of RAM: Retrieval-Based Affordance Transfer for Generalizable Zero-Shot Robotic Manipulationβ99Dec 30, 2024Updated last year
- [ECCV 2024] π Official repository of "Robo-ABC: Affordance Generalization Beyond Categories via Semantic Correspondence for Robot Manipuβ¦β96Nov 26, 2024Updated last year
- [CoRL 2024] Im2Flow2Act: Flow as the Cross-domain Manipulation Interfaceβ150Oct 17, 2024Updated last year
- β132Apr 25, 2023Updated 2 years ago
- MOKA: Open-World Robotic Manipulation through Mark-based Visual Prompting (RSS 2024)β94Jul 16, 2024Updated last year
- β89Sep 23, 2025Updated 5 months ago
- β75Jan 8, 2025Updated last year
- RialTo Policy Learning Pipelineβ198Sep 17, 2024Updated last year
- Code for the paper "3D Diffuser Actor: Policy Diffusion with 3D Scene Representations"β384Aug 17, 2024Updated last year
- ReKep: Spatio-Temporal Reasoning of Relational Keypoint Constraints for Robotic Manipulationβ911Feb 20, 2025Updated last year
- [NeurIPS 2024] CLOVER: Closed-Loop Visuomotor Control with Generative Expectation for Robotic Manipulationβ133Sep 8, 2025Updated 5 months ago
- [CoRL 2024] RoboEXP: Action-Conditioned Scene Graph via Interactive Exploration for Robotic Manipulationβ121Oct 26, 2025Updated 4 months ago
- [RSS 2024] 3D Diffusion Policy: Generalizable Visuomotor Policy Learning via Simple 3D Representationsβ1,262Oct 17, 2025Updated 4 months ago
- Official implementation of "Re3Sim: Generating High-Fidelity Simulation Data via 3D-Photorealistic Real-to-Sim for Robotic Manipulation"β133Sep 18, 2025Updated 5 months ago
- Evaluating and reproducing real-world robot manipulation policies (e.g., RT-1, RT-1-X, Octo) in simulation under common setups (e.g., Gooβ¦β980Dec 20, 2025Updated 2 months ago
- A unified architecture for multimodal multi-task robotic policy learning.β176Feb 2, 2024Updated 2 years ago
- [RSS2025] Code for my paper "You Only Teach Once: Learn One-Shot Bimanual Robotic Manipulation from Video Demonstrations"β130Jul 12, 2025Updated 7 months ago
- [ICRA 2025] RACER: Rich Language-Guided Failure Recovery Policies for Imitation Learningβ41Oct 10, 2024Updated last year
- [RSS25] Official implementation of DemoGen: Synthetic Demonstration Generation for Data-Efficient Visuomotor Policy Learningβ238Jul 18, 2025Updated 7 months ago
- [CoRL 24 Oral] D^3Fields: Dynamic 3D Descriptor Fields for Zero-Shot Generalizable Rearrangementβ180Nov 2, 2024Updated last year
- Code for "Scalable Real2Sim: Physics-Aware Asset Generation Via Robotic Pick-and-Place Setups", IROS 2025β118Jul 9, 2025Updated 7 months ago
- The official repo for "SpatialBot: Precise Spatial Understanding with Vision Language Models.β335Sep 14, 2025Updated 5 months ago
- [CVPR 25] G3Flow: Generative 3D Semantic Flow for Pose-aware and Generalizable Object Manipulationβ93Jun 6, 2025Updated 8 months ago
- Repository for "General Flow as Foundation Affordance for Scalable Robot Learning"β70Dec 20, 2024Updated last year
- Dreamitate: Real-World Visuomotor Policy Learning via Video Generation (CoRL 2024)β58Jun 7, 2025Updated 8 months ago
- This code corresponds to simulation environments used as part of the DexMimicGen project.β214Dec 6, 2025Updated 2 months ago
- Official implementation for paper "EquiBot: SIM(3)-Equivariant Diffusion Policy for Generalizable and Data Efficient Learning".β169Jul 2, 2024Updated last year
- [ICLR 2025] LAPA: Latent Action Pretraining from Videosβ472Jan 22, 2025Updated last year
- β62Dec 14, 2024Updated last year
- HandsOnVLM: Vision-Language Models for Hand-Object Interaction Predictionβ41Sep 15, 2025Updated 5 months ago
- The official codebase for ManipLLM: Embodied Multimodal Large Language Model for Object-Centric Robotic Manipulation(cvpr 2024)β146Jul 9, 2024Updated last year
- Spatial Aptitude Training for Multimodal Langauge Modelsβ24Feb 8, 2026Updated 3 weeks ago
- [IROS 2025] Generalizable Humanoid Manipulation with 3D Diffusion Policies. Part 2: Humanoid Teleoperationβ195Feb 19, 2025Updated last year
- A curated list of 3D Vision papers relating to Robotics domain in the era of large models i.e. LLMs/VLMs, inspired by awesome-computer-viβ¦β796Dec 17, 2025Updated 2 months ago
- Official Repository for SAM2Actβ225Aug 23, 2025Updated 6 months ago
- DROID Policy Learning and Evaluationβ270Apr 22, 2025Updated 10 months ago
- β49Nov 28, 2024Updated last year