A Vision-Language Model for Spatial Affordance Prediction in Robotics
β214Jul 17, 2025Updated 8 months ago
Alternatives and similar repositories for RoboPoint
Users that are interested in RoboPoint are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [ICLR 2025π] This is the official implementation of paper "Robots Pre-Train Robots: Manipulation-Centric Robotic Representation from Larβ¦β93Jan 22, 2025Updated last year
- [CoRL 24] GenDP: 3D Semantic Fields for Category-Level Generalizable Diffusion Policyβ107Oct 24, 2024Updated last year
- Official implementation of RAM: Retrieval-Based Affordance Transfer for Generalizable Zero-Shot Robotic Manipulationβ99Dec 30, 2024Updated last year
- [CoRL 2024] Im2Flow2Act: Flow as the Cross-domain Manipulation Interfaceβ153Oct 17, 2024Updated last year
- [ECCV 2024] π Official repository of "Robo-ABC: Affordance Generalization Beyond Categories via Semantic Correspondence for Robot Manipuβ¦β99Nov 26, 2024Updated last year
- β90Sep 23, 2025Updated 6 months ago
- MOKA: Open-World Robotic Manipulation through Mark-based Visual Prompting (RSS 2024)β95Jul 16, 2024Updated last year
- Spatial Aptitude Training for Multimodal Langauge Modelsβ25Feb 8, 2026Updated last month
- β133Apr 25, 2023Updated 2 years ago
- ReKep: Spatio-Temporal Reasoning of Relational Keypoint Constraints for Robotic Manipulationβ921Feb 20, 2025Updated last year
- Code for the paper "3D Diffuser Actor: Policy Diffusion with 3D Scene Representations"β384Aug 17, 2024Updated last year
- β75Jan 8, 2025Updated last year
- [ICRA 2025] RACER: Rich Language-Guided Failure Recovery Policies for Imitation Learningβ42Oct 10, 2024Updated last year
- [NeurIPS 2024] CLOVER: Closed-Loop Visuomotor Control with Generative Expectation for Robotic Manipulationβ133Sep 8, 2025Updated 6 months ago
- [CoRL 2024] RoboEXP: Action-Conditioned Scene Graph via Interactive Exploration for Robotic Manipulationβ123Oct 26, 2025Updated 4 months ago
- A unified architecture for multimodal multi-task robotic policy learning.β177Feb 2, 2024Updated 2 years ago
- β58Apr 18, 2025Updated 11 months ago
- [ICRA 2026] Re3Sim: Generating High-Fidelity Simulation Data via 3D-Photorealistic Real-to-Sim for Robotic Manipulationβ135Mar 16, 2026Updated last week
- RialTo Policy Learning Pipelineβ201Sep 17, 2024Updated last year
- Repository for "General Flow as Foundation Affordance for Scalable Robot Learning"β70Dec 20, 2024Updated last year
- The official repo for "SpatialBot: Precise Spatial Understanding with Vision Language Models.β338Sep 14, 2025Updated 6 months ago
- [RSS 2024] 3D Diffusion Policy: Generalizable Visuomotor Policy Learning via Simple 3D Representationsβ1,294Oct 17, 2025Updated 5 months ago
- [CVPR 25] G3Flow: Generative 3D Semantic Flow for Pose-aware and Generalizable Object Manipulationβ93Jun 6, 2025Updated 9 months ago
- [CoRL 24 Oral] D^3Fields: Dynamic 3D Descriptor Fields for Zero-Shot Generalizable Rearrangementβ181Nov 2, 2024Updated last year
- Evaluating and reproducing real-world robot manipulation policies (e.g., RT-1, RT-1-X, Octo) in simulation under common setups (e.g., Gooβ¦β1,005Dec 20, 2025Updated 3 months ago
- [RSS25] Official implementation of DemoGen: Synthetic Demonstration Generation for Data-Efficient Visuomotor Policy Learningβ240Jul 18, 2025Updated 8 months ago
- [ICRA 2024] Language-Conditioned Affordance-Pose Detection in 3D Point Cloudsβ51Jan 10, 2025Updated last year
- [RSS2025] Code for my paper "You Only Teach Once: Learn One-Shot Bimanual Robotic Manipulation from Video Demonstrations"β133Jul 12, 2025Updated 8 months ago
- Official implementation of "Towards Generalizable Vision-Language Robotic Manipulation: A Benchmark and LLM-guided 3D Policy."β126Oct 23, 2025Updated 5 months ago
- Official code for "One-Shot Manipulation Strategy Learning by Making Contact Analogies".β26Feb 7, 2025Updated last year
- [ICLR 2025] LAPA: Latent Action Pretraining from Videosβ492Jan 22, 2025Updated last year
- Official Repository for SAM2Actβ228Aug 23, 2025Updated 7 months ago
- β63Dec 14, 2024Updated last year
- VoxAct-B: Voxel-Based Acting and Stabilizing Policy for Bimanual Manipulation (CoRL 2024)β52Oct 25, 2024Updated last year
- A curated list of 3D Vision papers relating to Robotics domain in the era of large models i.e. LLMs/VLMs, inspired by awesome-computer-viβ¦β802Dec 17, 2025Updated 3 months ago
- This code corresponds to simulation environments used as part of the DexMimicGen project.β223Dec 6, 2025Updated 3 months ago
- Official Implementation of the paper RiEMann: Near Real-Time SE(3)-Equivariant Robot Manipulation without Point Cloud Segmentationβ39Jan 13, 2026Updated 2 months ago
- The official codebase for ManipLLM: Embodied Multimodal Large Language Model for Object-Centric Robotic Manipulation(cvpr 2024)β148Jul 9, 2024Updated last year
- HandsOnVLM: Vision-Language Models for Hand-Object Interaction Predictionβ41Sep 15, 2025Updated 6 months ago