HandsOnVLM: Vision-Language Models for Hand-Object Interaction Prediction
☆41Sep 15, 2025Updated 5 months ago
Alternatives and similar repositories for HandsOnVLM-release
Users that are interested in HandsOnVLM-release are comparing it to the libraries listed below
Sorting:
- [CVPR'25] How Do I Do That? Synthesizing 3D Hand Motion and Contacts for Everyday Interactions☆32Oct 5, 2025Updated 4 months ago
- Code for FLIP: Flow-Centric Generative Planning for General-Purpose Manipulation Tasks☆79Dec 12, 2024Updated last year
- Official code for "One-Shot Manipulation Strategy Learning by Making Contact Analogies".☆26Feb 7, 2025Updated last year
- Code for the RSS 2023 paper "Energy-based Models are Zero-Shot Planners for Compositional Scene Rearrangement"☆21Jul 4, 2023Updated 2 years ago
- [RSS2025] Code for my paper "You Only Teach Once: Learn One-Shot Bimanual Robotic Manipulation from Video Demonstrations"☆130Jul 12, 2025Updated 7 months ago
- [TASE 2025] Efficient Alignment of Unconditioned Action Prior for Language-conditioned Pick and Place in Clutter☆35Oct 27, 2025Updated 4 months ago
- ☆29Dec 9, 2025Updated 2 months ago
- This is the official repo for [CoRL 2024] Contrastive Imitation Learning for Language-guided Multi-Task Robotic Manipulation☆32Oct 30, 2024Updated last year
- [ICCV2025 Oral] Latent Motion Token as the Bridging Language for Learning Robot Manipulation from Videos☆163Oct 1, 2025Updated 5 months ago
- [CVPR 25] G3Flow: Generative 3D Semantic Flow for Pose-aware and Generalizable Object Manipulation☆93Jun 6, 2025Updated 8 months ago
- ☆39Mar 26, 2025Updated 11 months ago
- [CoRL 2024] Im2Flow2Act: Flow as the Cross-domain Manipulation Interface☆150Oct 17, 2024Updated last year
- [ICRA, 2025] SplatSim: Zero-Shot Sim2Real Transfer of RGB Manipulation Policies Using Gaussian Splatting☆142Sep 4, 2025Updated 5 months ago
- ☆68Jan 8, 2025Updated last year
- [ICLR 2025 Oral] Seer: Predictive Inverse Dynamics Models are Scalable Learners for Robotic Manipulation☆280Jul 8, 2025Updated 7 months ago
- ☆48May 5, 2025Updated 9 months ago
- [ICLR 2025🎉] This is the official implementation of paper "Robots Pre-Train Robots: Manipulation-Centric Robotic Representation from Lar…☆91Jan 22, 2025Updated last year
- ManiCM: Real-time 3D Diffusion Policy via Consistency Model for Robotic Manipulation☆122May 8, 2025Updated 9 months ago
- [CoRL 2024] RoboEXP: Action-Conditioned Scene Graph via Interactive Exploration for Robotic Manipulation☆121Oct 26, 2025Updated 4 months ago
- MOKA: Open-World Robotic Manipulation through Mark-based Visual Prompting (RSS 2024)☆94Jul 16, 2024Updated last year
- ☆37Jan 23, 2026Updated last month
- ☆132Apr 25, 2023Updated 2 years ago
- OpenVLA for AIRBOT☆15Aug 15, 2024Updated last year
- Joint trajectory planning for constrained manipulation using the Closed-Chain Affordance framework by Janak Panthi☆11Jan 19, 2026Updated last month
- [CVPR 2024] Dataset and Code for "Language-driven Grasp Detection."☆48Feb 9, 2025Updated last year
- Code release for SceneReplica paper.☆29Jul 24, 2025Updated 7 months ago
- ☆128Jan 22, 2026Updated last month
- ☆103Dec 4, 2025Updated 2 months ago
- Official repository for "VIP: Towards Universal Visual Reward and Representation via Value-Implicit Pre-Training"☆180Oct 19, 2023Updated 2 years ago
- Splat-MOVER: Multi-Stage, Open-Vocabulary Robotic Manipulation via Editable Gaussian Splatting☆41Oct 2, 2024Updated last year
- DexArt: Benchmarking Generalizable Dexterous Manipulation with Articulated Objects, CVPR 2023☆143Aug 5, 2024Updated last year
- Public release for "Distillation and Retrieving Generalizable Knowledge for Robot Manipulation via Language Corrections"☆49Jun 16, 2024Updated last year
- ☆27Jul 21, 2024Updated last year
- [ECCV 2024] 🎉 Official repository of "Robo-ABC: Affordance Generalization Beyond Categories via Semantic Correspondence for Robot Manipu…☆96Nov 26, 2024Updated last year
- A Vision-Language Model for Spatial Affordance Prediction in Robotics☆213Jul 17, 2025Updated 7 months ago
- Zero-Cost Whole-Body Teleoperation for Mobile Manipulation☆11Mar 4, 2025Updated 11 months ago
- [2023 CoRL] Leveraging 3D Reconstruction for Mechanical Search on Cluttered Shelves☆11Dec 12, 2024Updated last year
- [ICML 2025] OTTER: A Vision-Language-Action Model with Text-Aware Visual Feature Extraction☆115Apr 14, 2025Updated 10 months ago
- ☆75Jan 8, 2025Updated last year