Kami-code / HandsOnVLM-release
HandsOnVLM: Vision-Language Models for Hand-Object Interaction Prediction
☆24Updated last month
Alternatives and similar repositories for HandsOnVLM-release:
Users that are interested in HandsOnVLM-release are comparing it to the libraries listed below
- Code for paper "Grounding Video Models to Actions through Goal Conditioned Exploration".☆41Updated last month
- Dreamitate: Real-World Visuomotor Policy Learning via Video Generation (CoRL 2024)☆43Updated 7 months ago
- Code for FLIP: Flow-Centric Generative Planning for General-Purpose Manipulation Tasks☆41Updated 2 months ago
- G3Flow: Generative 3D Semantic Flow for Pose-aware and Generalizable Object Manipulation☆34Updated 2 weeks ago
- ☆61Updated 5 months ago
- [ICRA2023] Grounding Language with Visual Affordances over Unstructured Data☆39Updated last year
- [NeurIPS 2024 D&B] Point Cloud Matters: Rethinking the Impact of Different Observation Spaces on Robot Learning☆63Updated 4 months ago
- List of papers on video-centric robot learning☆14Updated 3 months ago
- Repository for "General Flow as Foundation Affordance for Scalable Robot Learning"☆46Updated 2 months ago
- Code for Stable Control Representations☆23Updated last month
- ☆12Updated 8 months ago
- This is the official repo for [CoRL 2024] Contrastive Imitation Learning for Language-guided Multi-Task Robotic Manipulation☆21Updated 3 months ago
- [ECCV 2024] 🎉 Official repository of "Robo-ABC: Affordance Generalization Beyond Categories via Semantic Correspondence for Robot Manipu…☆71Updated 2 months ago
- ☆21Updated 3 weeks ago
- [CoRL2023] Official PyTorch implementation of PolarNet: 3D Point Clouds for Language-Guided Robotic Manipulation☆32Updated 8 months ago
- code for the paper Predicting Point Tracks from Internet Videos enables Diverse Zero-Shot Manipulation☆77Updated 6 months ago
- Mirage: a zero-shot cross-embodiment policy transfer method. Benchmarking code for cross-embodiment policy transfer.☆19Updated 9 months ago
- Official implementation of "SUGAR: Pre-training 3D Visual Representations for Robotics" (CVPR'24).☆35Updated 7 months ago
- ☆91Updated 6 months ago
- LOCATE: Localize and Transfer Object Parts for Weakly Supervised Affordance Grounding (CVPR 2023)☆34Updated last year
- main augmentation script for real world robot dataset.☆34Updated last year
- FlowBotHD: History-Aware Diffuser Handling Ambiguities in Articulated Objects Manipulation☆11Updated 2 months ago
- Code for the RSS 2023 paper "Energy-based Models are Zero-Shot Planners for Compositional Scene Rearrangement"☆19Updated last year
- Code for "Unleashing Large-Scale Video Generative Pre-training for Visual Robot Manipulation"☆43Updated 10 months ago
- RACER: Rich Language-Guided Failure Recovery Policies for Imitation Learning☆25Updated 4 months ago
- Splat-MOVER: Multi-Stage, Open-Vocabulary Robotic Manipulation via Editable Gaussian Splatting☆27Updated 4 months ago
- ☆43Updated 2 months ago