robomonkey-vla / RoboMonkey
☆23 · Updated last month
Alternatives and similar repositories for RoboMonkey
Users who are interested in RoboMonkey are comparing it to the repositories listed below.
- Code for FLIP: Flow-Centric Generative Planning for General-Purpose Manipulation Tasks ☆80 · Updated last year
- HandsOnVLM: Vision-Language Models for Hand-Object Interaction Prediction ☆42 · Updated 2 months ago
- Repository for "General Flow as Foundation Affordance for Scalable Robot Learning" ☆67 · Updated 11 months ago
- Dreamitate: Real-World Visuomotor Policy Learning via Video Generation (CoRL 2024) ☆57 · Updated 6 months ago
- [ICLR 2025 Spotlight] Grounding Video Models to Actions through Goal Conditioned Exploration ☆57 · Updated 7 months ago
- ☆60 · Updated 11 months ago
- ☆88 · Updated last year
- ☆135 · Updated 5 months ago
- [ICML 2025] OTTER: A Vision-Language-Action Model with Text-Aware Visual Feature Extraction ☆111 · Updated 7 months ago
- ☆56 · Updated 3 months ago
- Code for the paper "Predicting Point Tracks from Internet Videos enables Diverse Zero-Shot Manipulation" ☆99 · Updated last year
- ☆51 · Updated 7 months ago
- [ICRA 2025] RACER: Rich Language-Guided Failure Recovery Policies for Imitation Learning ☆40 · Updated last year
- ☆41 · Updated 5 months ago
- [ICCV 2025] AnyBimanual: Transferring Unimanual Policy for General Bimanual Manipulation ☆93 · Updated 5 months ago
- [ICCV 2025 Oral] Latent Motion Token as the Bridging Language for Learning Robot Manipulation from Videos ☆151 · Updated 2 months ago
- List of papers on video-centric robot learning ☆22 · Updated last year
- One-Shot Open Affordance Learning with Foundation Models (CVPR 2024) ☆45 · Updated last year
- [ICRA 2023] Grounding Language with Visual Affordances over Unstructured Data ☆45 · Updated 2 years ago
- Learning Real-World Action-Video Dynamics with Heterogeneous Masked Autoregression ☆43 · Updated 9 months ago
- [CVPR 2024] Binding Touch to Everything: Learning Unified Multimodal Tactile Representations ☆72 · Updated 3 weeks ago
- [ICCV 2025] 2D version of Dense Policy ☆31 · Updated 4 months ago
- Code & data for "RoboGround: Robotic Manipulation with Grounded Vision-Language Priors" (CVPR 2025) ☆32 · Updated 6 months ago
- Code repository for ControlVLA (CoRL 2025) ☆77 · Updated last month
- (CVPR 2025) A Data-Centric Revisit of Pre-Trained Vision Models for Robot Learning ☆22 · Updated 9 months ago
- [RA-L 2025] Motion Before Action: Diffusing Object Motion as Manipulation Condition ☆66 · Updated last month
- ☆34 · Updated last year
- Main augmentation script for a real-world robot dataset ☆36 · Updated 2 years ago
- ☆26 · Updated last year
- ☆38 · Updated 4 months ago