TencentARC / Moto
Latent Motion Token as the Bridging Language for Robot Manipulation
☆65Updated last month
Alternatives and similar repositories for Moto:
Users who are interested in Moto are comparing it to the repositories listed below.
- ☆42Updated last month
- Official repository for "iVideoGPT: Interactive VideoGPTs are Scalable World Models" (NeurIPS 2024), https://arxiv.org/abs/2405.15223 ☆96Updated last week
- ☆86Updated 5 months ago
- AnyBimanual: Transferring Unimanual Policy for General Bimanual Manipulation☆58Updated 2 weeks ago
- ☆56Updated 4 months ago
- Repository for "General Flow as Foundation Affordance for Scalable Robot Learning"☆45Updated 3 weeks ago
- Code for paper "Grounding Video Models to Actions through Goal Conditioned Exploration".☆37Updated 3 weeks ago
- ☆47Updated 3 weeks ago
- LAPA: Latent Action Pretraining from Videos☆136Updated 3 weeks ago
- Official code of paper "DeeR-VLA: Dynamic Inference of Multimodal Large Language Models for Efficient Robot Execution"☆54Updated 2 months ago
- Official implementation of GR-MG☆66Updated this week
- Dreamitate: Real-World Visuomotor Policy Learning via Video Generation (CoRL 2024)☆42Updated 6 months ago
- ☆65Updated last month
- ManiCM: Real-time 3D Diffusion Policy via Consistency Model for Robotic Manipulation☆89Updated 6 months ago
- EgoVid-5M: A Large-Scale Video-Action Dataset for Egocentric Video Generation☆90Updated 2 months ago
- [RSS 2024] Code for "Multimodal Diffusion Transformer: Learning Versatile Behavior from Multimodal Goals" for CALVIN experiments with pre…☆84Updated 3 months ago
- Code for FLIP: Flow-Centric Generative Planning for General-Purpose Manipulation Tasks☆36Updated last month
- List of papers on video-centric robot learning☆12Updated 2 months ago
- Code for MultiPLY: A Multisensory Object-Centric Embodied Large Language Model in 3D World☆124Updated 2 months ago
- An official code repository for the paper "Predictive Inverse Dynamics Models are Scalable Learners for Robotic Manipulation"☆57Updated 2 weeks ago
- GRAPE: Guided-Reinforced Vision-Language-Action Preference Optimization☆64Updated last month
- MOKA: Open-World Robotic Manipulation through Mark-based Visual Prompting (RSS 2024)☆64Updated 6 months ago
- Official repository of Learning to Act from Actionless Videos through Dense Correspondences.☆191Updated 8 months ago
- Official Implementation of CAPEAM (ICCV'23)☆11Updated last month
- Affordance Grounding from Demonstration Video to Target Image (CVPR 2023)☆42Updated 5 months ago
- [NeurIPS 2024] Official code repository for MSR3D paper☆30Updated 3 weeks ago
- ☆43Updated 9 months ago
- ☆56Updated 2 weeks ago
- Official repo of VLABench, a large scale benchmark designed for fairly evaluating VLA, Embodied Agent, and VLMs.☆86Updated last week
- RACER: Rich Language-Guided Failure Recovery Policies for Imitation Learning☆20Updated 3 months ago