RoboDita / Dita
☆30Updated this week
Alternatives and similar repositories for Dita:
Users that are interested in Dita are comparing it to the libraries listed below
- Code for FLIP: Flow-Centric Generative Planning for General-Purpose Manipulation Tasks☆51Updated 3 months ago
- ☆63Updated 5 months ago
- ☆59Updated last week
- code for the paper Predicting Point Tracks from Internet Videos enables Diverse Zero-Shot Manipulation☆80Updated 8 months ago
- ManiCM: Real-time 3D Diffusion Policy via Consistency Model for Robotic Manipulation☆102Updated 8 months ago
- ☆37Updated 4 months ago
- [CoRL2024] Official repo of `A3VLM: Actionable Articulation-Aware Vision Language Model`☆109Updated 5 months ago
- Unified Video Action Model☆128Updated last week
- Official implementation of "Re3Sim: Generating High-Fidelity Simulation Data via 3D-Photorealistic Real-to-Sim for Robotic Manipulation"☆76Updated 2 weeks ago
- ☆49Updated this week
- Repo for Bring Your Own Vision-Language-Action (VLA) model, arxiv 2024☆27Updated 2 months ago
- ☆67Updated 6 months ago
- ManiBox: Enhancing Spatial Grasping Generalization via Scalable Simulation Data Generation☆41Updated this week
- [ECCV 2024] 🎉 Official repository of "Robo-ABC: Affordance Generalization Beyond Categories via Semantic Correspondence for Robot Manipu…☆76Updated 4 months ago
- The official repo for the paper "In-Context Imitation Learning via Next-Token Prediction"☆69Updated 2 weeks ago
- Repository for "General Flow as Foundation Affordance for Scalable Robot Learning"☆47Updated 3 months ago
- Code for "Unleashing Large-Scale Video Generative Pre-training for Visual Robot Manipulation"☆44Updated 11 months ago
- ☆43Updated this week
- Official implementation of DemoGen: Synthetic Demonstration Generation for Data-Efficient Visuomotor Policy Learning☆57Updated last month
- [CVPR 25] G3Flow: Generative 3D Semantic Flow for Pose-aware and Generalizable Object Manipulation☆54Updated 3 weeks ago
- Official implementation of "Towards Generalizable Vision-Language Robotic Manipulation: A Benchmark and LLM-guided 3D Policy."☆63Updated last month
- GRAPE: Guided-Reinforced Vision-Language-Action Preference Optimization☆98Updated last week
- Code for paper "Grounding Video Models to Actions through Goal Conditioned Exploration".☆44Updated 3 months ago
- This is the official repo for [CoRL 2024] Contrastive Imitation Learning for Language-guided Multi-Task Robotic Manipulation☆23Updated 5 months ago
- [ICLR 25] Code for "Efficient Diffusion Transformer Policies with Mixture of Expert Denoisers for Multitask Learning"☆52Updated last month
- MOKA: Open-World Robotic Manipulation through Mark-based Visual Prompting (RSS 2024)☆74Updated 8 months ago
- ☆46Updated 3 months ago
- Streaming Diffusion Policy: Fast Policy Synthesis with Variable Noise Diffusion Models☆52Updated 6 months ago
- Manipulate-Anything: Automating Real-World Robots using Vision-Language Models [CoRL 2024]☆24Updated 3 months ago
- ☆51Updated last month