bdaiinstitute / theiaLinks
Theia: Distilling Diverse Vision Foundation Models for Robot Learning
☆245Updated 4 months ago
Alternatives and similar repositories for theia
Users that are interested in theia are comparing it to the libraries listed below
Sorting:
- [ICML 2025] OTTER: A Vision-Language-Action Model with Text-Aware Visual Feature Extraction☆96Updated 3 months ago
- [ICLR'25] LLaRA: Supercharging Robot Learning Data for Vision-Language Policy☆221Updated 4 months ago
- A Vision-Language Model for Spatial Affordance Prediction in Robotics☆179Updated 3 weeks ago
- OpenVLA: An open-source vision-language-action model for robotic manipulation.☆231Updated 4 months ago
- Official Repository of SAM2Act☆109Updated last month
- ☆228Updated 4 months ago
- Official PyTorch Implementation of Unified Video Action Model (RSS 2025)☆255Updated 2 weeks ago
- [ICLR 2025] LAPA: Latent Action Pretraining from Videos☆341Updated 6 months ago
- Code for subgoal synthesis via image editing☆142Updated last year
- ☆53Updated 7 months ago
- Embodied Reasoning Question Answer (ERQA) Benchmark☆197Updated 4 months ago
- ☆253Updated 11 months ago
- ☆203Updated last year
- [CoRL2024] Official repo of `A3VLM: Actionable Articulation-Aware Vision Language Model`☆115Updated 10 months ago
- Code for "Unleashing Large-Scale Video Generative Pre-training for Visual Robot Manipulation"☆271Updated last year
- Official codebase for "Any-point Trajectory Modeling for Policy Learning"☆240Updated last month
- Official repository of Learning to Act from Actionless Videos through Dense Correspondences.☆225Updated last year
- A Foundational Vision-Language-Action Model for Synergizing Cognition and Action in Robotic Manipulation☆311Updated 2 months ago
- ☆120Updated 2 years ago
- [RSS 2024] Code for "Multimodal Diffusion Transformer: Learning Versatile Behavior from Multimodal Goals" for CALVIN experiments with pre…☆150Updated 9 months ago
- Official Reporsitory of "RoboEngine: Plug-and-Play Robot Data Augmentation with Semantic Robot Segmentation and Background Generation"☆119Updated 2 months ago
- Distributed Robot Interaction Dataset.☆230Updated 5 months ago
- Unfied World Models: Coupling Video and Action Diffusion for Pretraining on Large Robotic Datasets☆106Updated last week
- DROID Policy Learning and Evaluation☆216Updated 3 months ago
- Nvidia GEAR Lab's initiative to solve the robotics data problem using world models☆247Updated last month
- ICCV2025☆112Updated this week
- F3RM: Feature Fields for Robotic Manipulation. Official repo for the paper "Distilled Feature Fields Enable Few-Shot Language-Guided Mani…☆207Updated last year
- MOKA: Open-World Robotic Manipulation through Mark-based Visual Prompting (RSS 2024)☆85Updated last year
- Autoregressive Policy for Robot Learning (RA-L 2025)☆132Updated 4 months ago
- Official implementation of "Data Scaling Laws in Imitation Learning for Robotic Manipulation"☆181Updated 8 months ago