MARS-EAI / RoboFactory
RoboFactory: Exploring Embodied Agent Collaboration with Compositional Constraints
☆20 Updated this week
Alternatives and similar repositories for RoboFactory
Users who are interested in RoboFactory are comparing it to the repositories listed below:
- ICLR 2025 Agent-Related Papers ☆57 Updated 4 months ago
- [NeurIPSw'24] This repo is the official implementation of "MineDreamer: Learning to Follow Instructions via Chain-of-Imagination for Simu… ☆83 Updated 2 months ago
- Official repo of VLABench, a large scale benchmark designed for fairly evaluating VLA, Embodied Agent, and VLMs. ☆178 Updated this week
- [CVPR 2024] This is the official implementation of MP5 ☆99 Updated 9 months ago
- ☆26 Updated last week
- GRAPE: Guided-Reinforced Vision-Language-Action Preference Optimization ☆98 Updated last week
- A paper list for spatial reasoning ☆51 Updated last month
- Official repository for "iVideoGPT: Interactive VideoGPTs are Scalable World Models" (NeurIPS 2024), https://arxiv.org/abs/2405.15223 ☆121 Updated 3 weeks ago
- MetaSpatial leverages reinforcement learning to enhance 3D spatial reasoning in vision-language models (VLMs), enabling more structured, … ☆62 Updated last week
- ☆67 Updated 6 months ago
- Official repo of EmbodiedBench, a comprehensive benchmark designed to evaluate MLLMs as embodied agents. ☆92 Updated last week
- ☆94 Updated 7 months ago
- HAZARD challenge ☆29 Updated last week
- Latent Motion Token as the Bridging Language for Robot Manipulation ☆77 Updated this week
- HybridVLA: Collaborative Diffusion and Autoregression in a Unified Vision-Language-Action Model ☆107 Updated 2 weeks ago
- Code for FLIP: Flow-Centric Generative Planning for General-Purpose Manipulation Tasks ☆51 Updated 3 months ago
- Everything you need about robotics and AI agents is here ☆28 Updated 11 months ago
- Video-R1: Towards Super Reasoning Ability in Video Understanding MLLMs ☆105 Updated last month
- The repo for the paper `RoboMamba: Multimodal State Space Model for Efficient Robot Reasoning and Manipulation` ☆98 Updated 3 months ago
- [ICLR 2025] LAPA: Latent Action Pretraining from Videos ☆199 Updated 2 months ago
- Code for MultiPLY: A Multisensory Object-Centric Embodied Large Language Model in 3D World ☆127 Updated 5 months ago
- An ML research template with good documentation by Boyuan Chen, an MIT PhD student ☆64 Updated 3 weeks ago
- Fetch citations and abstracts of a Google Scholar paper and generate prompt for LLM ☆21 Updated 4 months ago
- Official code for the paper: Embodied Multi-Modal Agent trained by an LLM from a Parallel TextWorld ☆54 Updated 5 months ago
- Official repository of Learning to Act from Actionless Videos through Dense Correspondences. ☆205 Updated 11 months ago
- ☆20 Updated last month
- Embodied Agent Interface (EAI): Benchmarking LLMs for Embodied Decision Making (NeurIPS D&B 2024 Oral) ☆180 Updated 3 weeks ago
- (VillagerAgent ACL 2024) A graph-based Minecraft multi-agent framework ☆53 Updated 2 months ago
- 🔥 SpatialVLA: a spatial-enhanced vision-language-action model that is trained on 1.1 Million real robot episodes. ☆181 Updated last week
- ☆37 Updated last month