Embodied Reasoning Question Answer (ERQA) Benchmark
☆263 · Mar 12, 2025 · Updated last year
Alternatives and similar repositories for ERQA
Users who are interested in ERQA are comparing it to the repositories listed below.
- [ICLR 2025] This is the official implementation of paper "Robots Pre-Train Robots: Manipulation-Centric Robotic Representation from Lar… ☆93 · Jan 22, 2025 · Updated last year
- [ICLR 2025 Oral] Seer: Predictive Inverse Dynamics Models are Scalable Learners for Robotic Manipulation ☆283 · Jul 8, 2025 · Updated 8 months ago
- [CVPR 2025] RoboBrain: A Unified Brain Model for Robotic Manipulation from Abstract to Concrete. Official Repository. ☆376 · Oct 13, 2025 · Updated 5 months ago
- Embodied Chain of Thought: A robotic policy that reason to solve the task. ☆374 · Apr 5, 2025 · Updated 11 months ago
- Official repo of VLABench, a large scale benchmark designed for fairly evaluating VLA, Embodied Agent, and VLMs. ☆405 · Nov 11, 2025 · Updated 4 months ago
- ☆446 · Nov 29, 2025 · Updated 3 months ago
- ☆36 · Dec 13, 2023 · Updated 2 years ago
- RoboVerse: Towards a Unified Platform, Dataset and Benchmark for Scalable and Generalizable Robot Learning ☆1,683 · Mar 9, 2026 · Updated last week
- [ICLR 2025] LAPA: Latent Action Pretraining from Videos ☆485 · Jan 22, 2025 · Updated last year
- Cosmos-Reason1 models understand the physical common sense and generate appropriate embodied decisions in natural language through long c… ☆927 · Jan 6, 2026 · Updated 2 months ago
- A Foundational Vision-Language-Action Model for Synergizing Cognition and Action in Robotic Manipulation ☆406 · Oct 30, 2025 · Updated 4 months ago
- [CoRL 24 Oral] D^3Fields: Dynamic 3D Descriptor Fields for Zero-Shot Generalizable Rearrangement ☆181 · Nov 2, 2024 · Updated last year
- A Vision-Language Model for Spatial Affordance Prediction in Robotics ☆214 · Jul 17, 2025 · Updated 8 months ago
- MetaSpatial leverages reinforcement learning to enhance 3D spatial reasoning in vision-language models (VLMs), enabling more structured, … ☆205 · May 5, 2025 · Updated 10 months ago
- [World-Model-Survey-2024] Paper list and projects for World Model ☆15 · Oct 31, 2024 · Updated last year
- OpenVLA: An open-source vision-language-action model for robotic manipulation. ☆350 · Mar 19, 2025 · Updated last year
- OpenEQA Embodied Question Answering in the Era of Foundation Models ☆343 · Sep 20, 2024 · Updated last year
- RoboCasa: Large-Scale Simulation of Everyday Tasks for Generalist Robots ☆1,251 · Mar 12, 2026 · Updated last week
- [ICRA 2025] In-Context Imitation Learning via Next-Token Prediction ☆109 · Mar 17, 2025 · Updated last year
- OpenVLA: An open-source vision-language-action model for robotic manipulation. ☆5,542 · Mar 23, 2025 · Updated 11 months ago
- ☆10,755 · Updated this week
- Benchmarking Knowledge Transfer in Lifelong Robot Learning ☆1,611 · Mar 15, 2025 · Updated last year
- Emma-X: An Embodied Multimodal Action Model with Grounded Chain of Thought and Look-ahead Spatial Reasoning ☆79 · May 17, 2025 · Updated 10 months ago
- ☆62 · Apr 1, 2025 · Updated 11 months ago
- Re-implementation of pi0 vision-language-action (VLA) model from Physical Intelligence ☆1,424 · Jan 31, 2025 · Updated last year
- [RSS 2025] Learning to Act Anywhere with Task-centric Latent Actions ☆1,023 · Nov 19, 2025 · Updated 4 months ago
- ☆28 · Aug 6, 2024 · Updated last year
- Official implementation of "Towards Generalizable Vision-Language Robotic Manipulation: A Benchmark and LLM-guided 3D Policy." ☆125 · Oct 23, 2025 · Updated 4 months ago
- [NeurIPS 2025] VIKI-R: Coordinating Embodied Multi-Agent Cooperation via Reinforcement Learning ☆79 · Mar 11, 2026 · Updated last week
- A Benchmark for Low-Level Manipulation in Home Rearrangement Tasks ☆181 · Dec 15, 2025 · Updated 3 months ago
- [ICCV2025 Oral] Latent Motion Token as the Bridging Language for Learning Robot Manipulation from Videos ☆166 · Oct 1, 2025 · Updated 5 months ago
- Embodied-Reasoner: Synergizing Visual Search, Reasoning, and Action for Embodied Interactive Tasks ☆191 · Sep 24, 2025 · Updated 5 months ago
- Fine-Tuning Vision-Language-Action Models: Optimizing Speed and Success ☆1,094 · Sep 9, 2025 · Updated 6 months ago
- [ICML 2025 Oral] Official repo of EmbodiedBench, a comprehensive benchmark designed to evaluate MLLMs as embodied agents. ☆273 · Feb 20, 2026 · Updated last month
- Code for FLIP: Flow-Centric Generative Planning for General-Purpose Manipulation Tasks ☆79 · Dec 12, 2024 · Updated last year
- HandsOnVLM: Vision-Language Models for Hand-Object Interaction Prediction ☆41 · Sep 15, 2025 · Updated 6 months ago
- Code for Equivariant Transporter Network ☆23 · Apr 17, 2023 · Updated 2 years ago
- DROID Policy Learning and Evaluation ☆270 · Apr 22, 2025 · Updated 10 months ago
- [CoRL 24] GenDP: 3D Semantic Fields for Category-Level Generalizable Diffusion Policy ☆107 · Oct 24, 2024 · Updated last year