Embodied Reasoning Question Answer (ERQA) Benchmark
β273Mar 12, 2025Updated last year
Alternatives and similar repositories for ERQA
Users that are interested in ERQA are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [ICLR 2025π] This is the official implementation of paper "Robots Pre-Train Robots: Manipulation-Centric Robotic Representation from Larβ¦β95Jan 22, 2025Updated last year
- [ICLR 2025 Oral] Seer: Predictive Inverse Dynamics Models are Scalable Learners for Robotic Manipulationβ299Jul 8, 2025Updated 10 months ago
- [CVPR 2025] RoboBrain: A Unified Brain Model for Robotic Manipulation from Abstract to Concrete. Official Repository.β391Oct 13, 2025Updated 7 months ago
- Embodied Chain of Thought: A robotic policy that reason to solve the task.β394Apr 5, 2025Updated last year
- Official repo of VLABench, a large scale benchmark designed for fairly evaluating VLA, Embodied Agent, and VLMs.β431Nov 11, 2025Updated 6 months ago
- End-to-end encrypted email - Proton Mail β’ AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- β36Dec 13, 2023Updated 2 years ago
- β471Apr 14, 2026Updated last month
- RoboVerse: Towards a Unified Platform, Dataset and Benchmark for Scalable and Generalizable Robot Learningβ1,739Updated this week
- [ICLR 2025] LAPA: Latent Action Pretraining from Videosβ523Jan 22, 2025Updated last year
- Cosmos-Reason1 models understand the physical common sense and generate appropriate embodied decisions in natural language through long cβ¦β945Jan 6, 2026Updated 4 months ago
- A Vision-Language Model for Spatial Affordance Prediction in Roboticsβ221Jul 17, 2025Updated 10 months ago
- A Foundational Vision-Language-Action Model for Synergizing Cognition and Action in Robotic Manipulationβ423Oct 30, 2025Updated 6 months ago
- [CoRL 24 Oral] D^3Fields: Dynamic 3D Descriptor Fields for Zero-Shot Generalizable Rearrangementβ183Nov 2, 2024Updated last year
- [ICLR 2026] MetaSpatial leverages reinforcement learning to enhance 3D spatial reasoning in vision-language models (VLMs), enabling more β¦β207May 5, 2025Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer β’ AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- [World-Model-Survey-2024] Paper list and projects for World Modelβ15Oct 31, 2024Updated last year
- OpenEQA Embodied Question Answering in the Era of Foundation Modelsβ355Sep 20, 2024Updated last year
- OpenVLA: An open-source vision-language-action model for robotic manipulation.β362Mar 19, 2025Updated last year
- [ICRA 2025] In-Context Imitation Learning via Next-Token Predictionβ116Mar 17, 2025Updated last year
- RoboCasa: Large-Scale Simulation of Everyday Tasks for Generalist Robotsβ1,400May 12, 2026Updated last week
- OpenVLA: An open-source vision-language-action model for robotic manipulation.β6,176Mar 23, 2025Updated last year
- Emma-X: An Embodied Multimodal Action Model with Grounded Chain of Thought and Look-ahead Spatial Reasoningβ80May 17, 2025Updated last year
- β11,829May 5, 2026Updated 2 weeks ago
- β63Apr 1, 2025Updated last year
- Simple, predictable pricing with DigitalOcean hosting β’ AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Benchmarking Knowledge Transfer in Lifelong Robot Learningβ1,829Mar 15, 2025Updated last year
- Re-implementation of pi0 vision-language-action (VLA) model from Physical Intelligenceβ1,456Jan 31, 2025Updated last year
- β28Aug 6, 2024Updated last year
- [RSS 2025] Learning to Act Anywhere with Task-centric Latent Actionsβ1,078Nov 19, 2025Updated 6 months ago
- Official implementation of "Towards Generalizable Vision-Language Robotic Manipulation: A Benchmark and LLM-guided 3D Policy."β130Oct 23, 2025Updated 6 months ago
- A Benchmark for Low-Level Manipulation in Home Rearrangement Tasksβ190May 9, 2026Updated last week
- [ICCV2025 Oral] Latent Motion Token as the Bridging Language for Learning Robot Manipulation from Videosβ175Oct 1, 2025Updated 7 months ago
- Embodied-Reasoner: Synergizing Visual Search, Reasoning, and Action for Embodied Interactive Tasksβ196Apr 9, 2026Updated last month
- Fine-Tuning Vision-Language-Action Models: Optimizing Speed and Successβ1,193Sep 9, 2025Updated 8 months ago
- End-to-end encrypted email - Proton Mail β’ AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- [NeurIPS 2025 D&B] VIKIβR: Coordinating Embodied Multi-Agent Cooperation via Reinforcement Learningβ90Apr 2, 2026Updated last month
- [ICML 2025 Oral] Official repo of EmbodiedBench, a comprehensive benchmark designed to evaluate MLLMs as embodied agents.β302Apr 28, 2026Updated 3 weeks ago
- HandsOnVLM: Vision-Language Models for Hand-Object Interaction Predictionβ41Sep 15, 2025Updated 8 months ago
- Code for Equivariant Transporter Networkβ23Apr 17, 2023Updated 3 years ago
- Code for FLIP: Flow-Centric Generative Planning for General-Purpose Manipulation Tasksβ83Dec 12, 2024Updated last year
- DROID Policy Learning and Evaluationβ284Apr 22, 2025Updated last year
- [CoRL 24] GenDP: 3D Semantic Fields for Category-Level Generalizable Diffusion Policyβ107Oct 24, 2024Updated last year