Embodied Reasoning Question Answer (ERQA) Benchmark
☆276 · Mar 12, 2025 · Updated last year
Alternatives and similar repositories for ERQA
Users that are interested in ERQA are comparing it to the libraries listed below.
- [ICLR 2025] This is the official implementation of paper "Robots Pre-Train Robots: Manipulation-Centric Robotic Representation from Lar… ☆98 · Jan 22, 2025 · Updated last year
- [ICLR 2025 Oral] Seer: Predictive Inverse Dynamics Models are Scalable Learners for Robotic Manipulation ☆305 · Jul 8, 2025 · Updated 11 months ago
- Embodied Chain of Thought: A robotic policy that reasons to solve the task. ☆403 · Apr 5, 2025 · Updated last year
- [CVPR 2025] RoboBrain: A Unified Brain Model for Robotic Manipulation from Abstract to Concrete. Official Repository. ☆554 · Oct 13, 2025 · Updated 8 months ago
- Official repo of VLABench, a large scale benchmark designed for fairly evaluating VLA, Embodied Agent, and VLMs. ☆443 · Nov 11, 2025 · Updated 7 months ago
- ☆37 · Dec 13, 2023 · Updated 2 years ago
- ☆475 · Apr 14, 2026 · Updated 2 months ago
- RoboVerse: Towards a Unified Platform, Dataset and Benchmark for Scalable and Generalizable Robot Learning ☆1,756 · Updated this week
- [ICLR 2025] LAPA: Latent Action Pretraining from Videos ☆549 · Jan 22, 2025 · Updated last year
- Cosmos-Reason1 models understand the physical common sense and generate appropriate embodied decisions in natural language through long c… ☆949 · Jun 7, 2026 · Updated 3 weeks ago
- A Vision-Language Model for Spatial Affordance Prediction in Robotics ☆226 · Jul 17, 2025 · Updated 11 months ago
- A Foundational Vision-Language-Action Model for Synergizing Cognition and Action in Robotic Manipulation ☆428 · Oct 30, 2025 · Updated 8 months ago
- [CoRL 24 Oral] D^3Fields: Dynamic 3D Descriptor Fields for Zero-Shot Generalizable Rearrangement ☆185 · Nov 2, 2024 · Updated last year
- [ICLR 2026] MetaSpatial leverages reinforcement learning to enhance 3D spatial reasoning in vision-language models (VLMs), enabling more … ☆209 · May 5, 2025 · Updated last year
- [World-Model-Survey-2024] Paper list and projects for World Model ☆15 · Oct 31, 2024 · Updated last year
- OpenEQA Embodied Question Answering in the Era of Foundation Models ☆365 · Sep 20, 2024 · Updated last year
- OpenVLA: An open-source vision-language-action model for robotic manipulation. ☆368 · Mar 19, 2025 · Updated last year
- [ICRA 2025] In-Context Imitation Learning via Next-Token Prediction ☆119 · Mar 17, 2025 · Updated last year
- RoboCasa: Large-Scale Simulation of Everyday Tasks for Generalist Robots ☆1,497 · Updated this week
- OpenVLA: An open-source vision-language-action model for robotic manipulation. ☆6,509 · Mar 23, 2025 · Updated last year
- Emma-X: An Embodied Multimodal Action Model with Grounded Chain of Thought and Look-ahead Spatial Reasoning ☆84 · May 17, 2025 · Updated last year
- ☆62 · Apr 1, 2025 · Updated last year
- ☆12,522 · Jun 16, 2026 · Updated 2 weeks ago
- Re-implementation of pi0 vision-language-action (VLA) model from Physical Intelligence ☆1,489 · Jan 31, 2025 · Updated last year
- Benchmarking Knowledge Transfer in Lifelong Robot Learning ☆1,983 · Mar 15, 2025 · Updated last year
- ☆28 · Aug 6, 2024 · Updated last year
- [RSS 2025] Learning to Act Anywhere with Task-centric Latent Actions ☆1,096 · Nov 19, 2025 · Updated 7 months ago
- Official implementation of "Towards Generalizable Vision-Language Robotic Manipulation: A Benchmark and LLM-guided 3D Policy." ☆130 · May 26, 2026 · Updated last month
- A Benchmark for Low-Level Manipulation in Home Rearrangement Tasks ☆194 · May 9, 2026 · Updated last month
- [ICCV2025 Oral] Latent Motion Token as the Bridging Language for Learning Robot Manipulation from Videos ☆180 · Oct 1, 2025 · Updated 9 months ago
- Embodied-Reasoner: Synergizing Visual Search, Reasoning, and Action for Embodied Interactive Tasks ☆201 · Apr 9, 2026 · Updated 2 months ago
- Fine-Tuning Vision-Language-Action Models: Optimizing Speed and Success ☆1,270 · Sep 9, 2025 · Updated 9 months ago
- [NeurIPS 2025 D&B] VIKI-R: Coordinating Embodied Multi-Agent Cooperation via Reinforcement Learning ☆97 · Apr 2, 2026 · Updated 2 months ago
- [ICML 2025 Oral] Official repo of EmbodiedBench, a comprehensive benchmark designed to evaluate MLLMs as embodied agents. ☆313 · May 30, 2026 · Updated last month
- HandsOnVLM: Vision-Language Models for Hand-Object Interaction Prediction ☆41 · Sep 15, 2025 · Updated 9 months ago
- Code for Equivariant Transporter Network ☆23 · Apr 17, 2023 · Updated 3 years ago
- Code for FLIP: Flow-Centric Generative Planning for General-Purpose Manipulation Tasks ☆83 · Dec 12, 2024 · Updated last year
- DROID Policy Learning and Evaluation ☆289 · Apr 22, 2025 · Updated last year
- [CoRL 24] GenDP: 3D Semantic Fields for Category-Level Generalizable Diffusion Policy ☆109 · Oct 24, 2024 · Updated last year