Embodied Reasoning Question Answer (ERQA) Benchmark
★266 · Mar 12, 2025 · Updated last year
Alternatives and similar repositories for ERQA
Users that are interested in ERQA are comparing it to the libraries listed below.
- [ICLR 2025] This is the official implementation of paper "Robots Pre-Train Robots: Manipulation-Centric Robotic Representation from Lar… ★93 · Jan 22, 2025 · Updated last year
- [ICLR 2025 Oral] Seer: Predictive Inverse Dynamics Models are Scalable Learners for Robotic Manipulation ★290 · Jul 8, 2025 · Updated 9 months ago
- [CVPR 2025] RoboBrain: A Unified Brain Model for Robotic Manipulation from Abstract to Concrete. Official Repository. ★380 · Oct 13, 2025 · Updated 5 months ago
- Embodied Chain of Thought: A robotic policy that reasons to solve the task. ★381 · Apr 5, 2025 · Updated last year
- Official repo of VLABench, a large scale benchmark designed for fairly evaluating VLA, Embodied Agent, and VLMs. ★411 · Nov 11, 2025 · Updated 5 months ago
- ★458 · Nov 29, 2025 · Updated 4 months ago
- ★35 · Dec 13, 2023 · Updated 2 years ago
- RoboVerse: Towards a Unified Platform, Dataset and Benchmark for Scalable and Generalizable Robot Learning ★1,701 · Updated this week
- [ICLR 2025] LAPA: Latent Action Pretraining from Videos ★502 · Jan 22, 2025 · Updated last year
- Cosmos-Reason1 models understand the physical common sense and generate appropriate embodied decisions in natural language through long c… ★931 · Jan 6, 2026 · Updated 3 months ago
- A Foundational Vision-Language-Action Model for Synergizing Cognition and Action in Robotic Manipulation ★414 · Oct 30, 2025 · Updated 5 months ago
- A Vision-Language Model for Spatial Affordance Prediction in Robotics ★218 · Jul 17, 2025 · Updated 8 months ago
- [CoRL 24 Oral] D^3Fields: Dynamic 3D Descriptor Fields for Zero-Shot Generalizable Rearrangement ★183 · Nov 2, 2024 · Updated last year
- MetaSpatial leverages reinforcement learning to enhance 3D spatial reasoning in vision-language models (VLMs), enabling more structured, … ★206 · May 5, 2025 · Updated 11 months ago
- [World-Model-Survey-2024] Paper list and projects for World Model ★15 · Oct 31, 2024 · Updated last year
- OpenEQA Embodied Question Answering in the Era of Foundation Models ★346 · Sep 20, 2024 · Updated last year
- OpenVLA: An open-source vision-language-action model for robotic manipulation. ★353 · Mar 19, 2025 · Updated last year
- RoboCasa: Large-Scale Simulation of Everyday Tasks for Generalist Robots ★1,313 · Mar 18, 2026 · Updated 3 weeks ago
- OpenVLA: An open-source vision-language-action model for robotic manipulation. ★5,784 · Mar 23, 2025 · Updated last year
- [ICRA 2025] In-Context Imitation Learning via Next-Token Prediction ★111 · Mar 17, 2025 · Updated last year
- ★11,178 · Mar 29, 2026 · Updated last week
- Benchmarking Knowledge Transfer in Lifelong Robot Learning ★1,692 · Mar 15, 2025 · Updated last year
- Emma-X: An Embodied Multimodal Action Model with Grounded Chain of Thought and Look-ahead Spatial Reasoning ★79 · May 17, 2025 · Updated 10 months ago
- ★62 · Apr 1, 2025 · Updated last year
- Re-implementation of pi0 vision-language-action (VLA) model from Physical Intelligence ★1,443 · Jan 31, 2025 · Updated last year
- ★28 · Aug 6, 2024 · Updated last year
- [RSS 2025] Learning to Act Anywhere with Task-centric Latent Actions ★1,038 · Nov 19, 2025 · Updated 4 months ago
- Official implementation of "Towards Generalizable Vision-Language Robotic Manipulation: A Benchmark and LLM-guided 3D Policy." ★126 · Oct 23, 2025 · Updated 5 months ago
- A Benchmark for Low-Level Manipulation in Home Rearrangement Tasks ★185 · Dec 15, 2025 · Updated 3 months ago
- [ICCV2025 Oral] Latent Motion Token as the Bridging Language for Learning Robot Manipulation from Videos ★170 · Oct 1, 2025 · Updated 6 months ago
- Embodied-Reasoner: Synergizing Visual Search, Reasoning, and Action for Embodied Interactive Tasks ★192 · Updated this week
- [ICML 2025 Oral] Official repo of EmbodiedBench, a comprehensive benchmark designed to evaluate MLLMs as embodied agents. ★282 · Updated this week
- Fine-Tuning Vision-Language-Action Models: Optimizing Speed and Success ★1,132 · Sep 9, 2025 · Updated 7 months ago
- [NeurIPS 2025 D&B] VIKI-R: Coordinating Embodied Multi-Agent Cooperation via Reinforcement Learning ★84 · Apr 2, 2026 · Updated last week
- HandsOnVLM: Vision-Language Models for Hand-Object Interaction Prediction ★41 · Sep 15, 2025 · Updated 6 months ago
- Code for Equivariant Transporter Network ★23 · Apr 17, 2023 · Updated 2 years ago
- Code for FLIP: Flow-Centric Generative Planning for General-Purpose Manipulation Tasks ★82 · Dec 12, 2024 · Updated last year
- DROID Policy Learning and Evaluation ★276 · Apr 22, 2025 · Updated 11 months ago
- [CoRL 24] GenDP: 3D Semantic Fields for Category-Level Generalizable Diffusion Policy ★107 · Oct 24, 2024 · Updated last year