Embodied Reasoning Question Answer (ERQA) Benchmark
★275 | Mar 12, 2025 | Updated last year
Alternatives and similar repositories for ERQA
Users that are interested in ERQA are comparing it to the libraries listed below.
- [ICLR 2025] This is the official implementation of paper "Robots Pre-Train Robots: Manipulation-Centric Robotic Representation from Lar… ★95 | Jan 22, 2025 | Updated last year
- [ICLR 2025 Oral] Seer: Predictive Inverse Dynamics Models are Scalable Learners for Robotic Manipulation ★303 | Jul 8, 2025 | Updated 11 months ago
- Embodied Chain of Thought: A robotic policy that reasons to solve the task. ★397 | Apr 5, 2025 | Updated last year
- [CVPR 2025] RoboBrain: A Unified Brain Model for Robotic Manipulation from Abstract to Concrete. Official Repository. ★552 | Oct 13, 2025 | Updated 7 months ago
- Official repo of VLABench, a large scale benchmark designed for fairly evaluating VLA, Embodied Agent, and VLMs. ★438 | Nov 11, 2025 | Updated 6 months ago
- ★37 | Dec 13, 2023 | Updated 2 years ago
- ★473 | Apr 14, 2026 | Updated last month
- RoboVerse: Towards a Unified Platform, Dataset and Benchmark for Scalable and Generalizable Robot Learning ★1,750 | Updated this week
- [ICLR 2025] LAPA: Latent Action Pretraining from Videos ★534 | Jan 22, 2025 | Updated last year
- Cosmos-Reason1 models understand the physical common sense and generate appropriate embodied decisions in natural language through long c… ★948 | Updated this week
- A Foundational Vision-Language-Action Model for Synergizing Cognition and Action in Robotic Manipulation ★430 | Oct 30, 2025 | Updated 7 months ago
- A Vision-Language Model for Spatial Affordance Prediction in Robotics ★223 | Jul 17, 2025 | Updated 10 months ago
- [CoRL 24 Oral] D^3Fields: Dynamic 3D Descriptor Fields for Zero-Shot Generalizable Rearrangement ★185 | Nov 2, 2024 | Updated last year
- [ICLR 2026] MetaSpatial leverages reinforcement learning to enhance 3D spatial reasoning in vision-language models (VLMs), enabling more … ★208 | May 5, 2025 | Updated last year
- [World-Model-Survey-2024] Paper list and projects for World Model ★15 | Oct 31, 2024 | Updated last year
- OpenEQA Embodied Question Answering in the Era of Foundation Models ★361 | Sep 20, 2024 | Updated last year
- OpenVLA: An open-source vision-language-action model for robotic manipulation. ★366 | Mar 19, 2025 | Updated last year
- [ICRA 2025] In-Context Imitation Learning via Next-Token Prediction ★116 | Mar 17, 2025 | Updated last year
- RoboCasa: Large-Scale Simulation of Everyday Tasks for Generalist Robots ★1,452 | May 21, 2026 | Updated 2 weeks ago
- OpenVLA: An open-source vision-language-action model for robotic manipulation. ★6,395 | Mar 23, 2025 | Updated last year
- Emma-X: An Embodied Multimodal Action Model with Grounded Chain of Thought and Look-ahead Spatial Reasoning ★83 | May 17, 2025 | Updated last year
- ★62 | Apr 1, 2025 | Updated last year
- ★12,207 | May 5, 2026 | Updated last month
- Re-implementation of pi0 vision-language-action (VLA) model from Physical Intelligence ★1,476 | Jan 31, 2025 | Updated last year
- Benchmarking Knowledge Transfer in Lifelong Robot Learning ★1,907 | Mar 15, 2025 | Updated last year
- ★28 | Aug 6, 2024 | Updated last year
- [RSS 2025] Learning to Act Anywhere with Task-centric Latent Actions ★1,086 | Nov 19, 2025 | Updated 6 months ago
- Official implementation of "Towards Generalizable Vision-Language Robotic Manipulation: A Benchmark and LLM-guided 3D Policy." ★130 | May 26, 2026 | Updated 2 weeks ago
- A Benchmark for Low-Level Manipulation in Home Rearrangement Tasks ★194 | May 9, 2026 | Updated last month
- [ICCV2025 Oral] Latent Motion Token as the Bridging Language for Learning Robot Manipulation from Videos ★177 | Oct 1, 2025 | Updated 8 months ago
- Embodied-Reasoner: Synergizing Visual Search, Reasoning, and Action for Embodied Interactive Tasks ★200 | Apr 9, 2026 | Updated 2 months ago
- Fine-Tuning Vision-Language-Action Models: Optimizing Speed and Success ★1,240 | Sep 9, 2025 | Updated 9 months ago
- [NeurIPS 2025 D&B] VIKI-R: Coordinating Embodied Multi-Agent Cooperation via Reinforcement Learning ★93 | Apr 2, 2026 | Updated 2 months ago
- [ICML 2025 Oral] Official repo of EmbodiedBench, a comprehensive benchmark designed to evaluate MLLMs as embodied agents. ★309 | May 30, 2026 | Updated last week
- HandsOnVLM: Vision-Language Models for Hand-Object Interaction Prediction ★40 | Sep 15, 2025 | Updated 8 months ago
- Code for Equivariant Transporter Network ★23 | Apr 17, 2023 | Updated 3 years ago
- Code for FLIP: Flow-Centric Generative Planning for General-Purpose Manipulation Tasks ★83 | Dec 12, 2024 | Updated last year
- DROID Policy Learning and Evaluation ★287 | Apr 22, 2025 | Updated last year
- [CoRL 24] GenDP: 3D Semantic Fields for Category-Level Generalizable Diffusion Policy ★108 | Oct 24, 2024 | Updated last year