facebookresearch / open-eqaLinks

OpenEQA Embodied Question Answering in the Era of Foundation Models

☆330

Alternatives and similar repositories for open-eqa

Users that are interested in open-eqa are comparing it to the libraries listed below

Sorting:

remyxai / VQASynth
Compose multimodal datasets 🎹
☆503Updated 3 months ago
embodied-generalist / embodied-generalist
[ICML 2024] Official code repository for 3D embodied generalist agent LEO
☆465Updated 7 months ago
dongyh20 / Octopus
[ECCV2024] 🐙Octopus, an embodied vision-language model trained with RLEF, emerging superior in embodied visual planning and programming.
☆292Updated last year
zwq2018 / embodied_reasoner
Embodied-Reasoner: Synergizing Visual Search, Reasoning, and Action for Embodied Interactive Tasks
☆179Updated last month
LostXine / LLaRA
[ICLR'25] LLaRA: Supercharging Robot Learning Data for Vision-Language Policy
☆225Updated 7 months ago
snumprlab / realfred
Official Implementation of ReALFRED (ECCV'24)
☆43Updated last year
AnjieCheng / SpatialRGPT
[NeurIPS'24] This repository is the implementation of "SpatialRGPT: Grounded Spatial Reasoning in Vision Language Models"
☆281Updated 11 months ago
zd11024 / NaviLLM
[CVPR 2024] The code for paper 'Towards Learning a Generalist Model for Embodied Navigation'
☆211Updated last year
UMass-Embodied-AGI / MultiPLY
Code for MultiPLY: A Multisensory Object-Centric Embodied Large Language Model in 3D World
☆134Updated last year
EmbodiedBench / EmbodiedBench
[ICML 2025 Oral] Official repo of EmbodiedBench, a comprehensive benchmark designed to evaluate MLLMs as embodied agents.
☆214Updated last month
Gary3410 / TaPA
[arXiv 2023] Embodied Task Planning with Large Language Models
☆192Updated 2 years ago
vision-x-nyu / thinking-in-space
Official repo and evaluation implementation of VSI-Bench
☆631Updated 3 months ago
SilongYong / SQA3D
[ICLR 2023] SQA3D for embodied scene understanding and reasoning
☆152Updated 2 years ago
IranQin / MP5
[CVPR2024] This is the official implement of MP5
☆106Updated last year
embodiedreasoning / ERQA
Embodied Reasoning Question Answer (ERQA) Benchmark
☆241Updated 8 months ago
thunlp / EmbodiedEval
Evaluate Multimodal LLMs as Embodied Agents
☆54Updated 9 months ago
pointarena / pointarena
☆29Updated 2 months ago
LatentActionPretraining / LAPA
[ICLR 2025] LAPA: Latent Action Pretraining from Videos
☆403Updated 10 months ago
embodied-agent-interface / embodied-agent-interface
Embodied Agent Interface (EAI): Benchmarking LLMs for Embodied Decision Making (NeurIPS D&B 2024 Oral)
☆266Updated 8 months ago
lbaa2022 / LLMTaskPlanning
LoTa-Bench: Benchmarking Language-oriented Task Planners for Embodied Agents (ICLR 2024)
☆82Updated 5 months ago
EmbodiedGPT / EgoCOT_Dataset
☆54Updated last year
BAAI-DCAI / SpatialBot
The official repo for "SpatialBot: Precise Spatial Understanding with Vision Language Models.
☆317Updated 2 months ago
OSU-NLP-Group / LLM-Planner
[ICCV'23] LLM-Planner: Few-Shot Grounded Planning for Embodied Agents with Large Language Models
☆208Updated 7 months ago
OpenGVLab / Instruct2Act
Instruct2Act: Mapping Multi-modality Instructions to Robotic Actions with Large Language Model
☆370Updated last year
WebVLN / WebVLN
Official implementation of WebVLN: Vision-and-Language Navigation on Websites
☆30Updated last year
ChenYi99 / EgoPlan
[IJCV] EgoPlan-Bench: Benchmarking Multimodal Large Language Models for Human-Level Planning
☆74Updated 11 months ago
kyegomez / PALM-E
Implementation of "PaLM-E: An Embodied Multimodal Language Model"
☆330Updated last year
InternRobotics / InternVLA-M1
InternVLA-M1: A Spatially Guided Vision-Language-Action Framework for Generalist Robot Policy
☆269Updated last week
MichalZawalski / embodied-CoT
Embodied Chain of Thought: A robotic policy that reason to solve the task.
☆323Updated 7 months ago
szxiangjn / world-model-for-language-model
☆132Updated last year