☆57 · Apr 4, 2024 · Updated last year
Alternatives and similar repositories for EgoCOT_Dataset
Users interested in EgoCOT_Dataset are comparing it to the libraries listed below.
- Implementation of "A Neural Compositional Paradigm for Image Captioning" by B. Dai, S. Fidler, D. Lin (☆12 · Mar 15, 2019 · Updated 6 years ago)
- Code and Dataset for the CVPRW Paper "Where did I leave my keys? — Episodic-Memory-Based Question Answering on Egocentric Videos" (☆29 · Aug 28, 2023 · Updated 2 years ago)
- [ACL 2024] PCA-Bench: Evaluating Multimodal Large Language Models in Perception-Cognition-Action Chain (☆106 · Mar 14, 2024 · Updated last year)
- ☆32 · Feb 8, 2024 · Updated 2 years ago
- Generative Bias for Robust Visual Question Answering (CVPR 2023) (☆28 · Jul 4, 2023 · Updated 2 years ago)
- Code used by the paper "What is the Role of Recurrent Neural Networks (RNNs) in an Image Caption Generator?" (☆14 · Sep 25, 2017 · Updated 8 years ago)
- [CoRL 2024] Official repo of "A3VLM: Actionable Articulation-Aware Vision Language Model" (☆121 · Oct 7, 2024 · Updated last year)
- Code of the ICCV 2023 paper "March in Chat: Interactive Prompting for Remote Embodied Referring Expression" (☆26 · May 22, 2024 · Updated last year)
- ☆36 · Dec 13, 2023 · Updated 2 years ago
- [NeurIPS 2024] MSR3D: Advanced Situated Reasoning in 3D Scenes (☆70 · Dec 2, 2025 · Updated 3 months ago)
- Python library to control the GX11 (Dexterous Hand) and EX12 (Exoskeleton Glove) (☆15 · Aug 30, 2025 · Updated 6 months ago)
- Full code for HiCRISP, covering VirtualHome, the PyBullet simulator, and a real AGV platform (☆15 · Apr 8, 2024 · Updated last year)
- LLaVA combined with the Magvit image tokenizer, training an MLLM without a vision encoder and unifying image understanding and generation (☆39 · Jun 20, 2024 · Updated last year)
- [IROS 2024 Oral] ManipVQA: Injecting Robotic Affordance and Physically Grounded Information into Multi-Modal Large Language Models (☆102 · Aug 22, 2024 · Updated last year)
- OpenEQA: Embodied Question Answering in the Era of Foundation Models (☆341 · Sep 20, 2024 · Updated last year)
- HandsOnVLM: Vision-Language Models for Hand-Object Interaction Prediction (☆41 · Sep 15, 2025 · Updated 5 months ago)
- RobotVQA is a project that develops a Deep Learning-based Cognitive Vision System to support household robots' perception while they perf… (☆18 · Jul 26, 2024 · Updated last year)
- ☆16 · Oct 21, 2024 · Updated last year
- ☆18 · May 14, 2024 · Updated last year
- Repository for the paper "Position-Enhanced Visual Instruction Tuning for Multimodal Large Language Models" (☆37 · Sep 19, 2023 · Updated 2 years ago)
- Sparkles: Unlocking Chats Across Multiple Images for Multimodal Instruction-Following Models (☆45 · Jun 14, 2024 · Updated last year)
- Harnessing 1.4M GPT4V-synthesized Data for A Lite Vision-Language Model (☆281 · Jun 25, 2024 · Updated last year)
- ☆264 · Mar 17, 2024 · Updated last year
- Prompter for Embodied Instruction Following (☆18 · Nov 30, 2023 · Updated 2 years ago)
- An unofficial AI implementation of the DreamGym framework from the paper "Scaling Agent Learning via Experience Synthesis" (arXi… (☆37 · Nov 9, 2025 · Updated 3 months ago)
- ☆21 · Oct 10, 2023 · Updated 2 years ago
- Code release for the paper "Goal Representations for Instruction Following: A Semi-Supervised Language Interface to Control" (☆17 · Apr 9, 2024 · Updated last year)
- [ICCV 2023] ViLLA: Fine-grained vision-language representation learning from real-world data (☆46 · Oct 15, 2023 · Updated 2 years ago)
- ☆24 · May 8, 2024 · Updated last year
- PyTorch code for Improving Commonsense in Vision-Language Models via Knowledge Graph Riddles (DANCE) (☆23 · Nov 29, 2022 · Updated 3 years ago)
- PyTorch implementation of the Hiveformer research paper (☆48 · Jun 27, 2023 · Updated 2 years ago)
- Repository for DialFRED (☆45 · Sep 14, 2023 · Updated 2 years ago)
- Emergent Visual Grounding in Large Multimodal Models Without Grounding Supervision (☆42 · Oct 19, 2025 · Updated 4 months ago)
- [NeurIPS 2022] Egocentric Video-Language Pretraining (☆256 · May 9, 2024 · Updated last year)
- Cooperative Vision-and-Dialog Navigation (☆72 · Nov 22, 2022 · Updated 3 years ago)
- Task planning over 3D scene graphs (☆19 · Jul 8, 2022 · Updated 3 years ago)
- [ICCV 2025 Oral] Latent Motion Token as the Bridging Language for Learning Robot Manipulation from Videos (☆164 · Oct 1, 2025 · Updated 5 months ago)
- [ICRA 2023] Grounding Language with Visual Affordances over Unstructured Data (☆45 · Oct 29, 2023 · Updated 2 years ago)
- Suite of human-collected datasets and a multi-task continuous control benchmark for open-vocabulary visuolinguomotor learning (☆351 · Feb 20, 2026 · Updated last week)