dongyh20 / OctopusLinks

[ECCV2024] 🐙Octopus, an embodied vision-language model trained with RLEF, emerging superior in embodied visual planning and programming.

☆292

Alternatives and similar repositories for Octopus

Users that are interested in Octopus are comparing it to the libraries listed below

Sorting:

VIRL-Platform / VIRL
(ECCV 2024) Code for V-IRL: Grounding Virtual Intelligence in Real Life
☆360Updated 10 months ago
thunlp / LEGENT
Open Platform for Embodied Agents
☆329Updated 8 months ago
UMass-Embodied-AGI / CoELA
[ICLR 2024] Source codes for the paper "Building Cooperative Embodied Agents Modularly with Large Language Models"
☆278Updated 6 months ago
facebookresearch / open-eqa
OpenEQA Embodied Question Answering in the Era of Foundation Models
☆319Updated last year
Zhoues / MineDreamer
[IROS'25 Oral & NeurIPSw'24] Official implementation of "MineDreamer: Learning to Follow Instructions via Chain-of-Imagination for Simula…
☆95Updated 3 months ago
jlin816 / dynalang
Code for "Learning to Model the World with Language." ICML 2024 Oral.
☆396Updated 2 years ago
IranQin / MP5
[CVPR2024] This is the official implement of MP5
☆104Updated last year
PKU-RL / Creative-Agents
☆45Updated last year
DigiRL-agent / digiq
☆112Updated 6 months ago
OpenGVLab / Instruct2Act
Instruct2Act: Mapping Multi-modality Instructions to Robotic Actions with Large Language Model
☆370Updated last year
WebVLN / WebVLN
Official implementation of WebVLN: Vision-and-Language Navigation on Websites
☆29Updated last year
UMass-Embodied-AGI / MultiPLY
Code for MultiPLY: A Multisensory Object-Centric Embodied Large Language Model in 3D World
☆133Updated 11 months ago
szxiangjn / world-model-for-language-model
☆131Updated last year
zwq2018 / embodied_reasoner
Embodied-Reasoner: Synergizing Visual Search, Reasoning, and Action for Embodied Interactive Tasks
☆172Updated 2 weeks ago
CraftJarvis / JarvisVLA
Official Implementation of "JARVIS-VLA: Post-Training Large-Scale Vision Language Models to Play Visual Games with Keyboards and Mouse"
☆100Updated last month
Gary3410 / TaPA
[arXiv 2023] Embodied Task Planning with Large Language Models
☆192Updated 2 years ago
embodied-generalist / embodied-generalist
[ICML 2024] Official code repository for 3D embodied generalist agent LEO
☆464Updated 5 months ago
mindagent / mindagent
☆95Updated last year
OSU-NLP-Group / LLM-Planner
[ICCV'23] LLM-Planner: Few-Shot Grounded Planning for Embodied Agents with Large Language Models
☆202Updated 6 months ago
maitrix-org / Pandora
Pandora: Towards General World Model with Natural Language Actions and Video States
☆523Updated last year
pkunlp-icler / PCA-EVAL
[ACL 2024] PCA-Bench: Evaluating Multimodal Large Language Models in Perception-Cognition-Action Chain
☆103Updated last year
zzxslp / MM-Navigator
GPT-4V in Wonderland: LMMs as Smartphone Agents
☆135Updated last year
CraftJarvis / GROOT
GROOT: Learning to Follow Instructions by Watching Gameplay Videos (ICLR 2024 Spotlight)
☆66Updated last year
RunpeiDong / DreamLLM
[ICLR 2024 Spotlight] DreamLLM: Synergistic Multimodal Comprehension and Creation
☆459Updated 10 months ago
apple / ml-llarp
☆84Updated last year
THUDM / VisualAgentBench
Towards Large Multimodal Models as Visual Foundation Agents
☆238Updated 5 months ago
RL4VLM / RL4VLM
Official Repo for Fine-Tuning Large Vision-Language Models as Decision-Making Agents via Reinforcement Learning
☆392Updated 9 months ago
EmbodiedBench / EmbodiedBench
[ICML 2025 Oral] Official repo of EmbodiedBench, a comprehensive benchmark designed to evaluate MLLMs as embodied agents.
☆190Updated 2 months ago
embodied-agent-interface / embodied-agent-interface
Embodied Agent Interface (EAI): Benchmarking LLMs for Embodied Decision Making (NeurIPS D&B 2024 Oral)
☆258Updated 7 months ago
kyegomez / PALM-E
Implementation of "PaLM-E: An Embodied Multimodal Language Model"
☆325Updated last year