dongyh20 / OctopusLinks
[ECCV2024] πOctopus, an embodied vision-language model trained with RLEF, emerging superior in embodied visual planning and programming.
β289Updated last year
Alternatives and similar repositories for Octopus
Users that are interested in Octopus are comparing it to the libraries listed below
Sorting:
- [ICLR 2024] Source codes for the paper "Building Cooperative Embodied Agents Modularly with Large Language Models"β259Updated 2 months ago
- OpenEQA Embodied Question Answering in the Era of Foundation Modelsβ291Updated 9 months ago
- Open Platform for Embodied Agentsβ321Updated 5 months ago
- Code for "Learning to Model the World with Language." ICML 2024 Oral.β388Updated last year
- [arXiv 2023] Embodied Task Planning with Large Language Modelsβ188Updated last year
- β130Updated 11 months ago
- (ECCV 2024) Code for V-IRL: Grounding Virtual Intelligence in Real Lifeβ355Updated 6 months ago
- [IROS'25 Oral & NeurIPSw'24] Official implementation of "MineDreamer: Learning to Follow Instructions via Chain-of-Imagination for Simulaβ¦β91Updated last week
- Code for MultiPLY: A Multisensory Object-Centric Embodied Large Language Model in 3D Worldβ130Updated 8 months ago
- Embodied-Reasoner: Synergizing Visual Search, Reasoning, and Action for Embodied Interactive Tasksβ134Updated 3 weeks ago
- β44Updated last year
- [CVPR2024] This is the official implement of MP5β102Updated 11 months ago
- [ICML 2024] Official code repository for 3D embodied generalist agent LEOβ443Updated 2 months ago
- Embodied Agent Interface (EAI): Benchmarking LLMs for Embodied Decision Making (NeurIPS D&B 2024 Oral)β209Updated 3 months ago
- β77Updated last year
- Instruct2Act: Mapping Multi-modality Instructions to Robotic Actions with Large Language Modelβ365Updated last year
- Towards Large Multimodal Models as Visual Foundation Agentsβ216Updated 2 months ago
- [ACL 2024] PCA-Bench: Evaluating Multimodal Large Language Models in Perception-Cognition-Action Chainβ105Updated last year
- [ICCV'23] LLM-Planner: Few-Shot Grounded Planning for Embodied Agents with Large Language Modelsβ190Updated 3 months ago
- [NeurIPS 2023 Datasets and Benchmarks Track] LAMM: Multi-Modal Large Language Models and Applications as AI Agentsβ315Updated last year
- Implementation of "Describe, Explain, Plan and Select: Interactive Planning with Large Language Models Enables Open-World Multi-Task Agenβ¦β278Updated last year
- [ICLR 2024 Spotlight] Code for the paper "Text2Reward: Reward Shaping with Language Models for Reinforcement Learning"β166Updated 6 months ago
- β106Updated 2 months ago
- β29Updated 9 months ago
- Official implementation of WebVLN: Vision-and-Language Navigation on Websitesβ28Updated last year
- [ICML 2025 Oral] Official repo of EmbodiedBench, a comprehensive benchmark designed to evaluate MLLMs as embodied agents.β135Updated 2 weeks ago
- Embodied Chain of Thought: A robotic policy that reason to solve the task.β267Updated 2 months ago
- [COLM-2024] List Items One by One: A New Data Source and Learning Paradigm for Multimodal LLMsβ143Updated 10 months ago
- [ICLR 2024 Spotlight] DreamLLM: Synergistic Multimodal Comprehension and Creationβ448Updated 6 months ago
- [NeurIPS 2024] Official Implementation for Optimus-1: Hybrid Multimodal Memory Empowered Agents Excel in Long-Horizon Tasksβ76Updated last week