dongyh20 / OctopusLinks
[ECCV2024] πOctopus, an embodied vision-language model trained with RLEF, emerging superior in embodied visual planning and programming.
β293Updated last year
Alternatives and similar repositories for Octopus
Users that are interested in Octopus are comparing it to the libraries listed below
Sorting:
- Open Platform for Embodied Agentsβ328Updated 8 months ago
- OpenEQA Embodied Question Answering in the Era of Foundation Modelsβ316Updated last year
- (ECCV 2024) Code for V-IRL: Grounding Virtual Intelligence in Real Lifeβ360Updated 9 months ago
- [ICLR 2024] Source codes for the paper "Building Cooperative Embodied Agents Modularly with Large Language Models"β271Updated 5 months ago
- [IROS'25 Oral & NeurIPSw'24] Official implementation of "MineDreamer: Learning to Follow Instructions via Chain-of-Imagination for Simulaβ¦β95Updated 3 months ago
- Code for "Learning to Model the World with Language." ICML 2024 Oral.β392Updated 2 years ago
- [CVPR2024] This is the official implement of MP5β103Updated last year
- β44Updated last year
- [arXiv 2023] Embodied Task Planning with Large Language Modelsβ190Updated 2 years ago
- β112Updated 5 months ago
- [ICML 2024] Official code repository for 3D embodied generalist agent LEOβ460Updated 5 months ago
- Code for MultiPLY: A Multisensory Object-Centric Embodied Large Language Model in 3D Worldβ131Updated 10 months ago
- Instruct2Act: Mapping Multi-modality Instructions to Robotic Actions with Large Language Modelβ369Updated last year
- β130Updated last year
- Embodied-Reasoner: Synergizing Visual Search, Reasoning, and Action for Embodied Interactive Tasksβ168Updated 3 months ago
- Implementation of "PaLM-E: An Embodied Multimodal Language Model"β323Updated last year
- [NeurIPS 2024] Official Implementation for Optimus-1: Hybrid Multimodal Memory Empowered Agents Excel in Long-Horizon Tasksβ84Updated 3 months ago
- [ICML 2025 Oral] Official repo of EmbodiedBench, a comprehensive benchmark designed to evaluate MLLMs as embodied agents.β185Updated 2 months ago
- Embodied Agent Interface (EAI): Benchmarking LLMs for Embodied Decision Making (NeurIPS D&B 2024 Oral)β250Updated 6 months ago
- Official implementation of WebVLN: Vision-and-Language Navigation on Websitesβ29Updated last year
- [ICCV'23] LLM-Planner: Few-Shot Grounded Planning for Embodied Agents with Large Language Modelsβ201Updated 5 months ago
- β49Updated 5 months ago
- Evaluate Multimodal LLMs as Embodied Agentsβ54Updated 7 months ago
- Official Implementation of "JARVIS-VLA: Post-Training Large-Scale Vision Language Models to Play Visual Games with Keyboards and Mouse"β97Updated 3 weeks ago
- β94Updated last year
- Virtual Community: An Open World for Humans, Robots, and Societyβ172Updated this week
- [ACL 2024] PCA-Bench: Evaluating Multimodal Large Language Models in Perception-Cognition-Action Chainβ104Updated last year
- Pandora: Towards General World Model with Natural Language Actions and Video Statesβ516Updated 11 months ago
- [ICLR 2024 Spotlight] DreamLLM: Synergistic Multimodal Comprehension and Creationβ457Updated 9 months ago
- Official Repo for Fine-Tuning Large Vision-Language Models as Decision-Making Agents via Reinforcement Learningβ389Updated 9 months ago