JiuTian-VL / Optimus-1
[NeurIPS 2024] Official Implementation for Optimus-1: Hybrid Multimodal Memory Empowered Agents Excel in Long-Horizon Tasks
☆55 Updated 2 weeks ago
Alternatives and similar repositories for Optimus-1:
Users who are interested in Optimus-1 are comparing it to the repositories listed below.
- Code and data for OS-Genesis: Automating GUI Agent Trajectory Construction via Reverse Task Synthesis ☆86 Updated this week
- [NeurIPSw'24] This repo is the official implementation of "MineDreamer: Learning to Follow Instructions via Chain-of-Imagination for Simu… ☆77 Updated this week
- ☆34 Updated 3 weeks ago
- Code for Paper: Autonomous Evaluation and Refinement of Digital Agents [COLM 2024] ☆111 Updated 2 months ago
- ☆31 Updated last week
- The Official Code Repository for GUI-World. ☆46 Updated last month
- Towards Large Multimodal Models as Visual Foundation Agents ☆169 Updated last month
- Insight-V: Exploring Long-Chain Visual Reasoning with Multimodal Large Language Models ☆127 Updated last month
- [NeurIPS 2024] A task generation and model evaluation system for multimodal language models. ☆62 Updated 2 months ago
- A Framework for Decoupling and Assessing the Capabilities of VLMs ☆40 Updated 7 months ago
- ☆44 Updated last year
- ☆48 Updated last month
- Resources for our paper: "EvoAgent: Towards Automatic Multi-Agent Generation via Evolutionary Algorithms"