JiuTian-VL / Optimus-1
[NeurIPS 2024] Official Implementation for Optimus-1: Hybrid Multimodal Memory Empowered Agents Excel in Long-Horizon Tasks
☆25 · Updated last month
Related projects
Alternatives and complementary repositories for Optimus-1
- The Official Code Repository for GUI-World. ☆41 · Updated 3 months ago
- Code for the paper: Harnessing Webpage UIs for Text-Rich Visual Understanding ☆38 · Updated last month
- Resources for our paper: "EvoAgent: Towards Automatic Multi-Agent Generation via Evolutionary Algorithms" ☆75 · Updated last month
- A Framework for Decoupling and Assessing the Capabilities of VLMs ☆38 · Updated 4 months ago
- Official implementation of the paper "MMInA: Benchmarking Multihop Multimodal Internet Agents" ☆38 · Updated 7 months ago
- The official implementation of Self-Exploring Language Models (SELM) ☆55 · Updated 5 months ago
- ☆46 · Updated last week
- Official Repo for UGround ☆97 · Updated last week
- [NeurIPS 2024 D&B Track] GTA: A Benchmark for General Tool Agents ☆46 · Updated 2 weeks ago
- ☆63 · Updated last month
- WebLINX is a benchmark for building web navigation agents with conversational capabilities ☆118 · Updated last month
- FuseAI Project ☆76 · Updated 3 months ago
- [NeurIPS 2024] A task generation and model evaluation system for multimodal language models. ☆56 · Updated last month
- Co-LLM: Learning to Decode Collaboratively with Multiple Language Models ☆103 · Updated 6 months ago
- ☆21 · Updated this week
- Challenge LLMs to Reason About Reasoning: A Benchmark to Unveil Cognitive Depth in LLMs ☆42 · Updated 4 months ago
- Web2Code: A Large-scale Webpage-to-Code Dataset and Evaluation Framework for Multimodal LLMs ☆62 · Updated 3 weeks ago
- ☆60 · Updated 2 weeks ago
- A Comprehensive Framework for Developing and Evaluating Multimodal Role-Playing Agents ☆31 · Updated this week
- Towards Large Multimodal Models as Visual Foundation Agents ☆122 · Updated last week
- Code for Visual Sketchpad: Sketching as a Visual Chain of Thought for Multimodal Language Models ☆124 · Updated 3 weeks ago
- A repository for research on medium-sized language models. ☆74 · Updated 5 months ago
- Unofficial Implementation of Chain-of-Thought Reasoning Without Prompting ☆17 · Updated 8 months ago
- ☆116 · Updated 5 months ago
- ☆76 · Updated 5 months ago
- ☆35 · Updated last year
- This repository contains the code for the paper: SirLLM: Streaming Infinite Retentive LLM ☆56 · Updated 5 months ago
- This is the official repository for Inheritune. ☆105 · Updated last month
- ☆59 · Updated last month