camel-ai / crab
CRAB: Cross-environment Agent Benchmark for Multimodal Language Model Agents. https://crab.camel-ai.org/
☆191Updated last week
Related projects
Alternatives and complementary repositories for crab
- AWM: Agent Workflow Memory☆205Updated last month
- Building Open LLM Web Agents with Self-Evolving Online Curriculum RL☆204Updated this week
- An implementation of Everything of Thoughts (XoT).☆132Updated 8 months ago
- Environments, tools, and benchmarks for general computer agents☆172Updated 3 weeks ago
- ReDel is a toolkit for researchers and developers to build, iterate on, and analyze recursive multi-agent systems. (EMNLP 2024 Demo)☆64Updated this week
- Official implementation of paper "On the Diagram of Thought" (https://arxiv.org/abs/2409.10038)☆170Updated last month
- Code for Husky, an open-source language agent that solves complex, multi-step reasoning tasks. Husky v1 addresses numerical, tabular and …☆328Updated 5 months ago
- ☆103Updated 3 months ago
- Code and data for "Lumos: Learning Agents with Unified Data, Modular Design, and Open-Source LLMs"☆448Updated 8 months ago
- OS-ATLAS: A Foundation Action Model For Generalist GUI Agents☆166Updated this week
- ☆116Updated 5 months ago
- ☆316Updated last month
- ☆152Updated 2 months ago
- Official Repo for UGround☆97Updated last week
- WebLINX is a benchmark for building web navigation agents with conversational capabilities☆118Updated last month
- An Analytical Evaluation Board of Multi-turn LLM Agents☆250Updated 6 months ago
- EcoAssistant: using LLM assistant more affordably and accurately☆129Updated 4 months ago
- A compilation of the best multi-agent papers☆258Updated 2 weeks ago
- ☆282Updated 7 months ago
- Official Repo for ICML 2024 paper "Executable Code Actions Elicit Better LLM Agents" by Xingyao Wang, Yangyi Chen, Lifan Yuan, Yizhe Zhan…☆496Updated 5 months ago
- Beating the GAIA benchmark with Transformers Agents. 🚀☆62Updated 3 weeks ago
- [EMNLP 2024 Findings] OneGen: Efficient One-Pass Unified Generation and Retrieval for LLMs.☆138Updated last week
- Resources for our paper: "EvoAgent: Towards Automatic Multi-Agent Generation via Evolutionary Algorithms"☆75Updated last month
- Code for the paper 🌳 Tree Search for Language Model Agents☆138Updated 3 months ago
- AI for all: Build the large graph of the language models☆244Updated 5 months ago
- My implementation of "Algorithm of Thoughts: Enhancing Exploration of Ideas in Large Language Models"☆92Updated last year
- 🤠 Agent-as-a-Judge and DevAI dataset☆192Updated this week
- Meta-Prompting: Enhancing Language Models with Task-Agnostic Scaffolding☆327Updated 9 months ago
- A memory framework for Large Language Models and Agents.☆162Updated 3 months ago
- Formal-LLM: Integrating Formal Language and Natural Language for Controllable LLM-based Agents☆110Updated 5 months ago