camel-ai / crabLinks
π¦οΈ CRAB: Cross-environment Agent Benchmark for Multimodal Language Model Agents. https://crab.camel-ai.org/
β372Updated 2 months ago
Alternatives and similar repositories for crab
Users that are interested in crab are comparing it to the libraries listed below
Sorting:
- An open-source framework for collaborative AI agents, enabling diverse, distributed agents to team up and tackle complex tasks through inβ¦β752Updated 10 months ago
- [ICLR 2025] A trinity of environments, tools, and benchmarks for general virtual agentsβ218Updated 2 months ago
- βοΈ The First Coding Agent-as-a-Judgeβ626Updated 3 months ago
- This is a collection of resources for computer-use GUI agents, including videos, blogs, papers, and projects.β433Updated 3 months ago
- AI for all: Build the large graph of the language modelsβ275Updated last year
- Code for Husky, an open-source language agent that solves complex, multi-step reasoning tasks. Husky v1 addresses numerical, tabular and β¦β345Updated last year
- An open platform for enhancing the capability of LLMs in workflow orchestration.β170Updated 6 months ago
- AWM: Agent Workflow Memoryβ312Updated 7 months ago
- Code and implementations for the paper "AgentGym: Evolving Large Language Model-based Agents across Diverse Environments" by Zhiheng Xi eβ¦β526Updated last week
- Beating the GAIA benchmark with Transformers Agents. πβ136Updated 6 months ago
- Building Open LLM Web Agents with Self-Evolving Online Curriculum RLβ450Updated 3 months ago
- β631Updated 7 months ago
- [ICML2025] Aguvis: Unified Pure Vision Agents for Autonomous GUI Interactionβ356Updated 6 months ago
- OpenCUA: Open Foundations for Computer-Use Agentsβ458Updated last week
- β800Updated 2 weeks ago
- OS-ATLAS: A Foundation Action Model For Generalist GUI Agentsβ379Updated 4 months ago
- The evaluation benchmark on MCP serversβ198Updated last week
- [EMNLP 2025] OmniThink: Expanding Knowledge Boundaries in Machine Writing through Thinkingβ456Updated 3 weeks ago
- An Open-Source AI Writing Project.β366Updated this week
- β310Updated last year
- Multi-Faceted AI Agent and Workflow Autotuning. Automatically optimizes LangChain, LangGraph, DSPy programs for better quality, lower exeβ¦β252Updated 3 months ago
- Windows Agent Arena (WAA) πͺ is a scalable OS platform for testing and benchmarking of multi-modal AI agents.β764Updated 4 months ago
- xLAM: A Family of Large Action Models to Empower AI Agent Systemsβ553Updated 3 weeks ago
- A repo with an automated prompt engineering workflow from scratch. It leverages the OPRO technique.β198Updated last year
- Code for ScribeAgent paperβ61Updated 6 months ago
- A LLM-based Agent that predict its tasks proactively.β417Updated 3 weeks ago
- [Up-to-date] Awesome Agentic Deep Research Resourcesβ437Updated 2 weeks ago
- This is the official repository for Auto-RAG.β221Updated last month
- TapeAgents is a framework that facilitates all stages of the LLM Agent development lifecycleβ295Updated this week
- AgentLab: An open-source framework for developing, testing, and benchmarking web agents on diverse tasks, designed for scalability and reβ¦β399Updated last week