[ICLR 2025] A trinity of environments, tools, and benchmarks for general virtual agents
☆228Jun 16, 2025Updated 8 months ago
Alternatives and similar repositories for agent-studio
Users that are interested in agent-studio are comparing it to the libraries listed below
Sorting:
- (ICLR 2025) The Official Code Repository for GUI-World.☆68Dec 18, 2024Updated last year
- ☆20Apr 24, 2024Updated last year
- The model, data and code for the visual GUI Agent SeeClick☆469Jul 13, 2025Updated 7 months ago
- Official implementation for "You Only Look at Screens: Multimodal Chain-of-Action Agents" (Findings of ACL 2024)☆255Jul 16, 2024Updated last year
- GUICourse: From General Vision Langauge Models to Versatile GUI Agents☆136Updated this week
- [ICLR'25 Oral] UGround: Universal GUI Visual Grounding for GUI Agents☆300Jul 18, 2025Updated 7 months ago
- [NeurIPS 2024] OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments☆2,608Updated this week
- ScreenAgent: A Computer Control Agent Driven by Visual Language Large Model (IJCAI-24)☆572Nov 25, 2024Updated last year
- [ICLR 2024] Trajectory-as-Exemplar Prompting with Memory for Computer Control☆68Jan 7, 2026Updated last month
- Official Repo for "Why Settle for One? Text-to-ImageSet Generation and Evaluation"☆21Oct 1, 2025Updated 5 months ago
- [NeurIPS'23 Spotlight] "Mind2Web: Towards a Generalist Agent for the Web" -- the first LLM-based web agent and benchmark for generalist w…☆947Nov 5, 2025Updated 4 months ago
- Code and implementations for the ACL 2025 paper "AgentGym: Evolving Large Language Model-based Agents across Diverse Environments" by Zhi…☆737Sep 11, 2025Updated 5 months ago
- Detecting Drift in a Diabetes Dataset using Taipy☆12May 19, 2025Updated 9 months ago
- GPT4 based personalized ArXiv paper assistant bot☆12Mar 1, 2024Updated 2 years ago
- [NAACL 2025] KnowAgent: Knowledge-Augmented Planning for LLM-Based Agents☆257Jan 29, 2025Updated last year
- [ICML'24] SeeAct is a system for generalist web agents that autonomously carry out tasks on any given website, with a focus on large mult…☆826Feb 3, 2025Updated last year
- [ICCV 2025] GUIOdyssey is a comprehensive dataset for training and evaluating cross-app navigation agents. GUIOdyssey consists of 8,834 e…☆147Jan 3, 2026Updated 2 months ago
- VisualWebArena is a benchmark for multimodal agents.☆440Nov 9, 2024Updated last year
- A swarm of LLM agents that will help you test, document, and productionize your code!☆16Feb 16, 2026Updated 2 weeks ago
- Various agents from all of the top agent frameworks to integrate into swarms! Langchain, Griptape, CrewAI, and more!☆18Dec 22, 2025Updated 2 months ago
- AI Search engine☆13Sep 24, 2025Updated 5 months ago
- 🌍 AppWorld: A Controllable World of Apps and People for Benchmarking Function Calling and Interactive Coding Agent, ACL'24 Best Resource…☆377Feb 17, 2026Updated 2 weeks ago
- Scaling Agentic Environments Automatically.☆54Jan 22, 2026Updated last month
- AgentSearch is a framework for powering search agents and enabling customizable local search.☆518Apr 22, 2024Updated last year
- Code for Husky, an open-source language agent that solves complex, multi-step reasoning tasks. Husky v1 addresses numerical, tabular and …☆346Jun 16, 2024Updated last year
- Official Code Repository for EnvGen: Generating and Adapting Environments via LLMs for Training Embodied Agents (COLM 2024)☆40Jul 13, 2024Updated last year
- [NeurIPS 2024] Spider2-V: How Far Are Multimodal Agents From Automating Data Science and Engineering Workflows?☆139Aug 26, 2024Updated last year
- WebLINX is a benchmark for building web navigation agents with conversational capabilities☆160Feb 11, 2025Updated last year
- Official Repository of "GraphTeam: Facilitating Large Language Model-based Graph Analysis via Multi-Agent Collaboration".☆42Mar 28, 2025Updated 11 months ago
- LangChain + LiteLLM that works☆50Sep 1, 2025Updated 6 months ago
- The Cradle framework is a first attempt at General Computer Control (GCC). Cradle supports agents to ace any computer task by enabling st…☆2,455Nov 7, 2024Updated last year
- An LLM-based Web Navigating Agent (KDD'24)☆929Sep 27, 2024Updated last year
- AndroidWorld is an environment and benchmark for autonomous agents☆640Feb 24, 2026Updated last week
- Code repo for "WebArena: A Realistic Web Environment for Building Autonomous Agents"☆1,353Nov 26, 2025Updated 3 months ago
- Implementation of the paper: "AssistantBench: Can Web Agents Solve Realistic and Time-Consuming Tasks?"☆70Dec 9, 2024Updated last year
- WorkArena: How Capable are Web Agents at Solving Common Knowledge Work Tasks?☆234Feb 23, 2026Updated last week
- Taipy Demo of a Realtime Dashboard of Air Pollution around a Factory☆17May 20, 2025Updated 9 months ago
- [EMNLP 2022] The baseline code for META-GUI dataset☆14Jul 9, 2024Updated last year
- ☆53Feb 19, 2025Updated last year