computer-agents / agent-studio

Environments, tools, and benchmarks for general computer agents

☆172

Related projects

Alternatives and complementary repositories for agent-studio

microsoft / simulated-trial-and-error
☆116 Updated 5 months ago
metauto-ai / agent-as-a-judge
🤠 Agent-as-a-Judge and DevAI dataset
☆192 Updated this week
OS-Copilot / OS-Atlas
OS-ATLAS: A Foundation Action Model For Generalist GUI Agents
☆166 Updated this week
McGill-NLP / weblinx
WebLINX is a benchmark for building web navigation agents with conversational capabilities
☆118 Updated last month
OpenBMB / Eurus
☆287 Updated 2 months ago
THUDM / WebRL
Building Open LLM Web Agents with Self-Evolving Online Curriculum RL
☆204 Updated this week
Ag2S1 / Sibyl-System
☆103 Updated 3 months ago
XinyuanWangCS / PromptAgent
This is the official repo for "PromptAgent: Strategic Planning with Language Models Enables Expert-level Prompt Optimization". PromptAgen…
☆204 Updated 3 months ago
anchen1011 / FireAct
FireAct: Toward Language Agent Fine-tuning
☆255 Updated last year
GAIR-NLP / ReAlign
Reformatted Alignment
☆112 Updated last month
OSU-NLP-Group / UGround
Official Repo for UGround
☆97 Updated last week
zjunlp / AutoAct
[ACL 2024] AUTOACT: Automatic Agent Learning from Scratch for QA via Self-Planning
☆178 Updated last month
ranpox / awesome-computer-use
This is a collection of resources for computer-use agents, including videos, blogs, papers, and projects.
☆102 Updated last week
zorazrw / agent-workflow-memory
AWM: Agent Workflow Memory
☆205 Updated last month
SalesforceAIResearch / xLAM
☆316 Updated last month
camel-ai / crab
CRAB: Cross-environment Agent Benchmark for Multimodal Language Model Agents. https://crab.camel-ai.org/
☆191 Updated last week
microsoft / Everything-of-Thoughts-XoT
An implementation of Everything of Thoughts (XoT).
☆132 Updated 8 months ago
hkust-nlp / AgentBoard
An Analytical Evaluation Board of Multi-turn LLM Agents
☆250 Updated 6 months ago
cooelf / Auto-GUI
Official implementation for "You Only Look at Screens: Multimodal Chain-of-Action Agents" (Findings of ACL 2024)
☆198 Updated 4 months ago
diagram-of-thought / diagram-of-thought
Official implementation of paper "On the Diagram of Thought" (https://arxiv.org/abs/2409.10038)
☆170 Updated last month
kyegomez / Algorithm-Of-Thoughts
My implementation of "Algorithm of Thoughts: Enhancing Exploration of Ideas in Large Language Models"
☆92 Updated last year
THUDM / Android-Lab
☆154 Updated 2 weeks ago
web-arena-x / visualwebarena
VisualWebArena is a benchmark for multimodal agents.
☆244 Updated last week
NL2Code / CodeR
☆152 Updated 2 months ago
kohjingyu / search-agents
Code for the paper 🌳 Tree Search for Language Model Agents
☆138 Updated 3 months ago
DigiRL-agent / digirl
Official repo for paper DigiRL: Training In-The-Wild Device-Control Agents with Autonomous Reinforcement Learning.
☆259 Updated last month
zjunlp / KnowAgent
KnowAgent: Knowledge-Augmented Planning for LLM-Based Agents
☆172 Updated last month
GAIR-NLP / ProX
Official Repo for "Programming Every Example: Lifting Pre-training Data Quality Like Experts at Scale"
☆191 Updated last month
StonyBrookNLP / appworld
🌍 Repository for "AppWorld: A Controllable World of Apps and People for Benchmarking Interactive Coding Agent", ACL'24 Best Resource Pap…
☆110 Updated 3 weeks ago
NexaAI / octopus-v4
AI for all: Build the large graph of the language models
☆244 Updated 5 months ago