zorazrw / agent-workflow-memory

AWM: Agent Workflow Memory

☆297

Alternatives and similar repositories for agent-workflow-memory

Users who are interested in agent-workflow-memory are comparing it to the repositories listed below.

kohjingyu / search-agents
Code for the paper 🌳 Tree Search for Language Model Agents
☆208 Updated last year
StonyBrookNLP / appworld
🌍 Repository for "AppWorld: A Controllable World of Apps and People for Benchmarking Interactive Coding Agent", ACL'24 Best Resource Pap…
☆231 Updated 2 months ago
ServiceNow / AgentLab
AgentLab: An open-source framework for developing, testing, and benchmarking web agents on diverse tasks, designed for scalability and re…
☆372 Updated this week
ByteDance-Seed / Agent-R
Resources for our paper: "Agent-R: Training Language Model Agents to Reflect via Iterative Self-Training"
☆153 Updated last month
facebookresearch / sweet_rl
Benchmark and research code for the paper SWEET-RL Training Multi-Turn LLM Agents on Collaborative Reasoning Tasks
☆231 Updated 2 months ago
SalesforceAIResearch / xLAM
xLAM: A Family of Large Action Models to Empower AI Agent Systems
☆507 Updated last week
OS-Copilot / OS-Atlas
OS-ATLAS: A Foundation Action Model For Generalist GUI Agents
☆361 Updated 3 months ago
vsubramaniam851 / multiagent-ft
☆211 Updated 5 months ago
hkust-nlp / AgentBoard
An Analytical Evaluation Board of Multi-turn LLM Agents [NeurIPS 2024 Oral]
☆332 Updated last year
diagram-of-thought / diagram-of-thought
Official implementation of paper "On the Diagram of Thought" (https://arxiv.org/abs/2409.10038)
☆184 Updated 4 months ago
web-arena-x / visualwebarena
VisualWebArena is a benchmark for multimodal agents.
☆364 Updated 8 months ago
OSU-NLP-Group / UGround
[ICLR'25 Oral] UGround: Universal GUI Visual Grounding for GUI Agents
☆262 Updated 2 weeks ago
SALT-NLP / collaborative-gym
Framework and toolkits for building and evaluating collaborative agents that can work together with humans.
☆90 Updated 3 months ago
xlang-ai / Spider2-V
[NeurIPS 2024] Spider2-V: How Far Are Multimodal Agents From Automating Data Science and Engineering Workflows?
☆129 Updated 11 months ago
THUDM / WebRL
Building Open LLM Web Agents with Self-Evolving Online Curriculum RL
☆430 Updated last month
multi-agent-systems-failure-taxonomy / MAST
☆240 Updated last week
SWE-bench / SWE-smith
Scaling Data for SWE-agents
☆328 Updated this week
OSU-NLP-Group / WebDreamer
"Is Your LLM Secretly a World Model of the Internet? Model-Based Planning for Web Agents"
☆78 Updated 3 months ago
McGill-NLP / weblinx
WebLINX is a benchmark for building web navigation agents with conversational capabilities
☆156 Updated 5 months ago
ServiceNow / TapeAgents
TapeAgents is a framework that facilitates all stages of the LLM Agent development lifecycle
☆288 Updated this week
agent-husky / Husky-v1
Code for Husky, an open-source language agent that solves complex, multi-step reasoning tasks. Husky v1 addresses numerical, tabular and …
☆345 Updated last year
ltzheng / agent-studio
[ICLR 2025] A trinity of environments, tools, and benchmarks for general virtual agents
☆212 Updated last month
samkhur006 / awesome-llm-planning-reasoning
A curated collection of LLM reasoning and planning resources, including key papers, limitations, benchmarks, and additional learning mate…
☆284 Updated 5 months ago
zjunlp / WKM
[NeurIPS 2024] Agent Planning with World Knowledge Model
☆144 Updated 7 months ago
facebookresearch / swe-rl
Official codebase for "SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution"
☆571 Updated 4 months ago
zjunlp / AutoAct
[ACL 2024] AutoAct: Automatic Agent Learning from Scratch for QA via Self-Planning
☆229 Updated 6 months ago
TheAgentCompany / TheAgentCompany
An agent benchmark with tasks in a simulated software company.
☆509 Updated this week
suzgunmirac / dynamic-cheatsheet
Dynamic Cheatsheet: Test-Time Learning with Adaptive Memory
☆68 Updated 2 months ago
SWE-Gym / SWE-Gym
Code for Paper: Training Software Engineering Agents and Verifiers with SWE-Gym [ICML 2025]
☆513 Updated this week
zjunlp / OneGen
[EMNLP 2024 Findings] OneGen: Efficient One-Pass Unified Generation and Retrieval for LLMs.
☆148 Updated 8 months ago