microsoft / debug-gymLinks
A Text-Based Environment for Interactive Debugging
☆225Updated this week
Alternatives and similar repositories for debug-gym
Users that are interested in debug-gym are comparing it to the libraries listed below
Sorting:
- ☆69Updated 4 months ago
- ☆211Updated last week
- Together Open Deep Research☆314Updated 2 months ago
- A better way of testing, inspecting, and analyzing AI Agent traces.☆38Updated 3 weeks ago
- ☆127Updated 3 months ago
- The Granite Guardian models are designed to detect risks in prompts and responses.☆88Updated this week
- Sandboxed code execution for AI agents, locally or on the cloud. Massively parallel, easy to extend. Powering SWE-agent and more.☆228Updated last week
- Atropos is a Language Model Reinforcement Learning Environments framework for collecting and evaluating LLM trajectories through diverse …☆514Updated this week
- Letting Claude Code develop his own MCP tools :)☆113Updated 3 months ago
- An open-source prompt engineering framework.☆155Updated this week
- ☆51Updated 3 months ago
- Test Generation for Prompts☆105Updated last week
- Open-source resources on agents for computer use.☆350Updated 5 months ago
- Challenges for general-purpose web-browsing AI agents☆58Updated 3 weeks ago
- II-Researcher: a new open-source framework designed to aid building search / research agents☆376Updated last month
- a simple example demonstrating MCP + ag2 (autogen) integration☆41Updated 6 months ago
- 🤖 Headless IDE for AI agents☆191Updated 2 months ago
- ☆124Updated this week
- A single interface to use and evaluate different agent frameworks☆499Updated this week
- ☆113Updated 2 weeks ago
- ☆122Updated 2 weeks ago
- Coding problems used in aider's polyglot benchmark☆142Updated 6 months ago
- ☆182Updated 2 months ago
- Code implementation for paper "A-mem: Agentic Memory for LLM Agents"☆459Updated last month
- Official repository for "DynaSaur: Large Language Agents Beyond Predefined Actions"☆343Updated 6 months ago
- Search Monorepos and get relevant answers☆435Updated this week
- Agent computer interface for AI software engineer.☆85Updated this week
- A list of AI memory projects☆156Updated 5 months ago
- Public repository containing METR's DVC pipeline for eval data analysis☆70Updated 2 months ago
- ☆100Updated 2 weeks ago