microsoft / debug-gym

A Text-Based Environment for Interactive Debugging

☆225

Alternatives and similar repositories for debug-gym

Users who are interested in debug-gym are comparing it to the repositories listed below.

LLMSELECTOR / LLMSELECTOR
☆69 Updated 4 months ago
anthropic-experimental / agentic-misalignment
☆211 Updated last week
togethercomputer / open_deep_research
Together Open Deep Research
☆314 Updated 2 months ago
invariantlabs-ai / explorer
A better way of testing, inspecting, and analyzing AI Agent traces.
☆38 Updated 3 weeks ago
PrimeIntellect-ai / genesys
☆127 Updated 3 months ago
ibm-granite / granite-guardian
The Granite Guardian models are designed to detect risks in prompts and responses.
☆88 Updated this week
SWE-agent / SWE-ReX
Sandboxed code execution for AI agents, locally or on the cloud. Massively parallel, easy to extend. Powering SWE-agent and more.
☆228 Updated last week
NousResearch / atropos
Atropos is a Language Model Reinforcement Learning Environments framework for collecting and evaluating LLM trajectories through diverse …
☆514 Updated this week
willccbb / claude-code-mcp
Letting Claude Code develop his own MCP tools :)
☆113 Updated 3 months ago
Mirascope / lilypad
An open-source prompt engineering framework.
☆155 Updated this week
shreyaskarnik / huggingface-mcp-server
☆51 Updated 3 months ago
microsoft / promptpex
Test Generation for Prompts
☆105 Updated last week
All-Hands-AI / open-operator
Open-source resources on agents for computer use.
☆350 Updated 5 months ago
convergence-ai / webgames
Challenges for general-purpose web-browsing AI agents
☆58 Updated 3 weeks ago
Intelligent-Internet / ii-researcher
II-Researcher: a new open-source framework designed to aid building search / research agents
☆376 Updated last month
jtanningbed / mcp-ag2-example
a simple example demonstrating MCP + ag2 (autogen) integration
☆41 Updated 6 months ago
hide-org / hide
🤖 Headless IDE for AI agents
☆191 Updated 2 months ago
Unsiloed-AI / Unsiloed-chunker
☆124 Updated this week
mozilla-ai / any-agent
A single interface to use and evaluate different agent frameworks
☆499 Updated this week
tom-doerr / simpledspy
☆113 Updated 2 weeks ago
run-llama / llamacloud-mcp
☆122 Updated 2 weeks ago
Aider-AI / polyglot-benchmark
Coding problems used in aider's polyglot benchmark
☆142 Updated 6 months ago
pyember / ember
☆182 Updated 2 months ago
WujiangXu / AgenticMemory
Code implementation for paper "A-mem: Agentic Memory for LLM Agents"
☆459 Updated last month
adobe-research / dynasaur
Official repository for "DynaSaur: Large Language Agents Beyond Predefined Actions"
☆343 Updated 6 months ago
vitali87 / code-graph-rag
Search Monorepos and get relevant answers
☆435 Updated this week
All-Hands-AI / openhands-aci
Agent computer interface for AI software engineer.
☆85 Updated this week
topoteretes / awesome-ai-memory
A list of AI memory projects
☆156 Updated 5 months ago
METR / eval-analysis-public
Public repository containing METR's DVC pipeline for eval data analysis
☆70 Updated 2 months ago
cognitivecomputations / dolphin-logger
☆100 Updated 2 weeks ago