microsoft / debug-gymLinks
A Text-Based Environment for Interactive Debugging
☆265Updated this week
Alternatives and similar repositories for debug-gym
Users that are interested in debug-gym are comparing it to the libraries listed below
Sorting:
- ☆76Updated 6 months ago
- Sandboxed code execution for AI agents, locally or on the cloud. Massively parallel, easy to extend. Powering SWE-agent and more.☆315Updated this week
- A better way of testing, inspecting, and analyzing AI Agent traces.☆40Updated 2 months ago
- MCP-Universe is a comprehensive framework designed for developing, testing, and benchmarking AI agents☆423Updated this week
- ☆145Updated last month
- Verifiers for LLM Reinforcement Learning☆75Updated last week
- ☆231Updated 2 months ago
- A framework for optimizing DSPy programs with RL☆172Updated last week
- ☆296Updated last month
- A multi-agent LLM system for detecting and resolving cognitive dissonance.☆262Updated 3 weeks ago
- ☆55Updated 7 months ago
- A Tree Search Library with Flexible API for LLM Inference-Time Scaling☆458Updated last month
- A framework for pitting LLMs against each other in an evolving library of games ⚔☆33Updated 4 months ago
- frozen-in-time version of our Paper Finder agent for reproducing evaluation results☆175Updated 3 weeks ago
- ☆73Updated 2 weeks ago
- Agent computer interface for AI software engineer.☆109Updated 2 weeks ago
- Context Engineering Course with DSPy☆177Updated last month
- Benchmark and optimize LLM inference across frameworks with ease☆41Updated last week
- Together Open Deep Research☆346Updated 5 months ago
- The Granite Guardian models are designed to detect risks in prompts and responses.☆112Updated last week
- Open collaboration infrastructure that enables communication, coordination, trust and payments for The Internet of Agents.☆146Updated this week
- ☆38Updated 3 weeks ago
- Test Generation for Prompts☆138Updated this week
- Scaling Data for SWE-agents☆399Updated this week
- Hugging Face MCP Server☆89Updated this week
- A powerful Python library for creating and managing isolated desktop environments using Docker containers.☆380Updated last week
- Graphite Agentic Framework by Binome Technologies☆167Updated this week
- An OpenSource Deep Research library with reasoning☆158Updated last week
- Code to accompany the Universal Deep Research paper (https://arxiv.org/abs/2509.00244)☆418Updated 3 weeks ago
- ☆264Updated 3 months ago