microsoft / debug-gym
A Text-Based Environment for Interactive Debugging
☆179Updated this week
Alternatives and similar repositories for debug-gym:
Users that are interested in debug-gym are comparing it to the libraries listed below
- ☆64Updated 2 months ago
- A better way of testing, inspecting, and analyzing AI Agent traces.☆35Updated this week
- Agent testing library that uses an agent to test your agent☆81Updated this week
- XTR/WARP is an extremely fast and accurate retrieval engine based on Stanford's ColBERTv2/PLAID and Google DeepMind's XTR.☆123Updated 6 months ago
- Verdict is a library for scaling judge-time compute.☆199Updated last week
- ☆85Updated 3 months ago
- ☆36Updated 2 months ago
- ☆85Updated 3 months ago
- A list of AI memory projects☆96Updated 3 months ago
- Let Claude control a web browser on your machine.☆26Updated 2 months ago
- 🤖 Headless IDE for AI agents☆183Updated last week
- Official repository for "DynaSaur: Large Language Agents Beyond Predefined Actions"☆337Updated 4 months ago
- ReDel is a toolkit for researchers and developers to build, iterate on, and analyze recursive multi-agent systems. (EMNLP 2024 Demo)☆78Updated last month
- Minimal example of MCP for parsing llms.txt☆35Updated 2 weeks ago
- For LLMs to better code with Jina API☆145Updated last week
- An MCP Server that's also an MCP Client. Useful for letting Claude develop and test MCPs without needing to reset the application.☆115Updated last month
- Sandboxed code execution for AI agents, locally or on the cloud. Massively parallel, easy to extend. Powering SWE-agent and more.☆162Updated this week
- Test Generation for Prompts☆70Updated this week
- Letting Claude Code develop his own MCP tools :)☆99Updated last month
- ☆46Updated 2 weeks ago
- Framework for building, orchestrating and deploying multi-agent systems. Managed by OpenAI Solutions team. Experimental framework.☆90Updated 6 months ago
- ☆41Updated last month
- Preference-based Recursive Language Modeling for Exploratory Optimization of Reasoning☆209Updated 2 months ago
- ☆87Updated 2 months ago
- Routing on Random Forest (RoRF)☆147Updated 7 months ago
- Helping you select an AI agent framework☆143Updated this week
- a minimalistic template for dynamic self-building AI agents☆97Updated 3 months ago
- Readymade evaluators for agent trajectories☆169Updated 3 weeks ago
- ☆38Updated 3 months ago
- Solving data for LLMs - Create quality synthetic datasets!☆146Updated 3 months ago