microsoft / debug-gymLinks
A Text-Based Environment for Interactive Debugging
☆236Updated this week
Alternatives and similar repositories for debug-gym
Users that are interested in debug-gym are comparing it to the libraries listed below
Sorting:
- ☆71Updated 4 months ago
- Sandboxed code execution for AI agents, locally or on the cloud. Massively parallel, easy to extend. Powering SWE-agent and more.☆247Updated this week
- A Tree Search Library with Flexible API for LLM Inference-Time Scaling☆383Updated last week
- A better way of testing, inspecting, and analyzing AI Agent traces.☆39Updated last week
- ☆215Updated 2 weeks ago
- ☆306Updated 3 weeks ago
- ☆141Updated last week
- Code for our paper PAPILLON: PrivAcy Preservation from Internet-based and Local Language MOdel ENsembles☆52Updated 2 months ago
- The Granite Guardian models are designed to detect risks in prompts and responses.☆91Updated 3 weeks ago
- ReDel is a toolkit for researchers and developers to build, iterate on, and analyze recursive multi-agent systems. (EMNLP 2024 Demo)☆82Updated 4 months ago
- Together Open Deep Research☆320Updated 3 months ago
- Source code for the collaborative reasoner research project at Meta FAIR.☆95Updated 3 months ago
- Official repository for "DynaSaur: Large Language Agents Beyond Predefined Actions"☆346Updated 6 months ago
- Tutorial for building LLM router☆217Updated last year
- ☆36Updated 5 months ago
- An OpenSource Deep Research library with reasoning☆148Updated last month
- Claude Code dashboard with usage stats, error analysis, and sharable feature☆291Updated this week
- Scaling Data for SWE-agents☆293Updated this week
- ☆129Updated 3 months ago
- Agent computer interface for AI software engineer.☆89Updated this week
- ☆55Updated 3 months ago
- a simple example demonstrating MCP + ag2 (autogen) integration☆41Updated 7 months ago
- An agent benchmark with tasks in a simulated software company.☆488Updated last week
- A framework for pitting LLMs against each other in an evolving library of games ⚔☆32Updated 2 months ago
- A framework for optimizing DSPy programs with RL☆91Updated this week
- A powerful Python library for creating and managing isolated desktop environments using Docker containers.☆226Updated last week
- Letting Claude Code develop his own MCP tools :)☆114Updated 4 months ago
- Matrix (Multi-Agent daTa geneRation Infra and eXperimentation framework) is a versatile engine for multi-agent conversational data genera…☆73Updated this week
- Open-source versioning, tracing, and annotation tooling.☆165Updated this week
- A framework for fine-tuning retrieval-augmented generation (RAG) systems.☆122Updated this week