microsoft / agdebuggerLinks
☆65Updated last month
Alternatives and similar repositories for agdebugger
Users that are interested in agdebugger are comparing it to the libraries listed below
Sorting:
- Official page for ICLR 2025 paper "Sufficient Context: A New Lens on Retrieval Augmented Generation Systems"☆63Updated 5 months ago
- ☆235Updated 3 weeks ago
- Ranking LLMs on agentic tasks☆204Updated last month
- A clean, modular SDK for building AI agents with OpenHands V1.☆337Updated this week
- Official Repo for CRMArena and CRMArena-Pro☆126Updated last month
- A framework for fine-tuning retrieval-augmented generation (RAG) systems.☆137Updated last week
- ☆74Updated last year
- RAG evaluation without the need for "golden answers"☆328Updated last week
- A framework for standardizing evaluations of large foundation models, beyond single-score reporting and rankings.☆173Updated 3 weeks ago
- A list of AI memory projects☆262Updated 11 months ago
- Client interface to Cleanlab Studio☆32Updated 10 months ago
- ☆79Updated 2 months ago
- Test Generation for Prompts☆143Updated 2 weeks ago
- A programming framework for agentic AI. Discord: https://discord.gg/pAbnFJrkgZ☆137Updated 10 months ago
- Agent computer interface for AI software engineer.☆115Updated 2 weeks ago
- The Granite Guardian models are designed to detect risks in prompts and responses.☆123Updated 2 months ago
- A Text-Based Environment for Interactive Debugging☆284Updated this week
- A better way of testing, inspecting, and analyzing AI Agent traces.☆40Updated 2 months ago
- A Lightweight Library for AI Observability☆252Updated 10 months ago
- ☆304Updated 4 months ago
- Catch MCP server issues before your agents do.☆136Updated last week
- DSPY on action with OpenSource LLMs.☆102Updated last year
- Tutorial for building LLM router☆239Updated last year
- Code to accompany the Universal Deep Research paper (https://arxiv.org/abs/2509.00244)☆450Updated 3 months ago
- Together Open Deep Research☆356Updated 8 months ago
- A curated list of awesome approaches to AI model routing☆172Updated 8 months ago
- Workflows are an event-driven, async-first, step-based way to control the execution flow of AI applications like agents.☆290Updated this week
- CUGA is an open-source generalist agent for the enterprise, supporting complex task execution on web and APIs, OpenAPI/MCP integrations, …☆388Updated this week
- frozen-in-time version of our Paper Finder agent for reproducing evaluation results☆211Updated 4 months ago
- ☆105Updated last year