microsoft / agdebuggerLinks
☆68Updated 2 weeks ago
Alternatives and similar repositories for agdebugger
Users that are interested in agdebugger are comparing it to the libraries listed below
Sorting:
- ☆80Updated 4 months ago
- Tutorial for building LLM router☆244Updated last year
- Official Repo for CRMArena and CRMArena-Pro☆132Updated this week
- ☆237Updated 2 months ago
- Ranking LLMs on agentic tasks☆210Updated 2 months ago
- A framework for fine-tuning retrieval-augmented generation (RAG) systems.☆139Updated 2 weeks ago
- ☆106Updated last year
- 🔧 Compare how Agent systems perform on several benchmarks. 📊🚀☆103Updated 6 months ago
- The Granite Guardian models are designed to detect risks in prompts and responses.☆130Updated 3 months ago
- A framework for standardizing evaluations of large foundation models, beyond single-score reporting and rankings.☆174Updated 2 weeks ago
- DSPY on action with OpenSource LLMs.☆103Updated last year
- A Text-Based Environment for Interactive Debugging☆293Updated this week
- Matrix (Multi-Agent daTa geneRation Infra and eXperimentation framework) is a versatile engine for multi-agent conversational data genera…☆261Updated this week
- Official page for ICLR 2025 paper "Sufficient Context: A New Lens on Retrieval Augmented Generation Systems"☆63Updated 6 months ago
- A better way of testing, inspecting, and analyzing AI Agent traces.☆46Updated 3 weeks ago
- ☆76Updated last year
- A curated list of awesome approaches to AI model routing☆184Updated 10 months ago
- A system that tries to resolve all issues on a github repo with OpenHands.☆117Updated last year
- Simple examples using Argilla tools to build AI☆57Updated last year
- A Lightweight Library for AI Observability☆255Updated 11 months ago
- Lean implementation of various multi-agent LLM methods, including Iteration of Thought (IoT)☆128Updated 11 months ago
- A clean, modular SDK for building AI agents with OpenHands V1.☆476Updated this week
- Beating the GAIA benchmark with Transformers Agents. 🚀☆145Updated 11 months ago
- DIffbot LLM Inference Server☆228Updated 5 months ago
- Agent computer interface for AI software engineer.☆115Updated last month
- An alignment auditing agent capable of quickly exploring alignment hypothesis☆874Updated this week
- ☆217Updated last week
- MCP-based Agent Deep Evaluation System☆143Updated 4 months ago
- Training setup for Langchain's Open Deep Research☆74Updated 5 months ago
- Research repository on interfacing LLMs with Weaviate APIs. Inspired by the Berkeley Gorilla LLM.☆140Updated 5 months ago