microsoft / agdebuggerLinks
☆62Updated 2 weeks ago
Alternatives and similar repositories for agdebugger
Users that are interested in agdebugger are comparing it to the libraries listed below
Sorting:
- ☆231Updated 4 months ago
- An Automatic Prompt Optimization Framework for Large Language Models☆136Updated 3 months ago
- Ranking LLMs on agentic tasks☆198Updated 2 months ago
- ☆79Updated last month
- Routing on Random Forest (RoRF)☆218Updated last year
- Official Repo for CRMArena and CRMArena-Pro☆121Updated 4 months ago
- A list of AI memory projects☆239Updated 9 months ago
- ☆101Updated last year
- MCP-based Agent Deep Evaluation System☆136Updated last month
- Super basic implementation (gist-like) of RLMs with REPL environments.☆242Updated 3 weeks ago
- A Lightweight Library for AI Observability☆251Updated 8 months ago
- Code for evaluating with Flow-Judge-v0.1 - an open-source, lightweight (3.8B) language model optimized for LLM system evaluations. Crafte…☆78Updated last year
- Research repository on interfacing LLMs with Weaviate APIs. Inspired by the Berkeley Gorilla LLM.☆136Updated 2 months ago
- Simple examples using Argilla tools to build AI☆56Updated 11 months ago
- 🔧 Compare how Agent systems perform on several benchmarks. 📊🚀☆102Updated 3 months ago
- Code to accompany the Universal Deep Research paper (https://arxiv.org/abs/2509.00244)☆447Updated 2 months ago
- Tutorial for building LLM router☆233Updated last year
- frozen-in-time version of our Paper Finder agent for reproducing evaluation results☆196Updated 2 months ago
- Workflows are an event-driven, async-first, step-based way to control the execution flow of AI applications like agents.☆239Updated this week
- A better way of testing, inspecting, and analyzing AI Agent traces.☆40Updated 2 weeks ago
- Official page for ICLR 2025 paper "Sufficient Context: A New Lens on Retrieval Augmented Generation Systems"☆52Updated 3 months ago
- Lean implementation of various multi-agent LLM methods, including Iteration of Thought (IoT)☆121Updated 9 months ago
- Benchmark and optimize LLM inference across frameworks with ease☆129Updated last month
- Official Implementation of "Affordable AI Assistants with Knowledge Graph of Thoughts"☆172Updated last month
- Rank LLMs, RAG systems, and prompts using automated head-to-head evaluation☆105Updated 10 months ago
- A python implementation of priompt - a neat way of managing context from diverse sources for LLM applications.☆112Updated 3 months ago
- Synthetic Data Engine 💎☆67Updated last week
- ☆73Updated last year
- A framework for fine-tuning retrieval-augmented generation (RAG) systems.☆133Updated 2 weeks ago
- Official Repo for The Paper "Talk Structurally, Act Hierarchically: A Collaborative Framework for LLM Multi-Agent Systems"☆57Updated 8 months ago