Github action to evaluate AI agent applications using model as the judge, content safety and mathematical metrics.
☆83May 20, 2026Updated 3 weeks ago
Alternatives and similar repositories for ai-agent-evals
Users that are interested in ai-agent-evals are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- GitAGU (Git Agent Unblock) - A centralized platform for discovering, configuring, and integrating AI agents into your development workflo…☆30Apr 13, 2026Updated 2 months ago
- The LLMAgentOps Toolkit is a repository that provides a foundational structure for building LLM Agent-based applications using the Semant…☆17Apr 1, 2026Updated 2 months ago
- Tayra is a sophisticated call center analytics platform designed to systematically evaluate and score call center audio interactions. By …☆14Dec 19, 2025Updated 5 months ago
- Playground for building AI Agents on Azure☆31Mar 31, 2025Updated last year
- SK Multi agentic advanced orchestration example☆15Feb 20, 2026Updated 3 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- eShopLite - Semantic Search is a reference .NET application implementing an eCommerce site with Search features using Keyword Search and …☆13Apr 24, 2025Updated last year
- Implementation of 12 AI agents evaluation techniques☆43Jul 31, 2025Updated 10 months ago
- ☆42Apr 9, 2026Updated 2 months ago
- Evidence-based decision framework for selecting the right Microsoft AI technology (M365 Copilot, Copilot Studio, Azure AI Foundry, Agent …☆57Updated this week
- This lab is a starter for quickly and easily applying SLM/LLM fine-tuning, evaluation, and quantization with torchtune on Azure ML.☆15Updated this week
- End-to-end solution sample for a travel assistant built with the Azure Agent Runtime☆31Apr 2, 2026Updated 2 months ago
- An exploration of the capabilities of GPT-5☆37Sep 4, 2025Updated 9 months ago
- Azure Computer Vision 4 (March 2023 - Florence) workshop in a day☆41May 11, 2023Updated 3 years ago
- VS Code Extension for Copilot Studio☆96Updated this week
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆12Aug 6, 2020Updated 5 years ago
- Examples of how-to use Azure OpenAI Log Probabilities (LogProbs) feature to enhance Generative AI - Q&A grounding.☆23May 10, 2025Updated last year
- Microsoft AI Value Accelerator☆32Jul 30, 2024Updated last year
- A service for end-to-end (functional) testing of a bot. Programmatically simulate a user’s back-and-forth conversation with a bot, to tes…☆18May 24, 2026Updated 3 weeks ago
- Windows Data and Analytics Shared Code - JSON Processing☆15Jun 12, 2023Updated 3 years ago
- VS Code native module for loading and reading OS policies☆16Jan 13, 2026Updated 5 months ago
- ☆26Jun 1, 2026Updated 2 weeks ago
- VS Code extension to preview a theme without installing it☆15May 14, 2026Updated last month
- ☆92Jun 5, 2026Updated last week
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Azure AI Agents Playbook☆33Apr 15, 2026Updated 2 months ago
- ☆30May 15, 2026Updated last month
- ☆20Nov 11, 2025Updated 7 months ago
- Magentic-Marketplace: Simulate Agentic Markets and See How They Evolve☆170Mar 1, 2026Updated 3 months ago
- Implement GenAIOps using Azure AI Foundry with ease and jumpstart☆27Apr 13, 2026Updated 2 months ago
- A sample OpenAI plugin using ASP.NET Core API☆18Jun 22, 2023Updated 2 years ago
- ☆37Nov 15, 2024Updated last year
- A refactoring benchmark for software engineering agents. [ICLR 2025]☆27Feb 20, 2026Updated 3 months ago
- Get the assets and code here, and then follow our Bee Control tutorial to learn more about how to work with Unity, C#, and Visual Studio …☆15Jun 30, 2016Updated 9 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Activate GenAI with Azure☆23Jan 26, 2026Updated 4 months ago
- LINEBot☆13Apr 7, 2025Updated last year
- Upgrade a legacy Python project with GitHub Copilot☆19Sep 24, 2025Updated 8 months ago
- A Mixture‑of‑Experts Educational Framework for Adaptive Cybersecurity☆20Feb 8, 2026Updated 4 months ago
- GitHub Extension to pin actions based on their version☆22Updated this week
- A ruby lib to achieve consensus with Cassandra☆11Feb 28, 2020Updated 6 years ago
- Create an MCP Server for your API using the TypeSpec MCP Server☆49May 27, 2026Updated 2 weeks ago