Github action to evaluate AI agent applications using model as the judge, content safety and mathematical metrics.
☆81May 20, 2026Updated this week
Alternatives and similar repositories for ai-agent-evals
Users that are interested in ai-agent-evals are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- GitAGU (Git Agent Unblock) - A centralized platform for discovering, configuring, and integrating AI agents into your development workflo…☆30Apr 13, 2026Updated last month
- The LLMAgentOps Toolkit is a repository that provides a foundational structure for building LLM Agent-based applications using the Semant…☆17Apr 1, 2026Updated last month
- Tayra is a sophisticated call center analytics platform designed to systematically evaluate and score call center audio interactions. By …☆14Dec 19, 2025Updated 5 months ago
- Playground for building AI Agents on Azure☆31Mar 31, 2025Updated last year
- SK Multi agentic advanced orchestration example☆15Feb 20, 2026Updated 3 months ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- eShopLite - Semantic Search is a reference .NET application implementing an eCommerce site with Search features using Keyword Search and …☆13Apr 24, 2025Updated last year
- Implementation of 12 AI agents evaluation techniques☆43Jul 31, 2025Updated 9 months ago
- ReMe: A Personalized Cognitive Training Framework Based on an LLM Voice Chatbot for Research☆18Jul 3, 2025Updated 10 months ago
- ☆42Apr 9, 2026Updated last month
- The Doc Intelligence in-a-Box project leverages Azure AI Document Intelligence to extract data from PDF forms and store the data in a Azu…☆47Mar 27, 2026Updated 2 months ago
- Evidence-based decision framework for selecting the right Microsoft AI technology (M365 Copilot, Copilot Studio, Azure AI Foundry, Agent …☆54May 18, 2026Updated last week
- This lab is a starter for quickly and easily applying SLM/LLM fine-tuning, evaluation, and quantization with torchtune on Azure ML.☆15Apr 21, 2026Updated last month
- End-to-end solution sample for a travel assistant built with the Azure Agent Runtime☆31Apr 2, 2026Updated last month
- An exploration of the capabilities of GPT-5☆37Sep 4, 2025Updated 8 months ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Azure Computer Vision 4 (March 2023 - Florence) workshop in a day☆41May 11, 2023Updated 3 years ago
- VS Code Extension for Copilot Studio☆93Updated this week
- ☆12Aug 6, 2020Updated 5 years ago
- A service for end-to-end (functional) testing of a bot. Programmatically simulate a user’s back-and-forth conversation with a bot, to tes…☆18May 18, 2026Updated last week
- Windows Data and Analytics Shared Code - JSON Processing☆15Jun 12, 2023Updated 2 years ago
- Hyperparameter Tuning for Deep Learning☆16Feb 5, 2020Updated 6 years ago
- ☆27Nov 27, 2025Updated 5 months ago
- VS Code extension to preview a theme without installing it☆15May 14, 2026Updated last week
- ☆88Updated this week
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Azure AI Agents Playbook☆33Apr 15, 2026Updated last month
- ☆30May 15, 2026Updated last week
- ☆20Nov 11, 2025Updated 6 months ago
- Magentic-Marketplace: Simulate Agentic Markets and See How They Evolve☆160Mar 1, 2026Updated 2 months ago
- ☆43Feb 11, 2026Updated 3 months ago
- Implement GenAIOps using Azure AI Foundry with ease and jumpstart☆27Apr 13, 2026Updated last month
- A sample OpenAI plugin using ASP.NET Core API☆18Jun 22, 2023Updated 2 years ago
- ☆37Nov 15, 2024Updated last year
- A refactoring benchmark for software engineering agents. [ICLR 2025]☆26Feb 20, 2026Updated 3 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Get the assets and code here, and then follow our Bee Control tutorial to learn more about how to work with Unity, C#, and Visual Studio …☆15Jun 30, 2016Updated 9 years ago
- Activate GenAI with Azure☆23Jan 26, 2026Updated 4 months ago
- Upgrade a legacy Python project with GitHub Copilot☆19Sep 24, 2025Updated 8 months ago
- A ruby lib to achieve consensus with Cassandra☆11Feb 28, 2020Updated 6 years ago
- Create an MCP Server for your API using the TypeSpec MCP Server☆48Apr 29, 2026Updated 3 weeks ago
- Scaling AOAI using APIM, PTUs and TPMs☆114May 17, 2024Updated 2 years ago
- Bundle of security analysis scripts for keras tensorflow models☆16Apr 15, 2024Updated 2 years ago