Github action to evaluate AI agent applications using model as the judge, content safety and mathematical metrics.
☆75Mar 13, 2026Updated last month
Alternatives and similar repositories for ai-agent-evals
Users that are interested in ai-agent-evals are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- GitAGU (Git Agent Unblock) - A centralized platform for discovering, configuring, and integrating AI agents into your development workflo…☆26Mar 12, 2026Updated last month
- The LLMAgentOps Toolkit is a repository that provides a foundational structure for building LLM Agent-based applications using the Semant…☆16Apr 1, 2026Updated last week
- SK Multi agentic advanced orchestration example☆15Feb 20, 2026Updated last month
- eShopLite - Semantic Search is a reference .NET application implementing an eCommerce site with Search features using Keyword Search and …☆13Apr 24, 2025Updated 11 months ago
- ReMe: A Personalized Cognitive Training Framework Based on an LLM Voice Chatbot for Research☆18Jul 3, 2025Updated 9 months ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- The Doc Intelligence in-a-Box project leverages Azure AI Document Intelligence to extract data from PDF forms and store the data in a Azu…☆45Mar 27, 2026Updated 2 weeks ago
- ☆42Updated this week
- This lab is a starter for quickly and easily applying SLM/LLM fine-tuning, evaluation, and quantization with torchtune on Azure ML.☆15Updated this week
- End-to-end solution sample for a travel assistant built with the Azure Agent Runtime☆30Apr 2, 2026Updated last week
- An exploration of the capabilities of GPT-5☆36Sep 4, 2025Updated 7 months ago
- VS Code Extension for Copilot Studio☆82Updated this week
- Azure Computer Vision 4 (March 2023 - Florence) workshop in a day☆42May 11, 2023Updated 2 years ago
- ☆12Aug 6, 2020Updated 5 years ago
- Examples of how-to use Azure OpenAI Log Probabilities (LogProbs) feature to enhance Generative AI - Q&A grounding.☆23May 10, 2025Updated 11 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Microsoft AI Value Accelerator☆33Jul 30, 2024Updated last year
- A service for end-to-end (functional) testing of a bot. Programmatically simulate a user’s back-and-forth conversation with a bot, to tes…☆19Feb 12, 2026Updated 2 months ago
- Windows Data and Analytics Shared Code - JSON Processing☆15Jun 12, 2023Updated 2 years ago
- VS Code native module for loading and reading OS policies☆16Jan 13, 2026Updated 3 months ago
- Hyperparameter Tuning for Deep Learning☆16Feb 5, 2020Updated 6 years ago
- ☆28Nov 27, 2025Updated 4 months ago
- Magentic-Marketplace: Simulate Agentic Markets and See How They Evolve☆155Mar 1, 2026Updated last month
- Azure AI Agents Playbook☆33Jan 27, 2026Updated 2 months ago
- ☆74Updated this week
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆20Nov 11, 2025Updated 5 months ago
- ☆43Feb 11, 2026Updated 2 months ago
- Implement GenAIOps using Azure AI Foundry with ease and jumpstart☆26Apr 23, 2025Updated 11 months ago
- A sample OpenAI plugin using ASP.NET Core API☆17Jun 22, 2023Updated 2 years ago
- ☆36Nov 15, 2024Updated last year
- A refactoring benchmark for software engineering agents. [ICLR 2025]☆23Feb 20, 2026Updated last month
- Get the assets and code here, and then follow our Bee Control tutorial to learn more about how to work with Unity, C#, and Visual Studio …☆15Jun 30, 2016Updated 9 years ago
- Activate GenAI with Azure☆23Jan 26, 2026Updated 2 months ago
- LINEBot☆13Apr 7, 2025Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Upgrade a legacy Python project with GitHub Copilot☆19Sep 24, 2025Updated 6 months ago
- A Mixture‑of‑Experts Educational Framework for Adaptive Cybersecurity☆22Feb 8, 2026Updated 2 months ago
- Create an MCP Server for your API using the TypeSpec MCP Server☆46Feb 4, 2026Updated 2 months ago
- Scaling AOAI using APIM, PTUs and TPMs☆114May 17, 2024Updated last year
- MCP server for the windows API.☆22Apr 22, 2025Updated 11 months ago
- Bundle of security analysis scripts for keras tensorflow models☆16Apr 15, 2024Updated last year
- AI-in-One Dashboard Power BI template for comprehensive AI usage analytics☆34Apr 3, 2026Updated last week