microsoft / ai-agent-evalsView on GitHub
Github action to evaluate AI agent applications using model as the judge, content safety and mathematical metrics.
65Jan 16, 2026Updated last month

Alternatives and similar repositories for ai-agent-evals

Users that are interested in ai-agent-evals are comparing it to the libraries listed below

Sorting:

Are these results useful?