microsoft / ai-agent-evalsView on GitHub
Github action to evaluate AI agent applications using model as the judge, content safety and mathematical metrics.
70Mar 13, 2026Updated last week

Alternatives and similar repositories for ai-agent-evals

Users that are interested in ai-agent-evals are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Are these results useful?