microsoft / ai-agent-evals
GitHub Action to evaluate AI agent applications using model-as-a-judge, content safety, and mathematical metrics.
83 | May 20, 2026 | Updated 3 weeks ago

Alternatives and similar repositories for ai-agent-evals

Users that are interested in ai-agent-evals are comparing it to the libraries listed below.
