ibm-ecosystem-engineering / JudgeIt-LLM-as-a-Judge

Automation Framework using LLM-as-a-judge to Scale Eval of Gen AI solutions (RAG, Multi-turn, Query Rewrite, Text2SQL etc.); that is a good proxy for human judgement.
23Updated last month

Alternatives and similar repositories for JudgeIt-LLM-as-a-Judge:

Users that are interested in JudgeIt-LLM-as-a-Judge are comparing it to the libraries listed below