DaoyuanLi2816 / pairjudgeView on GitHub
Pairwise LLM judges (A/B/tie): budget-aware multi-turn packing, position-bias correction, pseudo-label distillation. Generalized from the 4th-place (gold) solution to Kaggle LMSYS Chatbot Arena.
169Jun 10, 2026Updated this week

Alternatives and similar repositories for pairjudge

Users that are interested in pairjudge are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Are these results useful?