DaoyuanLi2816 / pairjudge
Pairwise LLM judges (A/B/tie): budget-aware multi-turn packing, position-bias correction, pseudo-label distillation. Generalized from the 4th-place (gold) solution to Kaggle LMSYS Chatbot Arena.
169 · Jun 10, 2026 · Updated 3 weeks ago

Alternatives and similar repositories for pairjudge

Users who are interested in pairjudge are comparing it to the libraries listed below.
