Q-Future / A-BenchView on GitHub
[ICLR 2025] What do we expect from LMMs as AIGI evaluators and how do they perform?
139Feb 3, 2025Updated last year

Alternatives and similar repositories for A-Bench

Users that are interested in A-Bench are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Are these results useful?