Q-Future / A-BenchView on GitHub
[ICLR 2025] What do we expect from LMMs as AIGI evaluators and how do they perform?
138Feb 3, 2025Updated last year

Alternatives and similar repositories for A-Bench

Users that are interested in A-Bench are comparing it to the libraries listed below

Sorting:

Are these results useful?