HumanEval-V / HumanEval-V-Benchmark
A Lightweight Visual Reasoning Benchmark for Evaluating Large Multimodal Models through Complex Diagrams in Coding Tasks
14 · Feb 25, 2025 · Updated last year

Alternatives and similar repositories for HumanEval-V-Benchmark

Users interested in HumanEval-V-Benchmark are comparing it to the repositories listed below.
