HumanEval-V / HumanEval-V-Benchmark

A Lightweight Visual Reasoning Benchmark for Evaluating Large Multimodal Models through Complex Diagrams in Coding Tasks
6 · Updated 3 weeks ago

Alternatives and similar repositories for HumanEval-V-Benchmark:

Users who are interested in HumanEval-V-Benchmark are comparing it to the repositories listed below.