HumanEval-V / HumanEval-V-Benchmark

A Lightweight Visual Reasoning Benchmark for Evaluating Large Multimodal Models through Complex Diagrams in Coding Tasks
12Updated 2 months ago

Alternatives and similar repositories for HumanEval-V-Benchmark

Users that are interested in HumanEval-V-Benchmark are comparing it to the libraries listed below

Sorting: