HumanEval-V / HumanEval-V-Benchmark

A Lightweight Visual Understanding and Reasoning Benchmark for Evaluating Large Multimodal Models through Coding Tasks
13Updated 2 weeks ago

Related projects

Alternatives and complementary repositories for HumanEval-V-Benchmark