HumanEval-V / HumanEval-V-Benchmark

A Lightweight Visual Understanding and Reasoning Benchmark for Evaluating Large Multimodal Models through Coding Tasks
14 · Updated this week

Related projects

Alternatives and complementary repositories for HumanEval-V-Benchmark