MLLM-Bench: Evaluating Multimodal LLMs with Per-sample Criteria
☆74Oct 16, 2024Updated last year
Alternatives and similar repositories for MLLM-Bench
Users that are interested in MLLM-Bench are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- This repo contains evaluation code for the paper "MileBench: Benchmarking MLLMs in Long Context"☆36Jul 11, 2024Updated last year
- Multilingual Medicine: Model, Dataset, Benchmark, Code☆199Oct 15, 2024Updated last year
- [ICML2024] Repo for the paper `Evaluating and Analyzing Relationship Hallucinations in Large Vision-Language Models'