☆16Oct 21, 2024Updated last year
Alternatives and similar repositories for WildVision-Bench
Users that are interested in WildVision-Bench are comparing it to the libraries listed below
Sorting:
- [WIP@Oct 13] 质衡-基准测试 (Q-Bench in Chinese),包含中文版【底层视觉问答】和【底层视觉描述】数据集,以及中文提示下的图片质量评价。 We will release Q-Bench in more languages in the futu…☆24Jan 7, 2024Updated 2 years ago
- [ECCV’24] Official repository for "BEAF: Observing Before-AFter Changes to Evaluate Hallucination in Vision-language Models"☆21Mar 26, 2025Updated 11 months ago
- ☆11Nov 5, 2024Updated last year
- A Framework for Decoupling and Assessing the Capabilities of VLMs☆43Jun 28, 2024Updated last year
- Official repo for "GMS-3DQA: Projection-based Grid Mini-patch Sampling for 3D Model Quality Assessment"☆14Mar 10, 2024Updated last year
- Collections of papers and code for employing MLLM for quality assessment tasks.☆13Apr 18, 2024Updated last year
- Official Repo of "MMBench: Is Your Multi-modal Model an All-around Player?"☆287May 22, 2025Updated 9 months ago
- A Comprehensive Benchmark for Robust Multi-image Understanding☆19Sep 4, 2024Updated last year
- Sparkles: Unlocking Chats Across Multiple Images for Multimodal Instruction-Following Models☆45Jun 14, 2024Updated last year
- Official implement of MIA-DPO☆70Jan 23, 2025Updated last year
- [ACL 2024 Findings & ICLR 2024 WS] An Evaluator VLM that is open-source, offers reproducible evaluation, and inexpensive to use. Specific…☆80Sep 13, 2024Updated last year
- Code for Commonsense-T2I Challenge: Can Text-to-Image Generation Models Understand Commonsense? [COLM 2024]☆24Aug 13, 2024Updated last year
- ☆50Oct 29, 2023Updated 2 years ago
- PyTorch code for our paper "Dog-IQA: Standard-guided Zero-shot MLLM for Mix-grain Image Quality Assessment"☆28Oct 7, 2024Updated last year
- [MM 2024 Oral] Refiner for AIGC☆29Jul 29, 2024Updated last year
- [CVPRW 2023] Zoom-VQA: Patches, Frames and Clips Integration for Video Quality Assessment☆32Apr 17, 2023Updated 2 years ago
- Official repo for 'MM-PCQA: Multi-Modal Learning for No-reference Point Cloud Quality Assessment' IJCAI2023☆32Nov 30, 2023Updated 2 years ago
- This repo contains evaluation code for the paper "MileBench: Benchmarking MLLMs in Long Context"☆36Jul 11, 2024Updated last year
- Fast Blind Natural Video Quality (V-BLIINDS)☆27Dec 8, 2024Updated last year
- [IEEE TCSVT2023] A Fine-grained Subjective Perception & Alignment Database for AI Generated Image Quality Assessment☆69Oct 24, 2023Updated 2 years ago
- [NeurIPS 2024] This repo contains evaluation code for the paper "Are We on the Right Way for Evaluating Large Vision-Language Models"☆203Sep 26, 2024Updated last year
- [ACMMM 2025] Benchmarking MLLM Codec Ability☆33Jun 14, 2024Updated last year
- BeHonest: Benchmarking Honesty in Large Language Models☆34Aug 15, 2024Updated last year
- ☆31Dec 19, 2023Updated 2 years ago
- ☆11Mar 11, 2024Updated last year
- This repo contains the code to reproduce figures in my dissertation "Passive Imaging and Characterization of the Subsurface With Distribu…☆10Jun 14, 2018Updated 7 years ago
- Source code and data used in the papers ViQuAE (Lerner et al., SIGIR'22), Multimodal ICT (Lerner et al., ECIR'23) and Cross-modal Retriev…☆38Dec 19, 2024Updated last year
- [IEEE TCSVT'24] Study of Subjective and Objective Naturalness Assessment of AI-Generated Images☆37Feb 9, 2026Updated 3 weeks ago
- Official code for Paper "Mantis: Multi-Image Instruction Tuning" [TMLR 2024 Best Paper]☆239Jan 3, 2026Updated last month
- [SCIS] MULTI-Benchmark: Multimodal Understanding Leaderboard with Text and Images☆44Nov 19, 2025Updated 3 months ago
- TISE: Bag of Metrics for Text-to-Image Synthesis Evaluation (ECCV 2022)☆35Nov 12, 2024Updated last year
- Holistic evaluation of multimodal foundation models☆49Aug 11, 2024Updated last year
- [CVPR 2025 🔥] ALM-Bench is a multilingual multi-modal diverse cultural benchmark for 100 languages across 19 categories. It assesses the…☆47May 26, 2025Updated 9 months ago
- ☆12Jan 11, 2026Updated last month
- [CVPR2024] Learning from Synthetic Human Group Activities☆14Feb 24, 2025Updated last year
- [Advanced Photonics Research, 2021] Control tightly focused fields via manipulating pupil functions☆10Dec 25, 2024Updated last year
- A framework for few-shot evaluation of autoregressive language models.☆12Jul 14, 2025Updated 7 months ago
- The sparse Bayesian learning sandbox☆11Jul 4, 2021Updated 4 years ago
- DOMAINEVAL is an auto-constructed benchmark for multi-domain code generation that consists of 2k+ subjects (i.e., description, reference …☆14Dec 12, 2024Updated last year