Assessing Context-Aware Creative Intelligence in MLLMs
☆23Jul 22, 2025Updated 8 months ago
Alternatives and similar repositories for Creation-MMBench
Users that are interested in Creation-MMBench are comparing it to the libraries listed below.
- OPT-BENCH: Evaluating LLM Agent on Large-Scale Search Spaces Optimization Problems☆120Jul 13, 2025Updated 8 months ago
- A Framework for Decoupling and Assessing the Capabilities of VLMs☆43Jun 28, 2024Updated last year
- LMM for VQA, tcsvt version☆10Jul 19, 2024Updated last year
- [EMNLP 2024 Findings] ProSA: Assessing and Understanding the Prompt Sensitivity of LLMs☆29May 22, 2025Updated 10 months ago
- The official implementation of "Ada-LEval: Evaluating long-context LLMs with length-adaptable benchmarks"☆56May 22, 2025Updated 10 months ago
- [ICCV 2025] MM-IFEngine: Towards Multimodal Instruction Following☆119Feb 13, 2026Updated last month
- [ACL 2024 Findings] MathBench: A Comprehensive Multi-Level Difficulty Mathematics Evaluation Dataset☆112May 22, 2025Updated 10 months ago
- 2023 Tongji University Operating Systems course☆11Jun 28, 2023Updated 2 years ago
- [NIPS 2025 DB Oral] Official Repository of paper: Envisioning Beyond the Pixels: Benchmarking Reasoning-Informed Visual Editing☆142Mar 6, 2026Updated 2 weeks ago
- This is the official repository for "Can GPTs Evaluate Graphic Design Based on Design Principles?".☆13Feb 10, 2025Updated last year
- Repo for "Q-Eval-100K: Evaluating Visual Quality and Alignment Level for Text-to-Vision Content"☆41Jun 9, 2025Updated 9 months ago
- Official implementation of "Art-Free Generative Models: Art Creation Without Graphic Art Knowledge"☆32Nov 30, 2025Updated 3 months ago
- Corpus to accompany: "Selective Vision is the Challenge for Visual Reasoning: A Benchmark for Visual Argument Understanding"☆11Apr 11, 2025Updated 11 months ago
- [NeurIPS'24 Spotlight] GAIA: Rethinking Action Quality Assessment for AI-Generated Videos☆39Apr 1, 2025Updated 11 months ago
- Benchmarking Multi-Step Spatial Reasoning in MLLMs with LEGO-based VQA & generation tasks.☆36Jun 20, 2025Updated 9 months ago
- [CVPR 2026🔥] Enhancing Spatial Understanding in Image Generation via Reward Modeling☆78Mar 2, 2026Updated 3 weeks ago
- ☆11Nov 5, 2024Updated last year
- ☆13Feb 7, 2023Updated 3 years ago
- [ICLR2026] The official repository for the CodeGym project: "Generalizable End-to-End Tool-Use RL with Synthetic CodeGym"☆26Oct 14, 2025Updated 5 months ago
- [CVPR 2026] Official Code for "ARM-Thinker: Reinforcing Multimodal Generative Reward Models with Agentic Tool Use and Visual Reasoning"☆85Feb 13, 2026Updated last month
- [AAAI 2026] Official Code for VQAThinker: Exploring Generalizable and Explainable Video Quality Assessment via Reinforcement Learning☆25Nov 28, 2025Updated 3 months ago
- The All-in-one Judge Models introduced by Opencompass☆116Jul 15, 2025Updated 8 months ago
- ☆15Mar 18, 2025Updated last year
- Backup repo for "MD-VQA: Multi-Dimensional Quality Assessment for UGC Live Videos"☆14Feb 16, 2024Updated 2 years ago
- Teaching LMMs for Image Quality Scoring and Interpreting☆96Mar 25, 2025Updated 11 months ago
- Collections of papers and code for employing MLLM for quality assessment tasks.☆13Apr 18, 2024Updated last year
- [ICLR 2026] MMSI-Bench: A Benchmark for Multi-Image Spatial Intelligence☆80Mar 13, 2026Updated last week
- ☆44Aug 31, 2025Updated 6 months ago
- [TIP 2025] Advancing Zero-Shot Digital Human Quality Assessment through Text-Prompted Evaluation☆10Jul 8, 2023Updated 2 years ago
- ☆17Apr 23, 2025Updated 11 months ago
- This is the official PyTorch implementation of ASAG (ICCV 2023).☆18Sep 9, 2023Updated 2 years ago
- [ICML2025] Official Repo for Paper "Optimizing Temperature for Language Models with Multi-Sample Inference"☆22Feb 16, 2025Updated last year
- (ECCV 2024) Official repository of paper "Bridge Past and Future: Overcoming Information Asymmetry in Incremental Object Detection"☆21Mar 26, 2025Updated 11 months ago
- [TCSVT'24] Official Implementation of 2AFC-LMMs☆12Aug 17, 2024Updated last year
- Code for paper "No-reference Point Cloud Quality Assessment via Domain Adaptation", CVPR2022☆11May 11, 2022Updated 3 years ago
- Official repo for "GMS-3DQA: Projection-based Grid Mini-patch Sampling for 3D Model Quality Assessment"☆14Mar 10, 2024Updated 2 years ago
- A TensorFlow Implementation of GraLSP: Graph Neural Networks with Local Structural Patterns, In AAAI, 2020.☆12Jun 25, 2020Updated 5 years ago
- ☆16Oct 21, 2024Updated last year
- This is a [forked version] for author's debugging. Please jump to https://github.com/QualityAssessment/DOVER for stable version to use.☆14Oct 29, 2023Updated 2 years ago