open-compass / Creation-MMBenchLinks
Assessing Context-Aware Creative Intelligence in MLLMs
☆21Updated 3 weeks ago
Alternatives and similar repositories for Creation-MMBench
Users that are interested in Creation-MMBench are comparing it to the libraries listed below
Sorting:
- Official implement of MIA-DPO☆59Updated 5 months ago
- [ICLR2025] MMIU: Multimodal Multi-image Understanding for Evaluating Large Vision-Language Models☆81Updated 10 months ago
- Think or Not? Selective Reasoning via Reinforcement Learning for Vision-Language Models☆40Updated last month
- A Framework for Decoupling and Assessing the Capabilities of VLMs☆44Updated last year
- official code for "BoostStep: Boosting mathematical capability of Large Language Models via improved single-step reasoning"☆36Updated 5 months ago
- Repo for paper "T2Vid: Translating Long Text into Multi-Image is the Catalyst for Video-LLMs"☆49Updated 4 months ago
- LMM solved catastrophic forgetting, AAAI2025☆44Updated 3 months ago
- Official repository of MMDU dataset☆92Updated 9 months ago
- The codebase for our EMNLP24 paper: Multimodal Self-Instruct: Synthetic Abstract Image and Visual Reasoning Instruction Using Language Mo…☆79Updated 5 months ago
- [NeurIPS 2024] Efficient Large Multi-modal Models via Visual Context Compression☆60Updated 4 months ago
- ☆90Updated 3 weeks ago
- Official implementation of the paper "MMInA: Benchmarking Multihop Multimodal Internet Agents"☆46Updated 4 months ago
- SophiaVL-R1: Reinforcing MLLMs Reasoning with Thinking Reward☆57Updated 3 weeks ago
- Official Repository of paper: Envisioning Beyond the Pixels: Benchmarking Reasoning-Informed Visual Editing☆69Updated last week
- MM-PRM: Enhancing Multimodal Mathematical Reasoning with Scalable Step-Level Supervision☆24Updated last month
- (ICLR2025 Spotlight) DEEM: Official implementation of Diffusion models serve as the eyes of large language models for image perception.☆35Updated 2 weeks ago
- SFT+RL boosts multimodal reasoning☆19Updated 2 weeks ago
- ☆45Updated 6 months ago
- ☆64Updated 3 weeks ago
- Official repo for "PAPO: Perception-Aware Policy Optimization for Multimodal Reasoning"☆60Updated this week
- ☆18Updated 8 months ago
- code for "Strengthening Multimodal Large Language Model with Bootstrapped Preference Optimization"☆55Updated 10 months ago
- PhysGame Benchmark for Physical Commonsense Evaluation in Gameplay Videos☆45Updated 2 weeks ago
- ☆73Updated last year
- [NeurIPS'24] Official PyTorch Implementation of Seeing the Image: Prioritizing Visual Correlation by Contrastive Alignment☆58Updated 9 months ago
- ☆87Updated 3 weeks ago
- A Large-scale Dataset for training and evaluating model's ability on Dense Text Image Generation☆71Updated 4 months ago
- MME-Unify: A Comprehensive Benchmark for Unified Multimodal Understanding and Generation Models☆41Updated 3 months ago
- The official code of "VL-Rethinker: Incentivizing Self-Reflection of Vision-Language Models with Reinforcement Learning"☆124Updated last month
- GPT as a Monte Carlo Language Tree: A Probabilistic Perspective☆45Updated 6 months ago