MING-ZCH / CII-BenchLinks
[ACL 2025] Can MLLMs Understand the Deep Implication Behind Chinese Images?
☆17Updated last month
Alternatives and similar repositories for CII-Bench
Users that are interested in CII-Bench are comparing it to the libraries listed below
Sorting:
- Official repo for "AlignGPT: Multi-modal Large Language Models with Adaptive Alignment Capability"☆33Updated last year
- ☆55Updated last year
- MultiMath: Bridging Visual and Mathematical Reasoning for Large Language Models☆31Updated 8 months ago
- Official Repo for SvS: A Self-play with Variational Problem Synthesis strategy for RLVR training☆36Updated last month
- Official implementation of "MMNeuron: Discovering Neuron-Level Domain-Specific Interpretation in Multimodal Large Language Model". Our co…☆21Updated 9 months ago
- ☆14Updated 9 months ago
- The codebase for our EMNLP24 paper: Multimodal Self-Instruct: Synthetic Abstract Image and Visual Reasoning Instruction Using Language Mo…☆83Updated 8 months ago
- Code, Data and Model for Paper "Learning from Peers in Reasoning Models"☆26Updated 4 months ago
- [NAACL 2025] Source code for MMEvalPro, a more trustworthy and efficient benchmark for evaluating LMMs☆24Updated last year
- [NeurIPS 2025] Think or Not? Selective Reasoning via Reinforcement Learning for Vision-Language Models☆44Updated this week
- ☆82Updated last year
- ☆19Updated 4 months ago
- \infty-Video: A Training-Free Approach to Long Video Understanding via Continuous-Time Memory Consolidation☆16Updated 7 months ago
- [CVPR 2025] OmniMMI: A Comprehensive Multi-modal Interaction Benchmark in Streaming Video Contexts☆18Updated 5 months ago
- More Thinking, Less Seeing? Assessing Amplified Hallucination in Multimodal Reasoning Models☆56Updated 3 months ago
- This repository contains the code for SFT, RLHF, and DPO, designed for vision-based LLMs, including the LLaVA models and the LLaMA-3.2-vi…☆113Updated 3 months ago
- MM-PRM: Enhancing Multimodal Mathematical Reasoning with Scalable Step-Level Supervision☆25Updated 4 months ago
- An Arena-style Automated Evaluation Benchmark for Detailed Captioning☆55Updated 3 months ago
- Assessing Context-Aware Creative Intelligence in MLLMs☆23Updated 2 months ago
- A Self-Training Framework for Vision-Language Reasoning☆84Updated 8 months ago
- Recent Advances on MLLM's Reasoning Ability☆26Updated 5 months ago
- ☆21Updated 4 months ago
- [ICLR 2025] ChartMimic: Evaluating LMM’s Cross-Modal Reasoning Capability via Chart-to-Code Generation☆123Updated 3 months ago
- ✨✨The Curse of Multi-Modalities (CMM): Evaluating Hallucinations of Large Multimodal Models across Language, Visual, and Audio☆48Updated 2 months ago
- TARS: MinMax Token-Adaptive Preference Strategy for Hallucination Reduction in MLLMs☆21Updated this week
- Official implement of MIA-DPO☆66Updated 8 months ago
- An benchmark for evaluating the capabilities of large vision-language models (LVLMs)☆46Updated last year
- This repository will continuously update the latest papers, technical reports, benchmarks about multimodal reasoning!☆51Updated 6 months ago
- A benchmark for evaluating vision-centric, complex video reasoning.☆33Updated last month
- [Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics]: VisuoThink: Empowering LVLM Reasoning with Mul…☆29Updated 2 months ago