Wang-ML-Lab / multimodal-needle-in-a-haystackLinks
[NAACL 2025 Oral] Multimodal Needle in a Haystack (MMNeedle): Benchmarking Long-Context Capability of Multimodal Large Language Models
☆45Updated last month
Alternatives and similar repositories for multimodal-needle-in-a-haystack
Users that are interested in multimodal-needle-in-a-haystack are comparing it to the libraries listed below
Sorting:
- Enhancing Large Vision Language Models with Self-Training on Image Comprehension.☆68Updated last year
- NoisyRollout: Reinforcing Visual Reasoning with Data Augmentation☆69Updated 2 weeks ago
- Evaluation framework for paper "VisualWebBench: How Far Have Multimodal LLMs Evolved in Web Page Understanding and Grounding?"☆57Updated 8 months ago
- Optimizing Anytime Reasoning via Budget Relative Policy Optimization☆38Updated 3 weeks ago
- V1: Toward Multimodal Reasoning by Designing Auxiliary Task☆34Updated 2 months ago
- Implementation and dataset for paper "Can MLLMs Perform Text-to-Image In-Context Learning?"☆39Updated 3 weeks ago
- [ICLR 2025] When Attention Sink Emerges in Language Models: An Empirical View (Spotlight)☆88Updated 8 months ago
- Github repository for "Bring Reason to Vision: Understanding Perception and Reasoning through Model Merging" (ICML 2025)☆60Updated 2 weeks ago
- SFT or RL? An Early Investigation into Training R1-Like Reasoning Large Vision-Language Models☆123Updated 2 months ago
- [ICML 2024] Unveiling and Harnessing Hidden Attention Sinks: Enhancing Large Language Models without Training through Attention Calibrati…☆41Updated 11 months ago
- Official implementation of "Automated Generation of Challenging Multiple-Choice Questions for Vision Language Model Evaluation" (CVPR 202…☆30Updated 3 weeks ago
- ☆78Updated 5 months ago
- A Sober Look at Language Model Reasoning☆74Updated last week
- Official implementation of the paper "MMInA: Benchmarking Multihop Multimodal Internet Agents"☆44Updated 3 months ago
- A Framework for Decoupling and Assessing the Capabilities of VLMs☆44Updated 11 months ago
- [ICML 2024 Oral] Official code repository for MLLM-as-a-Judge.☆70Updated 4 months ago
- ☆50Updated last year
- ☆29Updated 2 months ago
- ☆44Updated 5 months ago
- ☆26Updated 11 months ago
- Official implementation for "MJ-Bench: Is Your Multimodal Reward Model Really a Good Judge for Text-to-Image Generation?"☆44Updated 3 weeks ago
- Code for paper "Unraveling Cross-Modality Knowledge Conflicts in Large Vision-Language Models."☆42Updated 8 months ago
- Multimodal RewardBench☆41Updated 4 months ago
- X-Reasoner: Towards Generalizable Reasoning Across Modalities and Domains☆46Updated last month
- ☆14Updated last month
- Official code of *Virgo: A Preliminary Exploration on Reproducing o1-like MLLM*☆104Updated 3 weeks ago
- ☆42Updated 7 months ago
- ☆15Updated 2 months ago
- ☆100Updated last year
- [ACL2025] Unsolvable Problem Detection: Robust Understanding Evaluation for Large Multimodal Models☆77Updated 3 weeks ago