EdinburghNLP / MMLongBenchLinks
The official repo of the paper "MMLongBench Benchmarking Long-Context Vision-Language Models Effectively and Thoroughly"
☆176Updated 2 months ago
Alternatives and similar repositories for MMLongBench
Users that are interested in MMLongBench are comparing it to the libraries listed below
Sorting:
- [ACL 2025 Main] MMBoundary: Advancing MLLM Knowledge Boundary Awareness through Reasoning Step Confidence Calibration☆22Updated 8 months ago
- Jacobi Forcing: Fast and Accurate Diffusion-style Decoding☆154Updated last month
- ☆112Updated 8 months ago
- R1-VL: Learning to Reason with Multimodal Large Language Models via Step-wise Group Relative Policy Optimization☆454Updated last month
- This repository introduce a comprehensive paper list, datasets, methods and tools for memory research.☆349Updated last month
- [AAAI 26'] This is the official pytorch implementation for paper: Filter, Correlate, Compress: Training-Free Token Reduction for MLLM Acc…☆61Updated 2 months ago
- Official repository for the paper "TIIF-Bench: How Does Your T2I Model Follow Your Instructions?".☆159Updated 2 months ago
- Repository for awesome spatial/visual reasoning MLLMs. (focus more on embodied applications)☆72Updated 7 months ago
- The code for the work "Adaptive Sample Scheduling for Direct Preference Optimization" submitted to the NeurIPS 2025 conference will be m…☆41Updated 4 months ago
- ☆51Updated last week
- Pytorch Implementation of "Rethinking Long-tailed Dataset Distillation: A Uni-Level Framework with Unbiased Recovery and Relabeling", AAA…☆26Updated 2 months ago
- 【ICLR 2025 🔥】The code for Consistent In-Context Editing, an approach for tuning language models through contextual distributions, overco…☆48Updated 10 months ago
- ☆169Updated 6 months ago
- 🚀🚀 Efficient implementations of Native Sparse Attention☆1,044Updated 4 months ago
- ☆169Updated 3 weeks ago
- [ICLR 2025] TimeSuite: Improving MLLMs for Long Video Understanding via Grounded Tuning☆90Updated 10 months ago
- A comprehensive collection of process reward models.☆136Updated 4 months ago
- DataFlex is a data-centric training framework that enhances model performance by either selecting the most influential samples, optimizin…☆116Updated this week
- The Personality Illusion: Revealing Dissociation Between Self-Reports & Behavior in LLMs.☆104Updated 3 weeks ago
- [ACL 2025] A Neural-Symbolic Self-Training Framework☆117Updated 8 months ago
- We are dedicated to building a set of open agent skills that deliver superior performance, higher determinism, and greater consistency on…☆144Updated last month
- Official code for paper "SPA-RL: Reinforcing LLM Agent via Stepwise Progress Attribution"☆62Updated 4 months ago
- Extrapolating RLVR to General Domains without Verifiers☆196Updated 5 months ago
- This is the repo for our paper "Mr-Ben: A Comprehensive Meta-Reasoning Benchmark for Large Language Models"☆51Updated last year
- [NeurIPS 2024] Official code for HourVideo: 1-Hour Video Language Understanding☆161Updated 6 months ago
- [NIPS 2025] Chiron-o1: Igniting Multimodal Large Language Models towards Generalizable Medical Reasoning via Mentor-Intern Collaborative …☆71Updated 3 months ago
- [NeurIPS 2025] Implementation for the paper "The Surprising Effectiveness of Negative Reinforcement in LLM Reasoning"☆160Updated 3 months ago
- MAT: Multi-modal Agent Tuning 🔥 ICLR 2025 (Spotlight)☆84Updated last month
- An Arena-style Automated Evaluation Benchmark for Detailed Captioning☆56Updated 8 months ago
- Official implementation of GUI-R1 : A Generalist R1-Style Vision-Language Action Model For GUI Agents☆220Updated 9 months ago