EdinburghNLP / MMLongBenchLinks
The official repo of the paper "MMLongBench Benchmarking Long-Context Vision-Language Models Effectively and Thoroughly"
☆171Updated last month
Alternatives and similar repositories for MMLongBench
Users that are interested in MMLongBench are comparing it to the libraries listed below
Sorting:
- Jacobi Forcing: Fast and Accurate Diffusion-style Decoding☆147Updated this week
- R1-VL: Learning to Reason with Multimodal Large Language Models via Step-wise Group Relative Policy Optimization☆448Updated 3 weeks ago
- ☆111Updated 7 months ago
- Repository for awesome spatial/visual reasoning MLLMs. (focus more on embodied applications)☆72Updated 6 months ago
- Official repository for the paper "TIIF-Bench: How Does Your T2I Model Follow Your Instructions?".☆159Updated last month
- ☆169Updated 5 months ago
- 🚀🚀 Efficient implementations of Native Sparse Attention☆1,044Updated 3 months ago
- Pytorch Implementation of "Rethinking Long-tailed Dataset Distillation: A Uni-Level Framework with Unbiased Recovery and Relabeling", AAA…☆25Updated last month
- This repository introduce a comprehensive paper list, datasets, methods and tools for memory research.☆333Updated last week
- The Personality Illusion: Revealing Dissociation Between Self-Reports & Behavior in LLMs.☆101Updated 2 weeks ago
- 【ICLR 2025 🔥】The code for Consistent In-Context Editing, an approach for tuning language models through contextual distributions, overco…☆47Updated 9 months ago
- The code for the work "Adaptive Sample Scheduling for Direct Preference Optimization" submitted to the NeurIPS 2025 conference will be m…☆41Updated 3 months ago
- [NeurIPS 2024] Official code for HourVideo: 1-Hour Video Language Understanding☆161Updated 5 months ago
- A comprehensive collection of process reward models.☆130Updated 3 months ago
- OmniMamba: Efficient and Unified Multimodal Understanding and Generation via State Space Models☆145Updated 8 months ago
- 🔥 JarvisEvo: Towards a Self-Evolving Photo Editing Agent with Synergistic Editor-Evaluator Optimization☆259Updated last week
- [ACL' 25] The official code repository for PRMBench: A Fine-grained and Challenging Benchmark for Process-Level Reward Models.☆86Updated 10 months ago
- [AAAI 26'] This is the official pytorch implementation for paper: Filter, Correlate, Compress: Training-Free Token Reduction for MLLM Acc…☆46Updated last month
- Game-RL: Synthesizing Multimodal Verifiable Game Data to Boost VLMs' General Reasoning☆127Updated 3 weeks ago
- Multi-Reward as Condition for Instruction-Based Image Editing☆57Updated 9 months ago
- codebase for iccv 2025 paper "One Trajectory, One Token: Grounded Video Tokenization via Panoptic Sub-object Trajectory"☆124Updated 4 months ago
- MAT: Multi-modal Agent Tuning 🔥 ICLR 2025 (Spotlight)☆81Updated 3 weeks ago
- We are dedicated to building a set of open agent skills that deliver superior performance, higher determinism, and greater consistency on…☆68Updated last week
- Extrapolating RLVR to General Domains without Verifiers☆187Updated 4 months ago
- [ICLR 2025] TimeSuite: Improving MLLMs for Long Video Understanding via Grounded Tuning☆68Updated 9 months ago
- TempFlow-GRPO (Temporal Flow GRPO), a principled GRPO framework that captures and exploits the temporal structure inherent in flow-based …☆837Updated last month
- [NIPS 2025] Chiron-o1: Igniting Multimodal Large Language Models towards Generalizable Medical Reasoning via Mentor-Intern Collaborative …☆71Updated 2 months ago
- ☆153Updated 7 months ago
- ☆27Updated 3 months ago
- Description for MV-MATH☆15Updated 5 months ago