EdinburghNLP / MMLongBenchLinks
The official repo of the paper "MMLongBench Benchmarking Long-Context Vision-Language Models Effectively and Thoroughly"
☆175Updated 2 months ago
Alternatives and similar repositories for MMLongBench
Users that are interested in MMLongBench are comparing it to the libraries listed below
Sorting:
- ☆112Updated 8 months ago
- Jacobi Forcing: Fast and Accurate Diffusion-style Decoding☆154Updated 3 weeks ago
- R1-VL: Learning to Reason with Multimodal Large Language Models via Step-wise Group Relative Policy Optimization☆452Updated last month
- Repository for awesome spatial/visual reasoning MLLMs. (focus more on embodied applications)☆72Updated 7 months ago
- This repository introduce a comprehensive paper list, datasets, methods and tools for memory research.☆342Updated last month
- Official repository for the paper "TIIF-Bench: How Does Your T2I Model Follow Your Instructions?".☆158Updated 2 months ago
- The code for the work "Adaptive Sample Scheduling for Direct Preference Optimization" submitted to the NeurIPS 2025 conference will be m…☆41Updated 4 months ago
- ☆160Updated 2 weeks ago
- 🚀🚀 Efficient implementations of Native Sparse Attention☆1,044Updated 4 months ago
- ☆169Updated 5 months ago
- 【ICLR 2025 🔥】The code for Consistent In-Context Editing, an approach for tuning language models through contextual distributions, overco…☆46Updated 9 months ago
- [NIPS 2025] Chiron-o1: Igniting Multimodal Large Language Models towards Generalizable Medical Reasoning via Mentor-Intern Collaborative …☆71Updated 3 months ago
- We are dedicated to building a set of open agent skills that deliver superior performance, higher determinism, and greater consistency on…☆124Updated last month
- [NeurIPS 2024] Official code for HourVideo: 1-Hour Video Language Understanding☆161Updated 6 months ago
- [ICLR 2025] TimeSuite: Improving MLLMs for Long Video Understanding via Grounded Tuning☆82Updated 9 months ago
- Pytorch Implementation of "Rethinking Long-tailed Dataset Distillation: A Uni-Level Framework with Unbiased Recovery and Relabeling", AAA…☆26Updated 2 months ago
- [AAAI 26'] This is the official pytorch implementation for paper: Filter, Correlate, Compress: Training-Free Token Reduction for MLLM Acc…☆49Updated 2 months ago
- The Personality Illusion: Revealing Dissociation Between Self-Reports & Behavior in LLMs.☆102Updated 3 weeks ago
- Code, benchmark and environment for "OS-Sentinel: Towards Safety-Enhanced Mobile GUI Agents via Hybrid Validation in Realistic Workflows"☆37Updated 2 months ago
- Extrapolating RLVR to General Domains without Verifiers☆191Updated 5 months ago
- codebase for iccv 2025 paper "One Trajectory, One Token: Grounded Video Tokenization via Panoptic Sub-object Trajectory"☆126Updated 5 months ago
- OmniMamba: Efficient and Unified Multimodal Understanding and Generation via State Space Models☆145Updated 9 months ago
- ☆27Updated 3 months ago
- [ACL' 25] The official code repository for PRMBench: A Fine-grained and Challenging Benchmark for Process-Level Reward Models.☆87Updated 11 months ago
- [ICLR2026] Laser: Learn to Reason Efficiently with Adaptive Length-based Reward Shaping☆62Updated 8 months ago
- ☆23Updated 3 months ago
- [NeurIPS 2025] Implementation for the paper "The Surprising Effectiveness of Negative Reinforcement in LLM Reasoning"☆155Updated 3 months ago
- [NeurIPS 2025 Spotlight] StreamForest: Efficient Online Video Understanding with Persistent Event Memory☆125Updated 2 months ago
- Papers list of empathy in LMs: theory, modeling, systems, emotion, evaluation.☆85Updated 2 weeks ago
- 🔥 JarvisEvo: Towards a Self-Evolving Photo Editing Agent with Synergistic Editor-Evaluator Optimization☆272Updated last week