EdinburghNLP / MMLongBenchLinks
The official repo of the paper "MMLongBench Benchmarking Long-Context Vision-Language Models Effectively and Thoroughly"
☆168Updated 3 weeks ago
Alternatives and similar repositories for MMLongBench
Users that are interested in MMLongBench are comparing it to the libraries listed below
Sorting:
- R1-VL: Learning to Reason with Multimodal Large Language Models via Step-wise Group Relative Policy Optimization☆439Updated last month
- Repository for awesome spatial/visual reasoning MLLMs. (focus more on embodied applications)☆72Updated 5 months ago
- This repository introduce a comprehensive paper list, datasets, methods and tools for memory research.☆320Updated 6 months ago
- Official repository for the paper "TIIF-Bench: How Does Your T2I Model Follow Your Instructions?".☆157Updated last month
- ☆169Updated 4 months ago
- 🚀🚀 Efficient implementations of Native Sparse Attention☆1,037Updated 2 months ago
- [AAAI 26'] This is the official pytorch implementation for paper: Filter, Correlate, Compress: Training-Free Token Reduction for MLLM Acc…☆46Updated last month
- 【ICLR 2025 🔥】The code for Consistent In-Context Editing, an approach for tuning language models through contextual distributions, overco…☆46Updated 8 months ago
- [NeurIPS 2024] Official code for HourVideo: 1-Hour Video Language Understanding☆161Updated 5 months ago
- The code for the work "Adaptive Sample Scheduling for Direct Preference Optimization" submitted to the NeurIPS 2025 conference will be m…☆41Updated 2 months ago
- Multi-Reward as Condition for Instruction-Based Image Editing☆57Updated 8 months ago
- Personalized Fragrance Recommendation for Aromatherapy: A Machine Learning Approach Based on Personality Traits and Electrodermal Activit…☆14Updated 7 months ago
- Pytorch Implementation of "Rethinking Long-tailed Dataset Distillation: A Uni-Level Framework with Unbiased Recovery and Relabeling", AAA…☆25Updated 2 weeks ago
- codebase for iccv 2025 paper "One Trajectory, One Token: Grounded Video Tokenization via Panoptic Sub-object Trajectory"☆123Updated 4 months ago
- [ACL' 25] The official code repository for PRMBench: A Fine-grained and Challenging Benchmark for Process-Level Reward Models.☆85Updated 10 months ago
- OmniMamba: Efficient and Unified Multimodal Understanding and Generation via State Space Models☆140Updated 7 months ago
- CausalVLR: A Toolbox and Benchmark for Vision-Language Causal Reasoning (多模态因果推理开源框架)☆1,148Updated 2 months ago
- A lightweight and extensible toolkit for visualizing attention flow in Large Vision-Language Models (LVLMs). It renders token-to-token at…☆107Updated this week
- ☆27Updated 2 months ago
- TempFlow-GRPO (Temporal Flow GRPO), a principled GRPO framework that captures and exploits the temporal structure inherent in flow-based …☆830Updated 3 weeks ago
- Papers list of empathy in LMs: theory, modeling, systems, emotion, evaluation.☆79Updated last week
- Extrapolating RLVR to General Domains without Verifiers☆181Updated 4 months ago
- ☆98Updated 3 weeks ago
- MAT: Multi-modal Agent Tuning 🔥 ICLR 2025 (Spotlight)☆77Updated 5 months ago
- [NIPS 2025] Chiron-o1: Igniting Multimodal Large Language Models towards Generalizable Medical Reasoning via Mentor-Intern Collaborative …☆71Updated last month
- A comprehensive collection of process reward models.☆127Updated 2 months ago
- SWE-Factory: Your Automated Factory for Issue Resolution Training Data and Evaluation Benchmarks☆117Updated 3 weeks ago
- A Business-Driven Real-World Financial Benchmark for Evaluating LLMs☆212Updated 2 weeks ago
- Official Repository of "Learning what reinforcement learning can't"☆70Updated 3 weeks ago
- The official GitHub repository of the paper "Recent advances in large langauge model benchmarks against data contamination: From static t…☆47Updated 3 months ago