EdinburghNLP / MMLongBench
The official repo of the paper "MMLongBench: Benchmarking Long-Context Vision-Language Models Effectively and Thoroughly"
☆157Updated 3 weeks ago
Alternatives and similar repositories for MMLongBench
Users who are interested in MMLongBench are comparing it to the repositories listed below.
- R1-VL: Learning to Reason with Multimodal Large Language Models via Step-wise Group Relative Policy Optimization☆432Updated last week
- Repository for awesome spatial/visual reasoning MLLMs. (focus more on embodied applications)☆70Updated 4 months ago
- This repository introduces a comprehensive paper list, datasets, methods and tools for memory research.☆309Updated 4 months ago
- ☆165Updated 2 months ago
- Official repository for the paper "TIIF-Bench: How Does Your T2I Model Follow Your Instructions?".☆152Updated 2 months ago
- The code for the work "Adaptive Sample Scheduling for Direct Preference Optimization" submitted to the NeurIPS 2025 conference will be m…☆42Updated last month
- [NeurIPS 2024] Official code for HourVideo: 1-Hour Video Language Understanding☆158Updated 3 months ago
- ☆67Updated 2 weeks ago
- 🚀🚀 Efficient implementations of Native Sparse Attention☆980Updated last month
- OmniMamba: Efficient and Unified Multimodal Understanding and Generation via State Space Models☆139Updated 6 months ago
- Multi-Reward as Condition for Instruction-Based Image Editing☆56Updated 7 months ago
- [NIPS 2025] Chiron-o1: Igniting Multimodal Large Language Models towards Generalizable Medical Reasoning via Mentor-Intern Collaborative …☆66Updated last week
- 【ICLR 2025 🔥】The code for Consistent In-Context Editing, an approach for tuning language models through contextual distributions, overco…☆45Updated 6 months ago
- TempFlow-GRPO (Temporal Flow GRPO), a principled GRPO framework that captures and exploits the temporal structure inherent in flow-based …☆799Updated last week
- Codebase for the ICCV 2025 paper "One Trajectory, One Token: Grounded Video Tokenization via Panoptic Sub-object Trajectory"☆68Updated 2 months ago
- [NeurIPS 2025] Native-resolution diffusion Transformer☆286Updated 2 weeks ago
- MAT: Multi-modal Agent Tuning 🔥 ICLR 2025 (Spotlight)☆67Updated 4 months ago
- CausalVLR: A Toolbox and Benchmark for Vision-Language Causal Reasoning (an open-source framework for multimodal causal reasoning)☆1,142Updated 3 weeks ago
- Easy Data Preparation with the latest LLM-based Operators and Pipelines.☆1,426Updated this week
- ☆27Updated 3 weeks ago
- Personalized Fragrance Recommendation for Aromatherapy: A Machine Learning Approach Based on Personality Traits and Electrodermal Activit…☆14Updated 6 months ago
- Extrapolating RLVR to General Domains without Verifiers☆176Updated 2 months ago
- ☆278Updated 3 months ago
- Laser: Learn to Reason Efficiently with Adaptive Length-based Reward Shaping☆55Updated 5 months ago
- A Business-Driven Real-World Financial Benchmark for Evaluating LLMs☆207Updated last month
- [ACL 2025] A Neural-Symbolic Self-Training Framework☆115Updated 5 months ago
- Official implementation of GUI-R1 : A Generalist R1-Style Vision-Language Action Model For GUI Agents☆195Updated 5 months ago
- [NeurIPS 2025] Scaling Language-centric Omnimodal Representation Learning☆23Updated 2 weeks ago
- This is the repo for our paper "Mr-Ben: A Comprehensive Meta-Reasoning Benchmark for Large Language Models"☆50Updated last year
- [ICLR 2025] ChartMimic: Evaluating LMM’s Cross-Modal Reasoning Capability via Chart-to-Code Generation☆124Updated 4 months ago