EdinburghNLP / MMLongBenchLinks
The official repo of the paper "MMLongBench Benchmarking Long-Context Vision-Language Models Effectively and Thoroughly"
☆155Updated 3 months ago
Alternatives and similar repositories for MMLongBench
Users that are interested in MMLongBench are comparing it to the libraries listed below
Sorting:
- R1-VL: Learning to Reason with Multimodal Large Language Models via Step-wise Group Relative Policy Optimization☆420Updated 2 months ago
- Repository for awesome spatial/visual reasoning MLLMs. (focus more on embodied applications)☆67Updated 2 months ago
- ☆160Updated 3 weeks ago
- Official repository for the paper "TIIF-Bench: How Does Your T2I Model Follow Your Instructions?".☆148Updated last week
- This repository introduce a comprehensive paper list, datasets, methods and tools for memory research.☆270Updated 3 months ago
- Easy Data Preparation with latest LLMs-based Operators and Pipelines.☆1,189Updated this week
- [NeurIPS 2024] Official code for HourVideo: 1-Hour Video Language Understanding☆153Updated last month
- verl-agent is an extension of veRL, designed for training LLM/VLM agents via RL. verl-agent is also the official code for paper "Group-in…☆824Updated this week
- 🚀🚀 Efficient implementations of Native Sparse Attention☆900Updated this week
- OmniMamba: Efficient and Unified Multimodal Understanding and Generation via State Space Models☆137Updated 4 months ago
- Extrapolating RLVR to General Domains without Verifiers☆151Updated 3 weeks ago
- 【ICLR 2025 🔥】The code for Consistent In-Context Editing, an approach for tuning language models through contextual distributions, overco…☆45Updated 5 months ago
- [ACL' 25] The official code repository for PRMBench: A Fine-grained and Challenging Benchmark for Process-Level Reward Models.☆81Updated 6 months ago
- ☆49Updated 3 months ago
- my commonly-used tools☆61Updated 7 months ago
- [ICLR2025 Spotlight] Agent Trajectory Synthesis via Guiding Replay with Web Tutorials☆40Updated 6 months ago
- ☆262Updated 2 months ago
- TempFlow-GRPO (Temporal Flow GRPO), a principled GRPO framework that captures and exploits the temporal structure inherent in flow-based …☆335Updated 2 weeks ago
- Official implementation of GUI-R1 : A Generalist R1-Style Vision-Language Action Model For GUI Agents☆172Updated 4 months ago
- This is the repo for our paper "Mr-Ben: A Comprehensive Meta-Reasoning Benchmark for Large Language Models"☆50Updated 10 months ago
- A version of verl to support tool use☆352Updated this week
- ☆109Updated 3 months ago
- Implementation for the paper "The Surprising Effectiveness of Negative Reinforcement in LLM Reasoning"☆100Updated last week
- ☆328Updated last month
- Chiron-o1: Enhancing Step-by-Step and Verifiable Medical Reasoning in MLLMs☆61Updated last month
- MAT: Multi-modal Agent Tuning 🔥 ICLR 2025 (Spotlight)☆57Updated 2 months ago
- Native-resolution diffusion Transformer☆281Updated 3 months ago
- Official Repository of "Learning what reinforcement learning can't"☆64Updated last week
- A comprehensive collection of process reward models.☆107Updated last month
- A Framework for LLM-based Multi-Agent Reinforced Training and Inference☆224Updated 2 weeks ago