GaryStack/MMR-V

Official repository of the video reasoning benchmark MMR-V. Can Your MLLMs "Think with Video"? [ICLR26]

☆40

Alternatives and similar repositories for MMR-V

Users interested in MMR-V are comparing it to the repositories listed below.

GaryStack / Trustworthy-Evaluation
Repository of paper "Establishing Trustworthy LLM Evaluation via Shortcut Neuron Analysis" (ACL 2025 Main)
☆19 · Jul 19, 2025 · Updated last year
HongbangYuan / OmniReward
☆47 · Dec 16, 2025 · Updated 7 months ago
jinzhuoran / RAG-RewardBench
RAG-RewardBench: Benchmarking Reward Models in Retrieval Augmented Generation for Preference Alignment
☆18 · Dec 19, 2024 · Updated last year
THU-KEG / DICE
DICE: Detecting In-distribution Data Contamination with LLM's Internal State
☆12 · Sep 21, 2024 · Updated last year
THU-KEG / DeepPrune
🌿 DeepPrune: Parallel Scaling without Inter-trace Redundancy
☆21 · Apr 20, 2026 · Updated 3 months ago
jinzhuoran / MiNer
A Good Neighbor, A Found Treasure: Mining Treasured Neighbors for Knowledge Graph Entity Typing. EMNLP 2022
☆11 · Feb 1, 2023 · Updated 3 years ago
jinzhuoran / CogKGE
CogKGE: A Knowledge Graph Embedding Toolkit and Benchmark for Representing Multi-source and Heterogeneous Knowledge. ACL 2022
☆59 · Jun 5, 2022 · Updated 4 years ago
CogNLP / CogKTR
CogKTR: A Knowledge-Enhanced Text Representation Toolkit for Natural Language Understanding. EMNLP 2022
☆32 · Oct 14, 2022 · Updated 3 years ago
llyx97 / video_reason_bench
[ICLR 2026] "VideoReasonBench: Can MLLMs Perform Vision-Centric Complex Video Reasoning?", Yuanxin Liu, Kun Ouyang, Haoning Wu, Yi Liu, L…
☆41 · Jan 30, 2026 · Updated 5 months ago
DA-Open / DV-World
[ICML 2026] DV-World: Benchmarking Data Visualization Agents in Real-World Scenarios
☆69 · Apr 29, 2026 · Updated 2 months ago
CSHaitao / LegalAgentBench
The official repo for our paper: LegalAgentBench: Evaluating LLM Agents in Legal Domain
☆49 · Apr 10, 2026 · Updated 3 months ago
Xnhyacinth / NesyCD
[AAAI 2025] Neural-Symbolic Collaborative Distillation: Advancing Small Language Models for Complex Reasoning Tasks
☆12 · Jun 19, 2025 · Updated last year
Trae1ounG / Pretrain_Space_RLVR
[arxiv: 2604.14142] From P(y|x) to P(y): Investigating Reinforcement Learning in Pre-train Space
☆17 · Apr 16, 2026 · Updated 3 months ago
zhaoxlpku / PromptCoT
☆17 · Apr 10, 2025 · Updated last year
ModalMinds / gym-v
A unified framework for vision-language environments with Gymnasium-compatible interface
☆35 · Mar 17, 2026 · Updated 4 months ago
chuzhumin98 / PRE
A general framework for evaluating the performance of large language models (LLMs) based on a peer-review mechanism among LLMs
☆19 · Aug 3, 2024 · Updated last year
daeunni / Video-Skill-CoT
Code for "Skill-based Chain-of-Thoughts for Domain-Adaptive Video Reasoning [EMNLP 2025 Findings]"
☆18 · Aug 27, 2025 · Updated 10 months ago
refkxh / DUSA
[ACM MM 2023] Official implementation of DUSA: Decoupled Unsupervised Sim2Real Adaptation for Vehicle-to-Everything Collaborative Percept…
☆12 · Nov 17, 2023 · Updated 2 years ago
fansunqi / AKeyS
Agentic Keyframe Search for Video Question Answering
☆18 · Jun 30, 2026 · Updated 3 weeks ago
Fu-Fu-Fu-Fu / VideoKR
[ICML 26 Spotlight] Code for paper "VideoKR: Towards Knowledge- and Reasoning-Intensive Video Understanding"
☆19 · Jun 5, 2026 · Updated last month
jinzhuoran / CogIE
CogIE: An Information Extraction Toolkit for Bridging Text and CogNet. ACL 2021
☆71 · Aug 27, 2022 · Updated 3 years ago
holarissun / RewardModelingBeyondBradleyTerry
Official implementation of ICLR'2025 paper: Rethinking Bradley-Terry Models in Preference-based Reward Modeling: Foundations, Theory, and…
☆73 · Apr 2, 2025 · Updated last year
hzy312 / knowledge-r1
IKEA: Reinforced Internal-External Knowledge Synergistic Reasoning for Efficient Adaptive Search Agent
☆70 · May 13, 2025 · Updated last year
codephage2020 / slock-desktop
Slock workspace client for macOS.
☆27 · May 11, 2026 · Updated 2 months ago
JianyuanZhong / StableDRL
☆15 · Updated this week
zeyuanyin / LTH-Backdoor
[Preprint] Backdoor Attacks on Federated Learning with Lottery Ticket Hypothesis
☆10 · Sep 23, 2021 · Updated 4 years ago
chenlong-clock / RULE-Unlearn
[NeurIPS25] RULE: Reinforcement UnLEarning Achieves Forget-retain Pareto Optimality
☆20 · Oct 22, 2025 · Updated 8 months ago
Timothyxxx / TestTimeTrainingPapers
☆59 · Apr 13, 2026 · Updated 3 months ago
GAIR-NLP / OctoThinker
Revisiting Mid-training in the Era of Reinforcement Learning Scaling
☆189 · Jul 23, 2025 · Updated 11 months ago
Starsshine21 / RL100
☆21 · Jun 22, 2026 · Updated 3 weeks ago
longvideobench / LongVideoBench
[NeurIPS'24 D&B] Official Dataloader and Evaluation Scripts for LongVideoBench.
☆133 · Jul 27, 2024 · Updated last year
MoonshotAI / Kimi-Researcher
☆80 · Jun 20, 2025 · Updated last year
devopsdymyr / Evo-Memory
Implementation of Evo-Memory style learning for LLM agents. Agents learn from outcomes, refine strategies, and get smarter with every tas…
☆48 · Dec 3, 2025 · Updated 7 months ago
yunfeixie233 / ViGaL
☆70 · Feb 4, 2026 · Updated 5 months ago
MisterBrookT / IGenBench
[ACL 2026] A benchmark for evaluating the reliability of text-to-infographic generation with curated test cases and automated question-ba…
☆15 · Jun 8, 2026 · Updated last month
hshjerry / VideoEspresso
[CVPR 2025 Oral] VideoEspresso: A Large-Scale Chain-of-Thought Dataset for Fine-Grained Video Reasoning via Core Frame Selection
☆140 · Jul 28, 2025 · Updated 11 months ago
interactivebench / InteractiveBench
Official Project Page for Interactive Benchmarks
☆31 · May 12, 2026 · Updated 2 months ago
refkxh / BiCo
[CVPR 2026 Highlight] Official implementation of BiCo: Composing Concepts from Images and Videos via Concept-prompt Binding
☆85 · May 31, 2026 · Updated last month
CLR-Lab / SimKO
SimKO: Simple Pass@K Policy Optimization
☆31 · Oct 24, 2025 · Updated 8 months ago