InternScience/MME-Reasoning

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/InternScience/MME-Reasoning)

InternScience / MME-Reasoning

Official Repository: A Comprehensive Benchmark for Logical Reasoning in MLLMs

☆45

Alternatives and similar repositories for MME-Reasoning

Users that are interested in MME-Reasoning are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

InternScience / Dolphin
View on GitHub
(ACL-2025 main conference) Dolphin: Moving Towards Closed-loop Auto-research through Thinking, Practice, and Feedback
☆44Jun 24, 2025Updated last year
InternScience / Chimera
View on GitHub
(ICCV-2025 Official Code)) Improving Generalist Model with Domain-Specific Experts
☆87Oct 29, 2025Updated 8 months ago
ch3cook-fdu / Vote2Cap-DETR
View on GitHub
[T-PAMI 2024] & [CVPR 2023] Vote2Cap-DETR; A set-to-set perspective towards 3D Dense Captioning; State-of-the-Art 3D Dense Captioning met…
☆104Aug 17, 2024Updated last year
HankYe / Once-for-Both
View on GitHub
[CVPR'24] Once for Both: Single Stage of Importance and Sparsity Search for Vision Transformer Compression
☆15Jul 1, 2024Updated 2 years ago
InternScience / OmniCaptioner
View on GitHub
Official Repository of OmniCaptioner
☆168Apr 23, 2025Updated last year
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
real-absolute-AI / NoisyRollout
View on GitHub
[NeurIPS 2025] NoisyRollout: Reinforcing Visual Reasoning with Data Augmentation
☆112Sep 18, 2025Updated 10 months ago
Hui-design / R1-Video-fixbug
View on GitHub
[Blog 1] Recording a bug of grpo_trainer in some R1 projects
☆23Feb 23, 2025Updated last year
InternScience / AdaptiveDiffusion
View on GitHub
[NeurIPS'24] Training-Free Adaptive Diffusion with Bounded Difference Approximation Strategy
☆73Jan 22, 2025Updated last year
Sunshine-Ye / Beta-DARTS
View on GitHub
official implementation of β-DARTS: Beta-Decay Regularization for Differentiable Architecture Search (CVPR22 oral).
☆86Mar 29, 2022Updated 4 years ago
AV-Odyssey / AV-Odyssey
View on GitHub
This repo contains evaluation code for the paper "AV-Odyssey: Can Your Multimodal LLMs Really Understand Audio-Visual Information?"
☆31Dec 23, 2024Updated last year
kxfan2002 / Reagent
View on GitHub
Agent-RRM: Exploring Reasoning Reward Model for Agents
☆70Mar 17, 2026Updated 4 months ago
InternScience / TrustGeoGen
View on GitHub
Official repository for "TrustGeoGen: Formal-Verified Data Engine for Trustworthy Multi-modal Geometric Problem Solving"
☆23Sep 1, 2025Updated 10 months ago
inFaaa / Evolver
View on GitHub
[COLING 2025🔥] Evolver: Chain-of-Evolution Prompting to Boost Large Multimodal Models for Hateful Meme Detection
☆17Jan 21, 2025Updated last year
kxfan2002 / SophiaVL-R1
View on GitHub
SophiaVL-R1: Reinforcing MLLMs Reasoning with Thinking Reward
☆94Aug 8, 2025Updated 11 months ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
Zhengsh123 / FREE-Merging
View on GitHub
The implementation for FREE-Merging: Fourier Transform for Model Merging with Lightweight Experts (ICCV25)
☆16Jun 26, 2025Updated last year
FAVOR-Bench / FAVOR-Bench
View on GitHub
Accepted By The 39th Annual Conference on Neural Information Processing Systems Datasets and Benchmarks Track
☆25Nov 17, 2025Updated 8 months ago
LeiBAI / Paper-Writing-Rebuttal
View on GitHub
Some thoughts about writing scientific papers
☆23Nov 8, 2024Updated last year
InternScience / Agents-A1
View on GitHub
Scaling the Horizon, Not the Parameters
☆501Updated this week
AIoT-MLSys-Lab / MEDA
View on GitHub
[NAACL 2025🔥] MEDA: Dynamic KV Cache Allocation for Efficient Multimodal Long-Context Inference
☆22Jun 19, 2025Updated last year
Peyton-Chen / RegionE
View on GitHub
[ICLR 2026] The official implementation of "RegionE: Adaptive Region-Aware Generation for Efficient Image Editing"
☆109Feb 3, 2026Updated 5 months ago
SUSTechBruce / LOOK-M
View on GitHub
[EMNLP 2024 Findings🔥] Official implementation of ": LOOK-M: Look-Once Optimization in KV Cache for Efficient Multimodal Long-Context In…
☆103Nov 9, 2024Updated last year
pengts / VW-LMM
View on GitHub
☆25May 13, 2024Updated 2 years ago
liushulinle / UloRL
View on GitHub
An Ultra-Long Output Reinforcement Learning Approach
☆23Jul 31, 2025Updated 11 months ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
mm-vl / ULM-R1
View on GitHub
Co-Reinforcement Learning for Unified Multimodal Understanding and Generation
☆48Jul 22, 2025Updated 11 months ago
luka-group / mDPO
View on GitHub
[EMNLP 2024] mDPO: Conditional Preference Optimization for Multimodal Large Language Models.
☆88Nov 10, 2024Updated last year
TIGER-AI-Lab / VideoEval-Pro
View on GitHub
VideoEval-Pro: Robust and Realistic Long Video Understanding Evaluation [TMLR26]
☆15Jun 1, 2026Updated last month
TIGER-AI-Lab / VL-Rethinker
View on GitHub
The official code of "VL-Rethinker: Incentivizing Self-Reflection of Vision-Language Models with Reinforcement Learning" [NeurIPS25]
☆189Jun 5, 2025Updated last year
InternScience / SimChart9K
View on GitHub
The proposed simulated dataset consisting of 9,536 charts and associated data annotations in CSV format.
☆26Feb 22, 2024Updated 2 years ago
franciszzj / VLPrompt
View on GitHub
[IJCV 2025] VLPrompt-PSG: Vision-Language Prompting for Panoptic Scene Graph Generation
☆28Sep 24, 2024Updated last year
HankYe / KVCOMM
View on GitHub
[NeurIPS'25] KVCOMM: Online Cross-context KV-cache Communication for Efficient LLM-based Multi-agent Systems
☆17Nov 1, 2025Updated 8 months ago
Peyton-Chen / Sparse-vDiT
View on GitHub
The official implementation of "Sparse-vDiT: Unleashing the Power of Sparse Attention to Accelerate Video Diffusion Transformers" (arXiv …
☆52Jun 6, 2025Updated last year
Line-Kite / GraphLayoutLM
View on GitHub
☆14Sep 6, 2024Updated last year
End-to-end encrypted cloud storage - Proton Drive • Ad
Special offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
MBZUAI / AI4Bio-Reading-List
View on GitHub
Must-read papers on AI for Biology
☆26Oct 4, 2023Updated 2 years ago
jylins / hourllava
View on GitHub
[NeurIPS 2025 Spotlight] Unleashing Hour-Scale Video Training for Long Video-Language Understanding
☆19Jun 24, 2025Updated last year
JierunChen / SFT-RL-SynergyDilemma
View on GitHub
☆15Jan 14, 2026Updated 6 months ago
Frostlinx / SearchEyes
View on GitHub
SearchEyes: Towards Frontier Multimodal Deep Search Intelligence via Search World Simulation. A typed knowledge graph unifies data synthe…
☆20Jul 8, 2026Updated last week
InternScience / SciEvalKit
View on GitHub
A unified evaluation toolkit and leaderboard for rigorously assessing the scientific intelligence of large language and vision–language m…
☆85Jun 17, 2026Updated last month
TencentARC / Video-Holmes
View on GitHub
[ECCV 2026] Video-Holmes: Can MLLM Think Like Holmes for Complex Video Reasoning?
☆95Jul 13, 2025Updated last year
bigai-nlco / RuleReasoner
View on GitHub
[ICLR 2026] RuleReasoner: Reinforced Rule-based Reasoning via Domain-aware Dynamic Sampling
☆39Feb 25, 2026Updated 4 months ago