Aurora-slz/MM-Verify

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Aurora-slz/MM-Verify)

Aurora-slz / MM-Verify

☆18

Alternatives and similar repositories for MM-Verify

Users that are interested in MM-Verify are comparing it to the libraries listed below

Sorting:

Vision-CAIR / Infinibench
View on GitHub
Official InfiniBench: A Benchmark for Large Multi-Modal Models in Long-Form Movies and TV Shows
☆19Nov 4, 2025Updated 4 months ago
THU-KEG / ReaRAG
View on GitHub
ReaRAG: Knowledge-guided Reasoning Enhances Factuality of Large Reasoning Models with Iterative Retrieval Augmented Generation
☆25Aug 24, 2025Updated 6 months ago
ArminAzizi98 / LaMDA
View on GitHub
☆15Nov 7, 2024Updated last year
InternScience / TrustGeoGen
View on GitHub
Official repository for "TrustGeoGen: Formal-Verified Data Engine for Trustworthy Multi-modal Geometric Problem Solving"
☆23Sep 1, 2025Updated 6 months ago
wenhuang2000 / VHTest
View on GitHub
VHTest
☆15Oct 31, 2024Updated last year
MME-Benchmarks / MME-Unify
View on GitHub
✨✨ [ICLR 2026] MME-Unify: A Comprehensive Benchmark for Unified Multimodal Understanding and Generation Models
☆43Apr 10, 2025Updated 10 months ago
qunzhongwang / vr-thinker
View on GitHub
☆42Oct 20, 2025Updated 4 months ago
Rh-Dang / ECBench
View on GitHub
A Holistic Embodied Cognition Benchmark
☆18Apr 3, 2025Updated 11 months ago
Aurora-slz / Synth-Empathy
View on GitHub
Synth-Empathy: Towards High-Quality Synthetic Empathy Data
☆18Feb 28, 2025Updated last year
Mind4Compiler / Compiler-R1
View on GitHub
Compiler-R1: Towards Agentic Compiler Auto-tuning with Reinforcement Learning
☆28Jul 14, 2025Updated 7 months ago
archiki / RepARe
View on GitHub
☆21Oct 10, 2023Updated 2 years ago
daeunni / VideoRepair
View on GitHub
Code for "VideoRepair: Improving Text-to-Video Generation via Misalignment Evaluation and Localized Refinement"
☆52Dec 5, 2024Updated last year
DualityRL / multi-attempt
View on GitHub
☆19Mar 10, 2025Updated 11 months ago
uservan / ThinkPO
View on GitHub
☆17Aug 1, 2025Updated 7 months ago
facebookresearch / multimodal_rewardbench
View on GitHub
Multimodal RewardBench
☆62Feb 21, 2025Updated last year
CYWang735 / AdaTooler-V
View on GitHub
☆58Feb 27, 2026Updated last week
dongxiangjue / Awesome-LLM-Self-Improvement
View on GitHub
A curated list of awesome LLM Inference-Time Self-Improvement (ITSI, pronounced "itsy") papers from our recent survey: A Survey on Large …
☆102Dec 24, 2024Updated last year
naver-ai / clip4dm
View on GitHub
Official PyTorch implementation of Extract Free Dense Misalignment from CLIP (AAAI'25)
☆25Apr 20, 2025Updated 10 months ago
inclusionAI / M2-Reasoning
View on GitHub
M2-Reasoning: Empowering MLLMs with Unified General and Spatial Reasoning
☆46Jul 17, 2025Updated 7 months ago
aeroplanepaper / GRPO-LEAD
View on GitHub
☆34Nov 18, 2025Updated 3 months ago
qirui-chen / MultiHop-EgoQA
View on GitHub
[AAAI 2025] Grounded Multi-Hop VideoQA in Long-Form Egocentric Videos
☆33May 27, 2025Updated 9 months ago
LLMSQL / llmsql-benchmark
View on GitHub
A Text2SQL benchmark for evaluation of Large Language Models
☆41Updated this week
haoningwu3639 / MRGen
View on GitHub
[ICCV 2025] MRGen: Segmentation Data Engine for Underrepresented MRI Modalities
☆38Sep 26, 2025Updated 5 months ago
Yarayx / livelongbench
View on GitHub
The first spoken long-text dataset derived from live streams, designed to reflect the redundancy-rich and conversational nature of real-w…
☆12Jun 28, 2025Updated 8 months ago
V-STaR-Bench / V-STaR
View on GitHub
Benchmarking Video-LLMs on Video Spatio-Temporal Reasoning
☆41Aug 4, 2025Updated 7 months ago
Lillianwei-h / MMIE
View on GitHub
[ICLR'25 Oral] MMIE: Massive Multimodal Interleaved Comprehension Benchmark for Large Vision-Language Models
☆35Nov 3, 2024Updated last year
Hon-Wong / ByteVideoLLM
View on GitHub
[ICCV 2025] Dynamic-VLM
☆28Dec 16, 2024Updated last year
Saehyung-Lee / PlugIR
View on GitHub
Official repository of "Interactive Text-to-Image Retrieval with Large Language Models: A Plug-and-Play Approach" (ACL 2024 Oral)
☆34Mar 24, 2025Updated 11 months ago
qiujihao19 / Artemis
View on GitHub
[NeurIPS 2024] Artemis: Towards Referential Understanding in Complex Videos
☆27Apr 8, 2025Updated 10 months ago
OpenMOSS / LongLLaDA
View on GitHub
[AAAI26] LongLLaDA: Unlocking Long Context Capabilities in Diffusion LLMs
☆52Dec 7, 2025Updated 3 months ago
wln20 / CSKV
View on GitHub
[NeurIPS ENLSP Workshop'24] CSKV: Training-Efficient Channel Shrinking for KV Cache in Long-Context Scenarios
☆16Oct 18, 2024Updated last year
LHRLAB / KBQA-o1
View on GitHub
[ICML 2025] Official resources of "KBQA-o1: Agentic Knowledge Base Question Answering with Monte Carlo Tree Search".
☆35Dec 6, 2025Updated 3 months ago
IVUL-KAUST / VideoAuto-R1
View on GitHub
[CVPR2026] VideoAuto-R1: Video Auto Reasoning via Thinking Once, Answering Twice
☆65Feb 27, 2026Updated last week
marinero4972 / CyberV
View on GitHub
☆18Jun 10, 2025Updated 8 months ago
techmonsterwang / iLLaMA
View on GitHub
Adapting LLaMA Decoder to Vision Transformer
☆30May 20, 2024Updated last year
yuhui-zh15 / C3
View on GitHub
Official implementation of "Connect, Collapse, Corrupt: Learning Cross-Modal Tasks with Uni-Modal Data" (ICLR 2024)
☆34Oct 16, 2024Updated last year
deepglint / RealSyn
View on GitHub
[ACM MM2025] The official repository for the RealSyn dataset
☆40Dec 14, 2025Updated 2 months ago
uw-nsl / TinyV
View on GitHub
Your efficient and accurate answer verification system for RL training.
☆41Jun 23, 2025Updated 8 months ago
AlignGPT-VL / AlignGPT
View on GitHub
Official repo for "AlignGPT: Multi-modal Large Language Models with Adaptive Alignment Capability"
☆34Jul 12, 2024Updated last year