Aurora-slz / MM-VerifyView external linksLinks
☆18Oct 28, 2025Updated 3 months ago
Alternatives and similar repositories for MM-Verify
Users that are interested in MM-Verify are comparing it to the libraries listed below
Sorting:
- Official InfiniBench: A Benchmark for Large Multi-Modal Models in Long-Form Movies and TV Shows☆19Nov 4, 2025Updated 3 months ago
- ReaRAG: Knowledge-guided Reasoning Enhances Factuality of Large Reasoning Models with Iterative Retrieval Augmented Generation☆25Aug 24, 2025Updated 5 months ago
- ☆15Nov 7, 2024Updated last year
- VHTest☆15Oct 31, 2024Updated last year
- Official repository for "TrustGeoGen: Formal-Verified Data Engine for Trustworthy Multi-modal Geometric Problem Solving"☆23Sep 1, 2025Updated 5 months ago
- ✨✨ [ICLR 2026] MME-Unify: A Comprehensive Benchmark for Unified Multimodal Understanding and Generation Models☆43Apr 10, 2025Updated 10 months ago
- ☆42Oct 20, 2025Updated 3 months ago
- A Holistic Embodied Cognition Benchmark☆18Apr 3, 2025Updated 10 months ago
- Compiler-R1: Towards Agentic Compiler Auto-tuning with Reinforcement Learning☆28Jul 14, 2025Updated 7 months ago
- ☆21Oct 10, 2023Updated 2 years ago
- Code for "VideoRepair: Improving Text-to-Video Generation via Misalignment Evaluation and Localized Refinement"☆52Dec 5, 2024Updated last year
- ☆19Mar 10, 2025Updated 11 months ago
- ☆17Aug 1, 2025Updated 6 months ago
- Multimodal RewardBench☆61Feb 21, 2025Updated 11 months ago
- Official PyTorch implementation of Extract Free Dense Misalignment from CLIP (AAAI'25)☆25Apr 20, 2025Updated 9 months ago
- A curated list of awesome LLM Inference-Time Self-Improvement (ITSI, pronounced "itsy") papers from our recent survey: A Survey on Large …☆102Dec 24, 2024Updated last year
- M2-Reasoning: Empowering MLLMs with Unified General and Spatial Reasoning☆46Jul 17, 2025Updated 6 months ago
- ☆33Nov 18, 2025Updated 2 months ago
- [AAAI 2025] Grounded Multi-Hop VideoQA in Long-Form Egocentric Videos☆32May 27, 2025Updated 8 months ago
- The first spoken long-text dataset derived from live streams, designed to reflect the redundancy-rich and conversational nature of real-w…☆13Jun 28, 2025Updated 7 months ago
- [ICCV 2025] MRGen: Segmentation Data Engine for Underrepresented MRI Modalities☆38Sep 26, 2025Updated 4 months ago
- A Text2SQL benchmark for evaluation of Large Language Models☆41Updated this week
- Benchmarking Video-LLMs on Video Spatio-Temporal Reasoning☆41Aug 4, 2025Updated 6 months ago
- [ICCV 2025] Dynamic-VLM☆28Dec 16, 2024Updated last year
- [ICLR'25 Oral] MMIE: Massive Multimodal Interleaved Comprehension Benchmark for Large Vision-Language Models☆35Nov 3, 2024Updated last year
- Official repository of "Interactive Text-to-Image Retrieval with Large Language Models: A Plug-and-Play Approach" (ACL 2024 Oral)☆34Mar 24, 2025Updated 10 months ago
- ☆27Apr 8, 2025Updated 10 months ago
- [AAAI26] LongLLaDA: Unlocking Long Context Capabilities in Diffusion LLMs☆52Dec 7, 2025Updated 2 months ago
- [ICML 2025] Official resources of "KBQA-o1: Agentic Knowledge Base Question Answering with Monte Carlo Tree Search".☆34Dec 6, 2025Updated 2 months ago
- [NeurIPS ENLSP Workshop'24] CSKV: Training-Efficient Channel Shrinking for KV Cache in Long-Context Scenarios☆16Oct 18, 2024Updated last year
- ☆18Jun 10, 2025Updated 8 months ago
- Adapting LLaMA Decoder to Vision Transformer☆30May 20, 2024Updated last year
- [ACM MM2025] The official repository for the RealSyn dataset☆40Dec 14, 2025Updated 2 months ago
- Official repo for "AlignGPT: Multi-modal Large Language Models with Adaptive Alignment Capability"☆34Jul 12, 2024Updated last year
- A Survey on Benchmarks of Multimodal Large Language Models☆148Jul 1, 2025Updated 7 months ago
- Your efficient and accurate answer verification system for RL training.☆41Jun 23, 2025Updated 7 months ago
- ICCV 2025: Official Implematation of "Aligning Vision to Language: Annotation-Free Multimodal Knowledge Graph Construction for Enhanced L…☆59Oct 25, 2025Updated 3 months ago
- Official repository for K-EXAONE built by LG AI Research☆66Feb 6, 2026Updated last week
- [COLM-2024] List Items One by One: A New Data Source and Learning Paradigm for Multimodal LLMs☆145Aug 23, 2024Updated last year