Lillianwei-h/MMIE

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Lillianwei-h/MMIE)

Lillianwei-h / MMIE

[ICLR'25 Oral] MMIE: Massive Multimodal Interleaved Comprehension Benchmark for Large Vision-Language Models

☆35

Alternatives and similar repositories for MMIE

Users that are interested in MMIE are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

aiming-lab / EduVisAgent
View on GitHub
[ICLR'26] EduVisAgent: A Benchmark and Multi-Agent Framework for Pedagogical Visualization
☆30Aug 5, 2025Updated 11 months ago
richard-peng-xia / RULE
View on GitHub
[EMNLP'24] RULE: Reliable Multimodal RAG for Factuality in Medical Vision Language Models
☆98Dec 13, 2024Updated last year
jiaangli / VILA
View on GitHub
[TACL/EMNLP'24] Do Vision and Language Models Share Concepts? A Vector Space Alignment Study
☆16Nov 22, 2024Updated last year
Ravindu-Yasas-Nagasinghe / KEPP
View on GitHub
[CVPR 2024] KEPP: Why Not Use Your Textbook? Knowledge-Enhanced Procedure Planning of Instructional Videos
☆12Sep 24, 2024Updated last year
sail-sg / ActivePRM
View on GitHub
☆21Apr 16, 2025Updated last year
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
huaxiuyao / KGML
View on GitHub
KGML for EMNLP 2021
☆10Feb 2, 2022Updated 4 years ago
agneet42 / revision
View on GitHub
[ECCV 2024] "REVISION: Rendering Tools Enable Spatial Fidelity in Vision-Language Models"
☆13Aug 6, 2024Updated last year
huaxiuyao / HSML_Dynamic
View on GitHub
HSML Dynamic version for ICML 2019
☆12Jul 11, 2019Updated 7 years ago
Aurora-slz / MM-Verify
View on GitHub
☆19Oct 28, 2025Updated 8 months ago
richard-peng-xia / MMed-RAG
View on GitHub
[ICLR'25] MMed-RAG: Versatile Multimodal RAG System for Medical Vision Language Models
☆335Jan 22, 2025Updated last year
LesterGong / MMRB
View on GitHub
The official repository of paper "Evaluating MLLMs with Multimodal Multi-image Reasoning Benchmark"
☆19Jun 20, 2025Updated last year
SparksJoe / Prism
View on GitHub
A Framework for Decoupling and Assessing the Capabilities of VLMs
☆44Jun 28, 2024Updated 2 years ago
YiyangZhou / CSR
View on GitHub
[NeurIPS 2024] Calibrated Self-Rewarding Vision Language Models
☆87Oct 26, 2025Updated 8 months ago
mahtabbigverdi / Aurora
View on GitHub
☆12Dec 4, 2024Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
mhw32 / prototransformer-public
View on GitHub
PyTorch implementation for "ProtoTransformer: A Meta-Learning Approach to Providing Student Feedback" (https://arxiv.org/abs/2107.14035).
☆16Sep 9, 2022Updated 3 years ago
apple / ml-mia-bench
View on GitHub
This repo contains code and data for ICLR 2025 paper MIA-Bench: Towards Better Instruction Following Evaluation of Multimodal LLMs
☆38Mar 9, 2025Updated last year
hewei2001 / ReachQA
View on GitHub
[EMNLP 2025] Distill Visual Chart Reasoning Ability from LLMs to MLLMs
☆61Aug 25, 2025Updated 10 months ago
uqzhichen / Awesome-compositional-zero-shot-learning
View on GitHub
Paper list of compositional zero-shot learning
☆11Jul 5, 2022Updated 4 years ago
liushulinle / UloRL
View on GitHub
An Ultra-Long Output Reinforcement Learning Approach
☆23Jul 31, 2025Updated 11 months ago
aiming-lab / MMedPO
View on GitHub
[ICML'25] MMedPO: Aligning Medical Vision-Language Models with Clinical-Aware Multimodal Preference Optimization
☆74Jun 5, 2025Updated last year
nelson-liu / website
View on GitHub
☆13Feb 5, 2022Updated 4 years ago
mbzuai-oryx / TimeTravel
View on GitHub
[ACL 2025 🔥] Time Travel is a Comprehensive Benchmark to Evaluate LMMs on Historical and Cultural Artifacts
☆20May 22, 2025Updated last year
real-absolute-AI / SynthRL
View on GitHub
SynthRL: Scaling Visual Reasoning with Verifiable Data Synthesis
☆70Jul 24, 2025Updated 11 months ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
Sueqk / LMM-VQA
View on GitHub
LMM for VQA, tcsvt version
☆10Jul 19, 2024Updated 2 years ago
ArmelRandy / tree-of-problems
View on GitHub
[EMNLP 2024] Tree of Problems: Improving structured problem solving with compositionality
☆20Mar 4, 2025Updated last year
RAIVNLab / neural-priming
View on GitHub
Code repository for the paper - "Neural Priming for Sample-Efficient Adaptation"
☆14Nov 13, 2023Updated 2 years ago
UCSC-VLAA / vllm-safety-benchmark
View on GitHub
[ECCV 2024] Official PyTorch Implementation of "How Many Unicorns Are in This Image? A Safety Evaluation Benchmark for Vision LLMs"
☆89Nov 28, 2023Updated 2 years ago
Sreyan88 / VDGD
View on GitHub
Code for ICLR 2025 Paper: Visual Description Grounding Reduces Hallucinations and Boosts Reasoning in LVLMs
☆25May 7, 2025Updated last year
hkust-nlp / RL-Verifier-Robustness
View on GitHub
From Accuracy to Robustness: A Study of Rule- and Model-based Verifiers in Mathematical Reasoning.
☆24Oct 7, 2025Updated 9 months ago
Bollegala / DARep
View on GitHub
Cross-domain word representation learning
☆10May 23, 2015Updated 11 years ago
yxin98 / EMNLP_2022
View on GitHub
☆13Jun 7, 2022Updated 4 years ago
sail-sg / Video-Next-Event-Prediction
View on GitHub
☆28Aug 9, 2025Updated 11 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
hsajjad / ConceptX
View on GitHub
Analyzing Latent Concept in Pre-trained Transformer Models
☆12Jul 18, 2022Updated 4 years ago
mathllm / MathCoder2
View on GitHub
☆71Oct 16, 2024Updated last year
yale-nlp / refdpo
View on GitHub
☆16Jul 23, 2024Updated last year
ahmdtaha / distributed_sigmoid_loss
View on GitHub
Unofficial implementation for Sigmoid Loss for Language Image Pre-Training
☆11Sep 26, 2023Updated 2 years ago
mansheej / icl-task-diversity
View on GitHub
Code for the paper "Pretraining task diversity and the emergence of non-Bayesian in-context learning for regression"
☆27Jun 28, 2023Updated 3 years ago
corca-ai / evaluating-gpt-4o-on-CLIcK
View on GitHub
Evaluate gpt-4o on CLIcK (Korean NLP Dataset)
☆20May 18, 2024Updated 2 years ago
aiming-lab / AutoHarness
View on GitHub
AutoHarness: Automated Harness Engineering for AI Agents
☆351Apr 2, 2026Updated 3 months ago