mayubo2333/MMLongBench-Doc

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/mayubo2333/MMLongBench-Doc)

mayubo2333 / MMLongBench-Doc

Official Repository of MMLONGBENCH-DOC: Benchmarking Long-context Document Understanding with Visualizations

☆149

Alternatives and similar repositories for MMLongBench-Doc

Users that are interested in MMLongBench-Doc are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

WxxShirley / MoLoRAG
View on GitHub
[EMNLP 2025] Official implementation for paper "MoLoRAG: Bootstrapping Document Understanding via Multi-modal Logic-aware Retrieval"
☆27Mar 17, 2026Updated 4 months ago
dengc2023 / LongDocURL
View on GitHub
☆42Apr 6, 2026Updated 3 months ago
Anni-Zou / DocBench
View on GitHub
DocBench: A Benchmark for Evaluating LLM-based Document Reading Systems
☆79Sep 29, 2024Updated last year
nttmdlab-nlp / SlideVQA
View on GitHub
SlideVQA: A Dataset for Document Visual Question Answering on Multiple Images (AAAI2023)
☆106Mar 31, 2025Updated last year
bloomberg / m3docrag
View on GitHub
☆71May 19, 2025Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
SalesforceAIResearch / UniDoc-Bench
View on GitHub
☆38Jun 2, 2026Updated last month
ag2ai / SimpleDoc
View on GitHub
☆41Jan 9, 2026Updated 6 months ago
shi-yx / URaG
View on GitHub
Official implementation of URaG: Unified Retrieval and Generation in Multimodal LLMs for Efficient Long Document Understanding (AAAI 2026…
☆43Feb 4, 2026Updated 5 months ago
XinyuanLu00 / QACheck
View on GitHub
About Data and Codes for EMNLP 2023 System Demo Paper "QACHECK: A Demonstration System for Question-Guided Multi-Hop Fact-Checking"
☆19Dec 19, 2023Updated 2 years ago
Liuziyu77 / MMDU
View on GitHub
Official repository of MMDU dataset
☆108Sep 29, 2024Updated last year
WING-NUS / Question-Generation-Paper-List
View on GitHub
A summary of must-read papers for Neural Question Generation (NQG)
☆14Nov 14, 2020Updated 5 years ago
TerryPei / CSP
View on GitHub
Cross-Self KV Cache Pruning for Efficient Vision-Language Inference
☆10Dec 15, 2024Updated last year
adlnlp / mmvqa
View on GitHub
☆19Sep 11, 2024Updated last year
PKU-SDS-lab / POQD-ICML25
View on GitHub
☆16Aug 28, 2025Updated 10 months ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
illuin-tech / vidore-benchmark
View on GitHub
Vision Document Retrieval (ViDoRe): Benchmark. Evaluation code for the ColPali paper.
☆278Mar 25, 2026Updated 3 months ago
chenmeiqii / Teach-LLM-LR
View on GitHub
☆32Aug 30, 2024Updated last year
MRAMG-Bench / MRAMG
View on GitHub
[SIGIR 2025] Official impl. of "MRAMG-Bench: A Comprehensive Benchmark for Advancing Multimodal Retrieval-Augmented Multimodal Generation…
☆19Apr 15, 2025Updated last year
OpenGVLab / MMIU
View on GitHub
[ICLR2025] MMIU: Multimodal Multi-image Understanding for Evaluating Large Vision-Language Models
☆98Sep 14, 2024Updated last year
SkiddieAhn / Code-MR-MKG
View on GitHub
[ACL 2024] Multimodal Reasoning with Multimodal Knowledge Graph (pytorch implementation)
☆17Jun 2, 2025Updated last year
aiming-lab / MDocAgent
View on GitHub
MDocAgent: A Multi-Modal Multi-Agent Framework for Document Understanding
☆352Aug 8, 2025Updated 11 months ago
NExTplusplus / TAT-DQA
View on GitHub
TAT-DQA: Towards Complex Document Understanding By Discrete Reasoning
☆24Sep 17, 2024Updated last year
adlnlp / pdfvqa
View on GitHub
☆18Jun 12, 2024Updated 2 years ago
kiaia / GIRAFFE
View on GitHub
Extending context length of visual language models
☆12Dec 18, 2024Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
Alpha-Innovator / DocGenome
View on GitHub
DocGenome: An Open Large-scale Scientific Document Benchmark for Training and Testing Multi-modal Large Models
☆156Jan 13, 2025Updated last year
facebookresearch / multimodal_rewardbench
View on GitHub
Multimodal RewardBench
☆68Feb 21, 2025Updated last year
mayubo2333 / fewshot_ED
View on GitHub
ACL'2023: Few-shot Event Detection: An Empirical Study and a Unified View
☆11Mar 13, 2024Updated 2 years ago
SCUT-DLVCLab / RFUND
View on GitHub
[MM'2024] Official release of RFUND introduced in the MM'2024 paper "PEneo: Unifying Line Extraction, Line Grouping, and Entity Linking f…
☆21Dec 4, 2024Updated last year
Liuziyu77 / RAR
View on GitHub
The official implementation of RAR
☆91Dec 9, 2025Updated 7 months ago
longvideobench / LongVideoBench
View on GitHub
[Neurips 24' D&B] Official Dataloader and Evaluation Scripts for LongVideoBench.
☆133Jul 27, 2024Updated last year
nttmdlab-nlp / VDocRAG
View on GitHub
[CVPR2025] VDocRAG: Retirval-Augmented Generation over Visually-Rich Documents
☆66May 26, 2025Updated last year
yale-nlp / refdpo
View on GitHub
☆16Jul 23, 2024Updated 2 years ago
open-compass / VLMEvalKit
View on GitHub
Open-source evaluation toolkit of large multi-modality models (LMMs), support 220+ LMMs, 80+ benchmarks
☆4,299Updated this week
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
OxRML / MADQA
View on GitHub
Multimodal Agentic Document QA benchmark (MADQA)
☆39Mar 13, 2026Updated 4 months ago
WING-NUS / ELCo
View on GitHub
The Dataset and Official Implementation for <The ELCo Dataset: Bridging Emoji and Lexical Composition> @ LREC-COLING 2024
☆16May 11, 2024Updated 2 years ago
Yuliang-Liu / MultimodalOCR
View on GitHub
On the Hidden Mystery of OCR in Large Multimodal Models (OCRBench)
☆873Updated this week
Alibaba-NLP / VRAG
View on GitHub
Multimodal Retrieval-augmented Generation Framework Built by Tongyi Lab, Alibaba Group.
☆970Apr 29, 2026Updated 2 months ago
OpenMOSS / rope_pp
View on GitHub
[ICLR26] Beyond Real: Imaginary Extension of Rotary Position Embeddings for Long-Context LLMs
☆33Dec 9, 2025Updated 7 months ago
OpenIXCLab / CODA
View on GitHub
CODA: Coordinating the Cerebrum and Cerebellum for a Dual-Brain Computer Use Agent with Decoupled Reinforcement Learning
☆37Aug 28, 2025Updated 10 months ago
InternLM / Spark
View on GitHub
An official implementation of "SPARK: Synergistic Policy And Reward Co-Evolving Framework"
☆25Oct 23, 2025Updated 9 months ago