Official Repository of MMLONGBENCH-DOC: Benchmarking Long-context Document Understanding with Visualizations
☆136Sep 28, 2025Updated 7 months ago
Alternatives and similar repositories for MMLongBench-Doc
Users that are interested in MMLongBench-Doc are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆43Jul 28, 2025Updated 9 months ago
- SlideVQA: A Dataset for Document Visual Question Answering on Multiple Images (AAAI2023)☆105Mar 31, 2025Updated last year
- About Data and Codes for EMNLP 2023 System Demo Paper "QACHECK: A Demonstration System for Question-Guided Multi-Hop Fact-Checking"☆19Dec 19, 2023Updated 2 years ago
- Official repository of MMDU dataset☆107Sep 29, 2024Updated last year
- A summary of must-read papers for Neural Question Generation (NQG)☆14Nov 14, 2020Updated 5 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- ☆40Apr 6, 2026Updated 3 weeks ago
- ☆32Aug 30, 2024Updated last year
- Crawled Wikipedia Tables with Passages☆14Aug 19, 2021Updated 4 years ago
- Extending context length of visual language models☆12Dec 18, 2024Updated last year
- The Dataset and Official Implementation for <The ELCo Dataset: Bridging Emoji and Lexical Composition> @ LREC-COLING 2024☆16May 11, 2024Updated last year
- ACL'2023: Few-shot Event Detection: An Empirical Study and a Unified View☆11Mar 13, 2024Updated 2 years ago
- ☆16Jul 23, 2024Updated last year
- LMM for VQA, tcsvt version☆10Jul 19, 2024Updated last year
- [ICLR 2026] An official implementation of "SIM-CoT: Supervised Implicit Chain-of-Thought"☆202Apr 13, 2026Updated 3 weeks ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- [Neurips 24' D&B] Official Dataloader and Evaluation Scripts for LongVideoBench.☆124Jul 27, 2024Updated last year
- ☆19Sep 11, 2024Updated last year
- ☆70Jan 9, 2024Updated 2 years ago
- ☆22Nov 5, 2024Updated last year
- Vision Document Retrieval (ViDoRe): Benchmark. Evaluation code for the ColPali paper.☆271Mar 25, 2026Updated last month
- Official Repo for the paper: VCR: Visual Caption Restoration. Check arxiv.org/pdf/2406.06462 for details.☆32Feb 26, 2025Updated last year
- This is the official repository for Retrieval Augmented Visual Question Answering☆248Dec 19, 2024Updated last year
- ☆31Dec 9, 2023Updated 2 years ago
- TAT-DQA: Towards Complex Document Understanding By Discrete Reasoning☆23Sep 17, 2024Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆15Jan 9, 2026Updated 3 months ago
- The proposed simulated dataset consisting of 9,536 charts and associated data annotations in CSV format.☆26Feb 22, 2024Updated 2 years ago
- ☆12Jun 20, 2023Updated 2 years ago
- This repository contains source code for the PASTA model, a pre-trained language model for table-based fact verification.☆18Dec 27, 2022Updated 3 years ago
- ☆17Oct 22, 2024Updated last year
- [ICLR2025] Draw-and-Understand: Leveraging Visual Prompts to Enable MLLMs to Comprehend What You Want☆96Dec 1, 2025Updated 5 months ago
- [CVPR 2025] Official implementation of ByTheWay: Boost Your Text-to-Video Generation Model to Higher Quality in a Training-free Way☆48Oct 10, 2025Updated 6 months ago
- [ICCV 2025] The official code of the paper "Deciphering Cross-Modal Alignment in Large Vision-Language Models with Modality Integration R…☆111Jul 9, 2025Updated 9 months ago
- Official implementation of URaG: Unified Retrieval and Generation in Multimodal LLMs for Efficient Long Document Understanding (AAAI 2026…☆41Feb 4, 2026Updated 3 months ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- The project page for "SCITAB: A Challenging Benchmark for Compositional Reasoning and Claim Verification on Scientific Tables"☆23Dec 21, 2023Updated 2 years ago
- [NeurIPS 2024] This repo contains evaluation code for the paper "Are We on the Right Way for Evaluating Large Vision-Language Models"☆207Sep 26, 2024Updated last year
- This is the repository for NAACL'25 paper "TART: An Open-Source Tool-Augmented Framework for Explainable Table-based Reasoning"☆56May 3, 2025Updated last year
- This repo contains evaluation code for the paper "AV-Odyssey: Can Your Multimodal LLMs Really Understand Audio-Visual Information?"☆31Dec 23, 2024Updated last year
- [MM'2024] Official release of RFUND introduced in the MM'2024 paper "PEneo: Unifying Line Extraction, Line Grouping, and Entity Linking f…☆21Dec 4, 2024Updated last year
- Open-source evaluation toolkit of large multi-modality models (LMMs), support 220+ LMMs, 80+ benchmarks☆4,096Updated this week
- MTVQA: Benchmarking Multilingual Text-Centric Visual Question Answering. A comprehensive evaluation of multimodal large model multilingua…☆64May 15, 2025Updated 11 months ago