Dataset and Code for our ACL 2024 paper: "Multimodal Table Understanding". We propose the first large-scale Multimodal IFT and Pre-Train Dataset for table understanding and develop a generalist tabular MLLM named Table-LLaVA.
☆226Jun 12, 2025Updated 9 months ago
Alternatives and similar repositories for Table-LLaVA
Users that are interested in Table-LLaVA are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- We collect papers about "large language models (LLM) for table-related tasks", e.g., using LLM for Table QA task. “表格+LLM”相关论文整理☆613Dec 15, 2025Updated 3 months ago
- 本项目旨在收集开源的表格智能任务数据集(比如表格问答、表格-文本生成等),将原始数据整理为指令微调格式的数据并微调LLM,进而增强LLM对于表格数据的理解,最终构建出专门面向表格智能任务的大型语言模型。☆644Apr 22, 2024Updated last year
- UniTable: Towards a Unified Table Foundation Model☆529Jun 4, 2024Updated last year
- This repository is the codebase of TabPedia: Towards Comprehensive Visual Table Understanding with Concept Synergy☆50Oct 16, 2024Updated last year
- TAT-DQA: Towards Complex Document Understanding By Discrete Reasoning☆23Sep 17, 2024Updated last year
- A Curated List of Awesome Table Structure Recognition (TSR) Research. Including models, papers, datasets and codes. Continuously updating…☆226Sep 9, 2024Updated last year
- A curated list of recent and past chart understanding work based on our IEEE TKDE survey paper: From Pixels to Insights: A Survey on Auto…☆233Dec 17, 2025Updated 3 months ago
- WikiTableSet: A largest publicly available image-based table recognition dataset in three languages built from Wikipedia☆32Jun 12, 2025Updated 9 months ago
- Dataset of PNG images from synthetically generated table layouts with annotations in JSONL files☆152Sep 17, 2025Updated 6 months ago
- Codes and Datasets for the Paper: Text-Tuple-Table: Towards Information Integration in Text-to-Table Generation via Global Tuple Extracti…☆15Jun 5, 2024Updated last year
- ☆102Dec 23, 2024Updated last year
- A High-efficiency Open-source Toolkit for Table-to-Latex Task☆276Dec 6, 2025Updated 3 months ago
- ☆143Feb 13, 2024Updated 2 years ago
- Resources on Large Language Models for Table Processing☆110Oct 24, 2024Updated last year
- A large scale camera-taken table detection and recognition dataset.☆149Jul 21, 2025Updated 8 months ago
- [ACL 2022] A hierarchical table dataset for question answering and data-to-text generation.☆107Dec 16, 2025Updated 3 months ago
- [NAACL'24] Dataset, code and models for "TableLlama: Towards Open Large Generalist Models for Tables".☆135May 14, 2024Updated last year
- [MM'2024] PEneo, an effective algorithm for key-value pair extraction from form-like documents, designed for real-world applications.☆41Apr 7, 2025Updated 11 months ago
- ICDAR 2024 Table OCR Model☆39Feb 25, 2026Updated 3 weeks ago
- [NAACL 2024] Visually Guided Generative Text-Layout Pre-training for Document Intelligence☆149Sep 10, 2024Updated last year
- ☆41Jun 15, 2024Updated last year
- ☆46May 21, 2024Updated last year
- RoDLA: Benchmarking the Robustness of Document Layout Analysis Models☆39Mar 26, 2025Updated 11 months ago
- Document Artifical Intelligence☆202Sep 28, 2025Updated 5 months ago
- Table Structure Recognition☆82Mar 11, 2023Updated 3 years ago
- TAT-LLM: A Specialized Language Model for Discrete Reasoning over Tabular and Textual Data☆28Feb 28, 2024Updated 2 years ago
- Dataset introduced in PlotQA: Reasoning over Scientific Plots☆84Jun 20, 2023Updated 2 years ago
- mPLUG-DocOwl: Modularized Multimodal Large Language Model for Document Understanding☆2,375May 30, 2025Updated 9 months ago
- official code for "Fox: Focus Anywhere for Fine-grained Multi-page Document Understanding"☆194May 31, 2024Updated last year
- [ICCV2025] A Token-level Text Image Foundation Model for Document Understanding☆132Aug 27, 2025Updated 6 months ago
- Dataset and Code for ACL 2023 paper: "IM-TQA: A Chinese Table Question Answering Dataset with Implicit and Multi-type Table Structures". …☆27Aug 6, 2024Updated last year
- [ECCV 2024] Official code implementation of Vary: Scaling Up the Vision Vocabulary of Large Vision Language Models.☆1,897Dec 30, 2024Updated last year
- The official code for NAACL 2024 paper: $E^5$: Zero-shot Hierarchical Table Analysis using Augmented LLMs via Explain, Extract, Execute, …☆14Jun 23, 2024Updated last year
- A collection of original, innovative ideas and algorithms towards Advanced Literate Machinery. This project is maintained by the OCR Team…☆1,825Mar 17, 2026Updated last week
- A curated list of resources dedicated to table recognition☆405Dec 12, 2024Updated last year
- This is the repository of DEER, a Dynamic Early Exit in Reasoning method for Large Reasoning Language Models.☆181Jul 7, 2025Updated 8 months ago
- 2D-TPE: Two-Dimensional Positional Encoding Enhances Table Understanding for Large Language Models (WWW 2025)☆10Apr 15, 2025Updated 11 months ago
- 【ArXiv】PDF-Wukong: A Large Multimodal Model for Efficient Long PDF Reading with End-to-End Sparse Sampling☆128Jun 4, 2025Updated 9 months ago
- ☆56Oct 30, 2024Updated last year