This repository is the codebase of TabPedia: Towards Comprehensive Visual Table Understanding with Concept Synergy
☆50Oct 16, 2024Updated last year
Alternatives and similar repositories for TabPedia
Users that are interested in TabPedia are comparing it to the libraries listed below
- Code and data for the paper: DTSM: Toward Dense Table Structure Recognition with Text Query Encoder and Adjacent Feature Aggregator☆12Apr 28, 2024Updated last year
- ☆19Mar 10, 2023Updated 2 years ago
- [CVPR 2025] Docopilot: Improving Multimodal Models for Document-Level Understanding☆36Jul 22, 2025Updated 7 months ago
- Official code for the paper: "Perception and Semantic Aware Regularization for Sequential Confidence Calibration (CVPR2023)"☆10May 15, 2024Updated last year
- ☆75Jul 31, 2025Updated 7 months ago
- ☆13May 26, 2025Updated 9 months ago
- ☆17Oct 6, 2024Updated last year
- The official repo for “WildDoc: How Far Are We from Achieving Comprehensive and Robust Document Understanding in the Wild?”☆72May 19, 2025Updated 9 months ago
- The official code for the CVPR 2024 paper: Multi-modal In-Context Learning Makes an Ego-evolving Scene Text Recognizer☆55Jun 14, 2024Updated last year
- Dataset and Code for our ACL 2024 paper: "Multimodal Table Understanding". We propose the first large-scale Multimodal IFT and Pre-Train …☆226Jun 12, 2025Updated 8 months ago
- MTVQA: Benchmarking Multilingual Text-Centric Visual Question Answering. A comprehensive evaluation of multimodal large model multilingua…☆63May 15, 2025Updated 9 months ago
- ☆14May 26, 2023Updated 2 years ago
- Project page for the ICDAR 2023 Paper "Inv3D: a high-resolution 3D invoice dataset for template-guided single-image document unwarping".☆13Dec 21, 2023Updated 2 years ago
- [CVPR 2025] DocLayLLM: An Efficient Multi-modal Extension of Large Language Models for Text-rich Document Understanding☆27Dec 18, 2025Updated 2 months ago
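- This project aims to collect open-source datasets for table intelligence tasks (such as table question answering and table-to-text generation), convert the raw data into an instruction-tuning format, and fine-tune LLMs on it, thereby improving LLMs' understanding of tabular data and ultimately building a large language model dedicated to table intelligence tasks.☆643Apr 22, 2024Updated last year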
- sakura2233565548 / Self-Supervised-Representation-Learning-with-Spatial-Temporal-Consistency-for-SLR: This repository is the source code for Self-Supervised Representation Learning with Spatial-Temporal Consistency for Sign Language Recogn…☆16Apr 27, 2024Updated last year
- ☆18Nov 30, 2025Updated 3 months ago
- time-series row column classification☆14Jan 7, 2022Updated 4 years ago
- Unofficial implementation of DocMAE (WIP): Document Image Rectification via Self-supervised Representation Learning☆20Dec 20, 2023Updated 2 years ago
- ☆38Oct 20, 2023Updated 2 years ago
- The official code for “DeepEraser: Deep Iterative Context Mining for Generic Text Eraser”, TMM, 2024.☆48Aug 26, 2024Updated last year
- ☆32Apr 8, 2025Updated 11 months ago
- The official implementation of SPTS v2: Single-Point Text Spotting☆140Jun 29, 2023Updated 2 years ago
- Table detection (TD) and table structure recognition (TSR) using Yolov5/Yolov8, and you can get the same (even better) result compared wi…☆51Jul 3, 2024Updated last year
- [ICLR 2026] OCR-Reasoning Benchmark: Unveiling the True Capabilities of MLLMs in Complex Text-Rich Image Reasoning☆73Dec 17, 2025Updated 2 months ago
- https://dl.acm.org/doi/10.1145/3657281☆97Apr 25, 2024Updated last year
- ☆27Feb 20, 2024Updated 2 years ago
- ☆102Dec 23, 2024Updated last year
- DocTr++ in PaddlePaddle☆58Jul 24, 2024Updated last year
- Code for ICCV 2023 Paper: “ICL-D3IE: In-Context Learning with Diverse Demonstrations Updating for Document Information Extraction”☆54Aug 8, 2023Updated 2 years ago
- Synthetic identity documents dataset☆35Mar 4, 2025Updated last year
- Official PyTorch implementation for ACM MM22 "UDoc-GAN: Unpaired Document Illumination Correction with Background Light Prior"☆25Aug 5, 2024Updated last year
- A curated list of papers about key information extraction.☆105Dec 18, 2024Updated last year
- TGRNet: A Table Graph Reconstruction Network for Table Structure Recognition☆102Dec 9, 2021Updated 4 years ago
- This is an easy to understand, simplified, broken-down implementation of Diffusion Models written in PyTorch. The architecture is borrowe…☆27Aug 18, 2023Updated 2 years ago
- Generate table images by rendering them in a browser☆236Apr 10, 2024Updated last year
- Read Ten Lines at One Glance: Line-Aware Semi-Autoregressive Transformer for Multi-Line Handwritten Mathematical Expression Recognition☆28Aug 29, 2023Updated 2 years ago
- The official code for “SimFIR: A Simple Framework for Fisheye Image Rectification with Self-supervised Representation Learning”, ICCV, 20…☆31Jul 21, 2024Updated last year
- TRACE: Table Reconstruction Aligned to Corner and Edges (ICDAR 2023)☆31Mar 13, 2024Updated last year