icip-cas / READoc
☆21Updated 2 months ago
Alternatives and similar repositories for READoc
Users that are interested in READoc are comparing it to the libraries listed below
- ☆88Updated 3 years ago
- A Curated List of Awesome Table Structure Recognition (TSR) Research. Including models, papers, datasets and codes. Continuously updating…☆209Updated 11 months ago
- A large scale camera-taken table detection and recognition dataset.☆140Updated last month
- This is the official repository of the revised datasets FUNSD-r and CORD-r, introduced in EMNLP 2023 paper Reading Order Matters: Informa…☆17Updated last year
- A High-efficiency Open-source Toolkit for Table-to-Latex Task☆261Updated 8 months ago
- ☆145Updated 3 months ago
- This repository is the codebase of TabPedia: Towards Comprehensive Visual Table Understanding with Concept Synergy☆44Updated 10 months ago
- Algorithms, papers, datasets, performance comparisons for Document AI. Continuously updating.☆199Updated 6 months ago
- Dataset and Code for our ACL 2024 paper: "Multimodal Table Understanding". We propose the first large-scale Multimodal IFT and Pre-Train …☆214Updated 2 months ago
- MTL-TabNet: Multi-task Learning based Model for Image-based Table Recognition☆105Updated last year
- A Faster LayoutReader Model based on LayoutLMv3, Sort OCR bboxes to reading order.☆275Updated 3 weeks ago
- UniTable: Towards a Unified Table Foundation Model☆504Updated last year
- Document Artificial Intelligence☆188Updated 4 months ago
- Table Structure Recognition☆76Updated 2 years ago
- ☆65Updated last year
- MPB (Miner-PDF-Benchmark) is an end-to-end PDF document comprehension evaluation suite designed for large-scale model data scenarios.☆22Updated 8 months ago
- 【ArXiv】PDF-Wukong: A Large Multimodal Model for Efficient Long PDF Reading with End-to-End Sparse Sampling☆123Updated 3 months ago
- Repo☆12Updated 3 years ago
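- PDF parsing tool: a vLLM-accelerated implementation of GOT. MinerU handles layout detection and cropping, GOT handles table and formula parsing, providing PDF parsing for RAG☆61Updated 9 months ago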
- On the Hidden Mystery of OCR in Large Multimodal Models (OCRBench)☆699Updated 2 months ago
- An unofficial PyTorch implementation of ERNIE-Layout, which was originally released through PaddleNLP.☆106Updated last year
- https://dl.acm.org/doi/10.1145/3657281☆98Updated last year
- CDLA: A Chinese document layout analysis (CDLA) dataset☆276Updated 3 years ago
- ICDAR 2024 Table OCR Model☆36Updated last month
- A curated list of resources dedicated to table recognition☆403Updated 8 months ago
- DocLayNet: A Large Human-Annotated Dataset for Document-Layout Analysis☆371Updated 2 years ago
- XFUND: A Multilingual Form Understanding Benchmark☆208Updated 3 years ago
- ☆38Updated last year
- [ACM'MM 2024 Oral] Official code for "OneChart: Purify the Chart Structural Extraction via One Auxiliary Token"☆228Updated 4 months ago
- Dataset of PNG images from synthetically generated table layouts with annotations in JSONL files☆148Updated 3 months ago