DS4SD / DocLayNet
DocLayNet: A Large Human-Annotated Dataset for Document-Layout Analysis
☆336 Updated 2 years ago
Alternatives and similar repositories for DocLayNet:
Users who are interested in DocLayNet are comparing it to the repositories listed below.
- Algorithms, papers, datasets, performance comparisons for Document AI. Continuously updating. ☆187 Updated last month
- Official PyTorch implementation of LiLT: A Simple yet Effective Language-Independent Layout Transformer for Structured Document Understan… ☆346 Updated 2 years ago
- DocBank: A Benchmark Dataset for Document Layout Analysis ☆603 Updated 8 months ago
- UniTable: Towards a Unified Table Foundation Model ☆461 Updated 10 months ago
- Dataset of PNG images from synthetically generated table layouts with annotations in JSONL files ☆140 Updated last year
- ReadingBank: A Benchmark Dataset for Reading Order Detection ☆104 Updated 7 months ago
- Document Layout Analysis ☆365 Updated this week
- Document Layout Analysis resources repos for development with PdfPig. ☆611 Updated last year
- Incorporating VIsual LAyout Structures for Scientific Text Classification ☆175 Updated 2 years ago
- A Curated List of Awesome Table Structure Recognition (TSR) Research. Including models, papers, datasets and codes. Continuously updating… ☆175 Updated 7 months ago
- DocILE: Document Information Localization and Extraction Benchmark ☆125 Updated 11 months ago
- YOLOv10 trained on DocLayNet dataset. ☆73 Updated 5 months ago
- A High-efficiency Open-source Toolkit for Table-to-Latex Task ☆231 Updated 4 months ago
- The scripts for training Detectron2-based Layout Models on popular layout analysis datasets ☆210 Updated last year
- Implementation of DocFormer: End-to-End Transformer for Document Understanding, a multi-modal transformer based architecture for the task… ☆271 Updated 2 years ago
- A Faster LayoutReader Model based on LayoutLMv3, Sort OCR bboxes to reading order. ☆210 Updated 10 months ago
- Doc2Graph transforms documents into graphs and exploit a GNN to solve several tasks. ☆120 Updated last year
- YOLO models trained by DocLayNet - power your Document Intelligent by Layout Analysis ☆100 Updated last month
- Object Detection Model for Scanned Documents ☆91 Updated last month
- XFUND: A Multilingual Form Understanding Benchmark ☆200 Updated 2 years ago
- Table structure recognition dataset of the paper: Complicated Table Structure Recognition ☆360 Updated 4 years ago
- A curated list of resources dedicated to table recognition ☆401 Updated 4 months ago
- An unofficial Pytorch implementation of ERNIE-Layout which is originally released through PaddleNLP. ☆105 Updated last year
- A large scale camera-taken table detection and recognition dataset. ☆124 Updated last year
- https://dl.acm.org/doi/10.1145/3657281 ☆96 Updated 11 months ago
- ICDAR 2019: MaskRCNN on PubLayNet datasets. Paragraph detection, table detection, figure detection,... ☆179 Updated 3 years ago