CaseDrive / publaynet-models
Trained Detectron2 object detection models for document layout analysis based on PubLayNet dataset
☆28 Updated 2 years ago
Alternatives and similar repositories for publaynet-models
Users that are interested in publaynet-models are comparing it to the libraries listed below
- Trained Detectron2 object detection models for document layout analysis based on PubLayNet dataset ☆48 Updated 2 years ago
- Object Detection Model for Scanned Documents ☆93 Updated 3 months ago
- DocLayNet: A Large Human-Annotated Dataset for Document-Layout Analysis ☆349 Updated 2 years ago
- Algorithms, papers, datasets, performance comparisons for Document AI. Continuously updating. ☆194 Updated 3 months ago
- Dataset and scripts for HRDoc ☆38 Updated 2 years ago
- High-Performance Transformers for Table Structure Recognition Need Early Convolutions ☆44 Updated last year
- ReadingBank: A Benchmark Dataset for Reading Order Detection ☆106 Updated 10 months ago
- Datasets and Evaluation Scripts for CompHRDoc ☆44 Updated 4 months ago
- ☆130 Updated last month
- ☆84 Updated 2 years ago
- A Curated List of Awesome Table Structure Recognition (TSR) Research. Including models, papers, datasets and codes. Continuously updating… ☆191 Updated 9 months ago
- An unofficial PyTorch implementation of ERNIE-Layout, which was originally released through PaddleNLP. ☆106 Updated last year
- Dataset of PNG images from synthetically generated table layouts with annotations in JSONL files ☆145 Updated last month
- https://dl.acm.org/doi/10.1145/3657281 ☆96 Updated last year
- Reading order, LayoutReader ☆16 Updated last month
- My implementation of Kosmos2.5 from the paper: "KOSMOS-2.5: A Multimodal Literate Model" ☆73 Updated 2 months ago
- YOLOv10 trained on DocLayNet dataset. ☆75 Updated 7 months ago
- Table Structure Recognition ☆77 Updated 2 years ago
- YOLO models trained on DocLayNet - power your Document Intelligence with Layout Analysis ☆115 Updated 3 months ago
- Code for ICPR2022 paper: "Graph Neural Networks and Representation Embedding for table extraction in PDF Documents" ☆35 Updated last year
- XFUND: A Multilingual Form Understanding Benchmark ☆205 Updated 2 years ago
- We identify the desiderata for a comprehensive benchmark and propose Visually Rich Document Understanding (VRDU). VRDU contains two datas… ☆80 Updated 2 years ago
- ICDAR 2024 Table OCR Model ☆34 Updated 6 months ago
- A Unified Toolkit for Deep Learning-Based Table Extraction ☆40 Updated 7 months ago
- A Faster LayoutReader Model based on LayoutLMv3, Sort OCR bboxes to reading order. ☆252 Updated 2 weeks ago
- Example codebase for fine-tuning LayoutLMv3 on DocVQA ☆52 Updated 2 years ago
- Incorporating VIsual LAyout Structures for Scientific Text Classification ☆178 Updated 2 years ago
- Table detection (TD) and table structure recognition (TSR) using YOLOv5/YOLOv8, and you can get the same (even better) result compared wi… ☆46 Updated 11 months ago
- ☆32 Updated last year
- ☆22 Updated last year