CaseDrive / publaynet-models
Trained Detectron2 object detection models for document layout analysis based on PubLayNet dataset
☆27Updated 2 years ago
Alternatives and similar repositories for publaynet-models
Users that are interested in publaynet-models are comparing it to the libraries listed below
Sorting:
- Trained Detectron2 object detection models for document layout analysis based on PubLayNet dataset☆49Updated 2 years ago
- ReadingBank: A Benchmark Dataset for Reading Order Detection☆105Updated 8 months ago
- A Unified Toolkit for Deep Learning-Based Table Extraction☆35Updated 5 months ago
- ☆22Updated last year
- DocLayNet: A Large Human-Annotated Dataset for Document-Layout Analysis☆340Updated 2 years ago
- Dataset and scripts for HRDoc☆37Updated last year
- Algorithms, papers, datasets, performance comparisons for Document AI. Continuously updating.☆190Updated 2 months ago
- Object Detection Model for Scanned Documents☆93Updated 2 months ago
- My implementation of Kosmos2.5 from the paper: "KOSMOS-2.5: A Multimodal Literate Model"☆73Updated last month
- ☆82Updated 2 years ago
- Datasets and Evaluation Scripts for CompHRDoc☆38Updated 2 months ago
- ☆80Updated 3 years ago
- Table detection (TD) and table structure recognition (TSR) using Yolov5/Yolov8, and you can get the same (even better) result compared wi…☆45Updated 10 months ago
- YOLOv10 trained on DocLayNet dataset.☆72Updated 6 months ago
- ☆126Updated last week
- Table Structure Recognition☆74Updated 2 years ago
- High-Performance Transformers for Table Structure Recognition Need Early Convolutions☆43Updated last year
- A Curated List of Awesome Table Structure Recognition (TSR) Research. Including models, papers, datasets and codes. Continuously updating…☆181Updated 8 months ago
- Dataset of PNG images from synthetically generated table layouts with annotations in JSONL files☆142Updated last week
- YOLO models trained by DocLayNet - power your Document Intelligent by Layout Analysis☆106Updated 2 months ago
- Code for ICPR2022 paper: "Graph Neural Networks and Representation Embedding for table extraction in PDF Documents"☆35Updated last year
- DocLLM: A layout-aware generative language model for multimodal document understanding☆126Updated last year
- We identify the desiderata for a comprehensive benchmark and propose Visually Rich Document Understanding (VRDU). VRDU contains two datas…☆80Updated 2 years ago
- Example codebase for fine-tuning layoutLMv3 on DocVQA☆50Updated 2 years ago
- Doc2Graph transforms documents into graphs and exploit a GNN to solve several tasks.☆121Updated last year
- MTL-TabNet: Multi-task Learning based Model for Image-based Table Recognition☆100Updated 11 months ago
- Incorporating VIsual LAyout Structures for Scientific Text Classification☆176Updated 2 years ago
- 阅读顺序、Layoutreader☆12Updated last week
- An official implementation of paper "Paragraph2Graph: A Language-independent GNN-based framework for layout analysis"☆78Updated last year
- ICDAR 2024 Table OCR Model☆34Updated 5 months ago