JPLeoRX / detectron2-publaynet
Trained Detectron2 object detection models for document layout analysis based on PubLayNet dataset
☆50 · Apr 16, 2023 · Updated 2 years ago
Alternatives and similar repositories for detectron2-publaynet
Users who are interested in detectron2-publaynet are comparing it to the libraries listed below.
- Trained Detectron2 object detection models for document layout analysis based on PubLayNet dataset · ☆29 · Apr 16, 2023 · Updated 2 years ago
- Official Repository of RefChartQA: Grounding Visual Answer on Chart Images through Instruction Tuning · ☆14 · Jul 9, 2025 · Updated 7 months ago
- Official code for the paper: "Perception and Semantic Aware Regularization for Sequential Confidence Calibration (CVPR2023)" · ☆10 · May 15, 2024 · Updated last year
- A curated list of resources on Document Layout Analysis · ☆11 · Aug 7, 2025 · Updated 6 months ago
- ☆12 · Jun 11, 2023 · Updated 2 years ago
- [ICPRAI 2024] DocumentCLIP: Linking Figures and Main Body Text in Reflowed Documents · ☆16 · Apr 4, 2024 · Updated last year
- An unofficial PyTorch implementation of ERNIE-Layout, which was originally released through PaddleNLP. · ☆107 · Nov 15, 2023 · Updated 2 years ago
- Detectron2 for Document Layout Analysis · ☆187 · Aug 2, 2024 · Updated last year
- ☆20 · Nov 3, 2022 · Updated 3 years ago
- ☆16 · Jan 30, 2022 · Updated 4 years ago
- ☆14 · May 26, 2023 · Updated 2 years ago
- [ICDAR 2023] (Oral) An End-to-End Unified Domain Adaptive Transformer for Document Instance Segmentation · ☆76 · Sep 12, 2024 · Updated last year
- TextAdaIN: Paying Attention to Shortcut Learning in Text Recognizers · ☆21 · Jul 26, 2022 · Updated 3 years ago
- This is the official repository of the EMNLP 2023 paper Reading Order Matters: Information Extraction from Visually-rich Documents by Tok… · ☆18 · Mar 15, 2024 · Updated last year
- Document Layout Analysis · ☆395 · Updated this week
- This repository contains source codes for SoftCTC. Original paper can be found here: https://arxiv.org/abs/2212.02135 · ☆19 · Mar 7, 2023 · Updated 2 years ago
- Finetune LayoutLM on SROIE dataset using W&B tools · ☆19 · Dec 2, 2021 · Updated 4 years ago
- [ICLR 2023] “Layer Grafted Pre-training: Bridging Contrastive Learning And Masked Image Modeling For Better Representations”, Ziyu Jian… · ☆24 · Feb 16, 2023 · Updated 2 years ago
- Datasets and Evaluation Scripts for CompHRDoc · ☆56 · Feb 25, 2025 · Updated 11 months ago
- ABINet++: Autonomous, Bidirectional and Iterative Language Modeling for Scene Text Spotting · ☆86 · Feb 11, 2023 · Updated 3 years ago
- ☆27 · Nov 29, 2023 · Updated 2 years ago
- This is the official implementation to the EMNLP 2024 paper: Modeling Layout Reading Order as Ordering Relations for Visually-rich Docume… · ☆30 · Jan 19, 2026 · Updated 3 weeks ago
- Geometry Normalization Networks for Accurate Scene Text Detection (ICCV 2019) · ☆21 · Apr 3, 2020 · Updated 5 years ago
- The scripts for training Detectron2-based Layout Models on popular layout analysis datasets · ☆218 · Sep 26, 2023 · Updated 2 years ago
- ☆1,038 · Jul 9, 2025 · Updated 7 months ago
- DocBank: A Benchmark Dataset for Document Layout Analysis · ☆633 · Aug 12, 2024 · Updated last year
- Research papers and code on information extraction from image/pdf · ☆97 · Nov 25, 2022 · Updated 3 years ago
- DocBankLoader is a dataset loader for DocBank, and can convert DocBank to the Object Detection models' format. · ☆25 · Mar 17, 2021 · Updated 4 years ago
- PyTorch implementation of BMVC2022 paper Masked Vision-Language Transformers for Scene Text Recognition · ☆29 · Nov 11, 2022 · Updated 3 years ago
- Contrast-guided Feature Adjustment Module for Visual Information Extraction · ☆30 · May 23, 2023 · Updated 2 years ago
- Read Ten Lines at One Glance: Line-Aware Semi-Autoregressive Transformer for Multi-Line Handwritten Mathematical Expression Recognition · ☆28 · Aug 29, 2023 · Updated 2 years ago
- ☆29 · Aug 31, 2022 · Updated 3 years ago
- Algorithms, papers, datasets, performance comparisons for Document AI. Continuously updating. · ☆203 · Mar 1, 2025 · Updated 11 months ago
- ☆38 · Feb 4, 2023 · Updated 3 years ago
- A curated list of resources dedicated to table recognition · ☆406 · Dec 12, 2024 · Updated last year
- An official implementation of paper "Paragraph2Graph: A Language-independent GNN-based framework for layout analysis" · ☆81 · Oct 14, 2023 · Updated 2 years ago
- XFUND: A Multilingual Form Understanding Benchmark · ☆217 · Jul 15, 2022 · Updated 3 years ago
- [CVPR2025] VDocRAG: Retrieval-Augmented Generation over Visually-Rich Documents · ☆58 · May 26, 2025 · Updated 8 months ago
- Implement 'Single Shot Text Detector with Regional Attention, ICCV 2017 Spotlight' · ☆83 · Feb 28, 2018 · Updated 7 years ago