NormXU/ERNIE-Layout-Pytorch

An unofficial PyTorch implementation of ERNIE-Layout, which was originally released through PaddleNLP.

☆107

Alternatives and similar repositories for ERNIE-Layout-Pytorch

Users that are interested in ERNIE-Layout-Pytorch are comparing it to the libraries listed below.

NormXU / Layout2Graph
The official implementation of the paper "Paragraph2Graph: A Language-independent GNN-based framework for layout analysis"
☆82 · Oct 14, 2023 · Updated 2 years ago
SCUT-DLVCLab / Document-AI-Recommendations
Algorithms, papers, datasets, performance comparisons for Document AI.
☆209 · Mar 1, 2025 · Updated last year
MAEHCM / ICL-D3IE
Code for the ICCV 2023 paper “ICL-D3IE: In-Context Learning with Diverse Demonstrations Updating for Document Information Extraction”
☆54 · Aug 8, 2023 · Updated 2 years ago
clovaai / bros
☆163 · Dec 27, 2022 · Updated 3 years ago
jpWang / LiLT
Official PyTorch implementation of LiLT: A Simple yet Effective Language-Independent Layout Transformer for Structured Document Understan…
☆366 · Oct 31, 2022 · Updated 3 years ago
shabie / docformer
Implementation of DocFormer: End-to-End Transformer for Document Understanding, a multi-modal transformer based architecture for the task…
☆290 · Feb 13, 2023 · Updated 3 years ago
allanj / LayoutLMv3-DocVQA
Example codebase for fine-tuning LayoutLMv3 on DocVQA
☆53 · Sep 19, 2022 · Updated 3 years ago
rossumai / docile
DocILE: Document Information Localization and Extraction Benchmark
☆149 · Jun 17, 2026 · Updated last month
wenwenyu / PICK-pytorch
Code for the paper "PICK: Processing Key Information Extraction from Documents using Improved Graph Learning-Convolutional Networks" (ICP…
☆568 · Jul 25, 2024 · Updated last year
chenxn2020 / GOSE
[Paper] Code for the EMNLP2023 (Findings) paper "Global Structure Knowledge-Guided Relation Extraction Method for Visually-Rich Document"
☆17 · Dec 1, 2023 · Updated 2 years ago
google-research-datasets / vrdu
We identify the desiderata for a comprehensive benchmark and propose Visually Rich Document Understanding (VRDU). VRDU contains two datas…
☆83 · Feb 8, 2023 · Updated 3 years ago
ZZR8066 / GraphDoc
☆45 · Jul 18, 2022 · Updated 4 years ago
Xiaomeng-Yang / STR_benchmark_cleansed
☆14 · May 26, 2023 · Updated 3 years ago
weijiawu / TransVTSpotter
A new video text spotting framework with Transformer
☆82 · May 23, 2022 · Updated 4 years ago
NormXU / DocParser-Pytorch
An unofficial implementation of DocParser: End-to-end OCR-free Information Extraction from Visually Rich Documents
☆38 · Sep 9, 2023 · Updated 2 years ago
amazon-science / glass-text-spotting
Official implementation for "GLASS: Global to Local Attention for Scene-Text Spotting" (ECCV'22)
☆102 · Jun 28, 2024 · Updated 2 years ago
doc-analysis / DocBank
DocBank: A Benchmark Dataset for Document Layout Analysis
☆652 · Aug 12, 2024 · Updated last year
prohandler / GS-Bulk-Emails
A Google Apps Script that sends a number of emails from the specific number and tracks the open status of each email
☆17 · Dec 11, 2024 · Updated last year
applicaai / kleister-nda
☆61 · Aug 18, 2021 · Updated 4 years ago
jfkuang / CFAM
Contrast-guided Feature Adjustment Module for Visual Information Extraction
☆30 · May 23, 2023 · Updated 3 years ago
adlnlp / doc_gcn
☆19 · May 30, 2023 · Updated 3 years ago
WenjinW / LATIN-Prompt
☆52 · May 28, 2024 · Updated 2 years ago
doc-analysis / XFUND
XFUND: A Multilingual Form Understanding Benchmark
☆223 · Jul 15, 2022 · Updated 4 years ago
hikopensource / DAVAR-Lab-OCR
OCR toolbox from Davar-Lab
☆762 · Jun 29, 2026 · Updated 3 weeks ago
HCIILAB / EPHOIE
☆110 · Feb 16, 2021 · Updated 5 years ago
littletomatodonkey / Augment-XY-CUT
An unofficial implementation of augment-XY-CUT from XYLayoutLM
☆30 · Jul 12, 2022 · Updated 4 years ago
ZZR8066 / SEMv2
☆71 · Jun 26, 2024 · Updated 2 years ago
bilal-rachik / Information-extraction-from-document
Graph Key Information Extraction: GKIE
☆11 · Sep 15, 2022 · Updated 3 years ago
PaddlePaddle / VIMER
Repository of visual pre-trained foundation models
☆500 · Apr 12, 2023 · Updated 3 years ago
AlibabaResearch / AdvancedLiterateMachinery
A collection of original, innovative ideas and algorithms towards Advanced Literate Machinery. This project is maintained by the OCR Team…
☆1,833 · Mar 17, 2026 · Updated 4 months ago
JiaquanYe / TableMASTER-mmocr
2nd-place solution for the ICDAR 2021 Competition on Scientific Literature Parsing, Task B.
☆470 · Jul 4, 2022 · Updated 4 years ago
uakarsh / latr
Implementation of LaTr: Layout-aware transformer for scene-text VQA, a novel multimodal architecture for Scene Text Visual Question Answer…
☆56 · Oct 30, 2024 · Updated last year
JPLeoRX / detectron2-publaynet
Trained Detectron2 object detection models for document layout analysis, based on the PubLayNet dataset
☆50 · Apr 16, 2023 · Updated 3 years ago
machine-intelligence-laboratory / DDI-100
Distorted Document Images dataset (DDI-100).
☆146 · Nov 1, 2022 · Updated 3 years ago
Theivaprakasham / layoutlmv3
This repository contains all my experiments performed on the LayoutLMv3 model.
☆36 · Aug 11, 2022 · Updated 3 years ago
namtuanly / WikiTableSet
WikiTableSet: The largest publicly available image-based table recognition dataset in three languages, built from Wikipedia
☆32 · Jun 12, 2025 · Updated last year
MAEHCM / AET
Code for the AAAI 2023 paper “Alignment-Enriched Tuning for Patch-Level Pre-trained Document Image Models”
☆18 · Dec 6, 2022 · Updated 3 years ago
microsoft / i-Code
☆1,705 · Sep 27, 2024 · Updated last year
entropy2333 / awesome-key-information-extraction
A curated list of papers about key information extraction.
☆107 · Jul 8, 2026 · Updated 2 weeks ago