shabie/docformer

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/shabie/docformer)

shabie / docformer

Implementation of DocFormer: End-to-End Transformer for Document Understanding, a multi-modal transformer based architecture for the task of Visual Document Understanding (VDU)

☆290

Alternatives and similar repositories for docformer

Users that are interested in docformer are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

jpWang / LiLT
View on GitHub
Official PyTorch implementation of LiLT: A Simple yet Effective Language-Independent Layout Transformer for Structured Document Understan…
☆366Oct 31, 2022Updated 3 years ago
uakarsh / docformer
View on GitHub
Implementation of DocFormer: End-to-End Transformer for Document Understanding, a multi-modal transformer based architecture for the task…
☆24Aug 3, 2023Updated 2 years ago
applicaai / lambert
View on GitHub
Publicly released code for the LAMBERT model
☆106Jun 14, 2021Updated 5 years ago
clovaai / spade
View on GitHub
☆82Jun 12, 2023Updated 3 years ago
clovaai / bros
View on GitHub
☆163Dec 27, 2022Updated 3 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
tstanislawek / awesome-document-understanding
View on GitHub
A curated list of resources for Document Understanding (DU) topic
☆1,525Jun 2, 2023Updated 3 years ago
doc-analysis / DocBank
View on GitHub
DocBank: A Benchmark Dataset for Document Layout Analysis
☆653Aug 12, 2024Updated last year
microsoft / UDOP
View on GitHub
☆250Jan 22, 2023Updated 3 years ago
NormXU / Layout2Graph
View on GitHub
An official implementation of paper "Paragraph2Graph: A Language-independent GNN-based framework for layout analysis"
☆82Oct 14, 2023Updated 2 years ago
wenwenyu / PICK-pytorch
View on GitHub
Code for the paper "PICK: Processing Key Information Extraction from Documents using Improved Graph Learning-Convolutional Networks" (ICP…
☆568Jul 25, 2024Updated 2 years ago
furkanbiten / idl_data
View on GitHub
OCR Annotations from Amazon Textract for Industry Documents Library
☆103Aug 20, 2022Updated 3 years ago
SCUT-DLVCLab / Document-AI-Recommendations
View on GitHub
Algorithms, papers, datasets, performance comparisons for Document AI.
☆209Mar 1, 2025Updated last year
herobd / dessurt
View on GitHub
Official implementation for Dessurt: Document end-to-end self-supervised understanding and recognition transformer
☆62Jan 11, 2023Updated 3 years ago
uakarsh / latr
View on GitHub
Implementation of LaTr: Layout-aware transformer for scene-text VQA,a novel multimodal architecture for Scene Text Visual Question Answer…
☆56Updated this week
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
bikash / DocumentUnderstanding
View on GitHub
Research papers and code on information extraction from image/pdf
☆97Nov 25, 2022Updated 3 years ago
hikopensource / DAVAR-Lab-OCR
View on GitHub
OCR toolbox from Davar-Lab
☆762Jun 29, 2026Updated 3 weeks ago
NormXU / ERNIE-Layout-Pytorch
View on GitHub
An unofficial Pytorch implementation of ERNIE-Layout which is originally released through PaddleNLP.
☆107Nov 15, 2023Updated 2 years ago
uakarsh / TiLT-Implementation
View on GitHub
Implementation of the paper: Going Full-TILT Boogie on Document Understanding with Text-Image-Layout Transformer.
☆18Apr 23, 2023Updated 3 years ago
sachinraja13 / TabStructNet
View on GitHub
☆132Mar 24, 2023Updated 3 years ago
rossumai / docile
View on GitHub
DocILE: Document Information Localization and Extraction Benchmark
☆149Jun 17, 2026Updated last month
allenai / mmda
View on GitHub
multimodal document analysis
☆166May 14, 2026Updated 2 months ago
doc-analysis / ReadingBank
View on GitHub
ReadingBank: A Benchmark Dataset for Reading Order Detection
☆117Aug 26, 2024Updated last year
qurator-spk / eynollah
View on GitHub
Document Layout Analysis
☆408Updated this week
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
mxin262 / ESTextSpotter
View on GitHub
(ICCV 2023) ESTextSpotter: Towards Better Scene Text Spotting with Explicit Synergy in Transformer
☆78Apr 9, 2024Updated 2 years ago
jfkuang / CFAM
View on GitHub
Contrast-guided Feature Adjustment Module for Visual Information Extraction
☆30May 23, 2023Updated 3 years ago
clovaai / synthtiger
View on GitHub
Official Implementation of SynthTIGER (Synthetic Text Image Generator), ICDAR 2021
☆579Jun 14, 2024Updated 2 years ago
philschmid / document-ai-transformers
View on GitHub
☆399Jan 7, 2024Updated 2 years ago
clovaai / cord
View on GitHub
CORD: A Consolidated Receipt Dataset for Post-OCR Parsing
☆485Jul 20, 2022Updated 4 years ago
clovaai / donut
View on GitHub
Official Implementation of OCR-free Document Understanding Transformer (Donut) and Synthetic Document Generator (SynthDoG), ECCV 2022
☆6,908Jul 11, 2024Updated 2 years ago
cneud / ocr-gt
View on GitHub
OCR & Ground Truth Resources
☆78May 3, 2022Updated 4 years ago
adlnlp / doc_gcn
View on GitHub
☆19May 30, 2023Updated 3 years ago
byeonghu-na / MATRN
View on GitHub
Official PyTorch implementation for Multi-modal Text Recognition Networks: Interactive Enhancements between Visual and Semantic Features …
☆74Jun 24, 2023Updated 3 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
Academic-Hammer / SciTSR
View on GitHub
Table structure recognition dataset of the paper: Complicated Table Structure Recognition
☆383Jul 7, 2020Updated 6 years ago
HCIILAB / EPHOIE
View on GitHub
☆110Feb 16, 2021Updated 5 years ago
doc-analysis / XFUND
View on GitHub
XFUND: A Multilingual Form Understanding Benchmark
☆223Jul 15, 2022Updated 4 years ago
ibm-aur-nlp / PubLayNet
View on GitHub
☆1,053Jul 9, 2025Updated last year
google-research-datasets / hiertext
View on GitHub
The HierText dataset contains ~12k images from the Open Images dataset v6 with large amount of text entities. We provide word, line and p…
☆316Dec 2, 2024Updated last year
ibm-aur-nlp / PubTabNet
View on GitHub
☆484Jul 8, 2025Updated last year
andreagemelli / doc2graph
View on GitHub
Doc2Graph transforms documents into graphs and exploit a GNN to solve several tasks.
☆139Oct 18, 2025Updated 9 months ago