BordiaS/layoutlm

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/BordiaS/layoutlm)

BordiaS / layoutlm

☆97

Alternatives and similar repositories for layoutlm

Users that are interested in layoutlm are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

cydal / LayoutLM_pytorch
View on GitHub
Text and Layout Document Image Understanding. LayoutLM
☆22Sep 22, 2021Updated 4 years ago
omarsou / layoutlm_CORD
View on GitHub
Evaluation of the Layoutlm model on the CORD dataset
☆32Feb 4, 2022Updated 4 years ago
prohandler / GS-Bulk-Emails
View on GitHub
Google App Scripts that sends a number of emails from the specific number and that tracks the open status of each email
☆17Dec 11, 2024Updated last year
ruifcruz / sroie-on-layoutlm
View on GitHub
☆42Feb 6, 2021Updated 5 years ago
microsoft / TAP
View on GitHub
TAP: Text-Aware Pre-training for Text-VQA and Text-Caption, CVPR 2021 (Oral)
☆72May 22, 2023Updated 3 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
wenwenyu / PICK-pytorch
View on GitHub
Code for the paper "PICK: Processing Key Information Extraction from Documents using Improved Graph Learning-Convolutional Networks" (ICP…
☆568Jul 25, 2024Updated 2 years ago
bikash / DocumentUnderstanding
View on GitHub
Research papers and code on information extraction from image/pdf
☆97Nov 25, 2022Updated 3 years ago
zhaominyiz / EPiDA
View on GitHub
Official Code for 'EPiDA: An Easy Plug-in Data Augmentation Framework for High Performance Text Classification' - NAACL 2022
☆23May 9, 2022Updated 4 years ago
beacandler / EATEN
View on GitHub
EATEN: Entity-aware Attention for Single Shot Visual Text Extraction
☆183Dec 29, 2019Updated 6 years ago
clovaai / cord
View on GitHub
CORD: A Consolidated Receipt Dataset for Post-OCR Parsing
☆486Jul 20, 2022Updated 4 years ago
vaibhavshukla182 / extracting_text_information_using_YOLO
View on GitHub
☆13Oct 31, 2018Updated 7 years ago
allanj / LayoutLMv3-DocVQA
View on GitHub
Example codebase for fine-tuning layoutLMv3 on DocVQA
☆53Sep 19, 2022Updated 3 years ago
shabie / docformer
View on GitHub
Implementation of DocFormer: End-to-End Transformer for Document Understanding, a multi-modal transformer based architecture for the task…
☆290Feb 13, 2023Updated 3 years ago
benywon / ComQA
View on GitHub
Comostional question answering
☆17Jun 18, 2021Updated 5 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
zhaominyiz / STIRER
View on GitHub
STIRER: A Unified Model for Low-Resolution Scene Text Image Recovery and Recognition -- ACMMM 2023
☆14Dec 2, 2024Updated last year
doc-analysis / DocBank
View on GitHub
DocBank: A Benchmark Dataset for Document Layout Analysis
☆653Aug 12, 2024Updated last year
terrierteam / pyterrier_t5
View on GitHub
☆17Apr 30, 2026Updated 2 months ago
xiaojino / RUArt
View on GitHub
RUArt: A Novel Text-Centered Solution for Text-Based Visual Question Answering
☆10Nov 27, 2022Updated 3 years ago
rohitsaluja22 / OCR-On-the-go
View on GitHub
For ICDAR 2019 Paper on End-to-end License Plate and Scene Text Recognition with multi-head attention models
☆25Aug 14, 2021Updated 4 years ago
ibm-aur-nlp / PubLayNet
View on GitHub
☆1,053Jul 9, 2025Updated last year
yashkant / sam-textvqa
View on GitHub
Official code for paper "Spatially Aware Multimodal Transformers for TextVQA" published at ECCV, 2020.
☆65Sep 15, 2021Updated 4 years ago
bilal-rachik / Information-extraction-from-document
View on GitHub
Graph Key Information Extraction: GKIE
☆11Sep 15, 2022Updated 3 years ago
bytedance / VTVQA
View on GitHub
Towards Video Text Visual Question Answering: Benchmark and Baseline
☆41Feb 26, 2024Updated 2 years ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
doc-analysis / XFUND
View on GitHub
XFUND: A Multilingual Form Understanding Benchmark
☆223Jul 15, 2022Updated 4 years ago
doc-analysis / DocBankLoader
View on GitHub
DocBankLoader is a dataset loader for DocBank, and can convert DocBank to the Object Detection models' format.
☆24Mar 17, 2021Updated 5 years ago
cvzoya / visuallydata
View on GitHub
A large-scale infographics dataset from Visual.ly with metadata and additional crowdsourced annotations
☆17Oct 8, 2018Updated 7 years ago
aioz-ai / CFR_VQA
View on GitHub
Coarse-to-Fine Reasoning for Visual Question Answering (CVPRW'22)
☆49Apr 22, 2026Updated 3 months ago
uakarsh / latr
View on GitHub
Implementation of LaTr: Layout-aware transformer for scene-text VQA,a novel multimodal architecture for Scene Text Visual Question Answer…
☆56Updated this week
applicaai / lambert
View on GitHub
Publicly released code for the LAMBERT model
☆106Jun 14, 2021Updated 5 years ago
HCIILAB / EPHOIE
View on GitHub
☆110Feb 16, 2021Updated 5 years ago
doc-analysis / TableBank
View on GitHub
TableBank: A Benchmark Dataset for Table Detection and Recognition
☆1,080Aug 12, 2024Updated last year
luogen1996 / LWTransformer
View on GitHub
Lightweight Transformer for Multi-modal Tasks
☆16Dec 9, 2022Updated 3 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
LivingSkyTechnologies / Dense_Article_Dataset_DAD
View on GitHub
Dense Article Dataset (DAD): A Benchmark Dataset for Document Layout Analysis
☆16Jan 13, 2022Updated 4 years ago
husterpzh / PSSR
View on GitHub
Official code for the paper: "Perception and Semantic Aware Regularization for Sequential Confidence Calibration （CVPR2023）"
☆10May 15, 2024Updated 2 years ago
dot-legal / reference
View on GitHub
Write beautifully short contract. https://reference.legal/ is a referenceable clause library to standardize contracts once and for all.
☆13Jul 12, 2022Updated 4 years ago
Academic-Hammer / SciTSR
View on GitHub
Table structure recognition dataset of the paper: Complicated Table Structure Recognition
☆384Jul 7, 2020Updated 6 years ago
MBAigner / PDFSegmenter
View on GitHub
This library builds a graph-representation of the content of PDFs. The graph is then clustered, resulting page segments are classified an…
☆23Sep 11, 2020Updated 5 years ago
CharlesWu123 / SPLERGE
View on GitHub
Deep Splitting and Merging for Table Structure Decomposition
☆67Jul 23, 2023Updated 3 years ago
VDIGPKU / STR_TPSearch
View on GitHub
☆21Mar 15, 2022Updated 4 years ago