google-research-datasets/vrdu

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/google-research-datasets/vrdu)

google-research-datasets / vrdu

We identify the desiderata for a comprehensive benchmark and propose Visually Rich Document Understanding (VRDU). VRDU contains two datasets that represent several challenges: rich schema including diverse data types, complex templates, and diversity of layouts within a single document type.

☆83

Alternatives and similar repositories for vrdu

Users that are interested in vrdu are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

jfkuang / CFAM
View on GitHub
Contrast-guided Feature Adjustment Module for Visual Information Extraction
☆30May 23, 2023Updated 3 years ago
SCUT-DLVCLab / Document-AI-Recommendations
View on GitHub
Algorithms, papers, datasets, performance comparisons for Document AI.
☆209Mar 1, 2025Updated last year
NormXU / ERNIE-Layout-Pytorch
View on GitHub
An unofficial Pytorch implementation of ERNIE-Layout which is originally released through PaddleNLP.
☆107Nov 15, 2023Updated 2 years ago
rossumai / docile
View on GitHub
DocILE: Document Information Localization and Extraction Benchmark
☆149Jun 17, 2026Updated last month
HCIILAB / M6Doc
View on GitHub
☆164May 8, 2025Updated last year
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
Xiaomeng-Yang / STR_benchmark_cleansed
View on GitHub
☆14May 26, 2023Updated 3 years ago
jiangxiluning / MASTER-TF
View on GitHub
MASTER
☆140Mar 24, 2023Updated 3 years ago
jaywalnut310 / linear-transformer-for-table-recognition
View on GitHub
code for participation in ICDAR2021 Table Recognition track (Team Name: LTIAYN = Kaen Context)
☆22Jun 16, 2021Updated 5 years ago
DCGM / SoftCTC
View on GitHub
This repository contains source codes for SoftCTC. Original paper can be found here: https://arxiv.org/abs/2212.02135
☆19Mar 7, 2023Updated 3 years ago
HCIILAB / EPHOIE
View on GitHub
☆110Feb 16, 2021Updated 5 years ago
entropy2333 / awesome-key-information-extraction
View on GitHub
A curated list of papers about key information extraction.
☆107Jul 8, 2026Updated 2 weeks ago
shabie / docformer
View on GitHub
Implementation of DocFormer: End-to-End Transformer for Document Understanding, a multi-modal transformer based architecture for the task…
☆290Feb 13, 2023Updated 3 years ago
sachinraja13 / TabStructNet
View on GitHub
☆132Mar 24, 2023Updated 3 years ago
hpanwar08 / document-layout-analysis-app
View on GitHub
Simple docker deployment of document layout analysis using detectron2
☆19Nov 7, 2021Updated 4 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
jpWang / LiLT
View on GitHub
Official PyTorch implementation of LiLT: A Simple yet Effective Language-Independent Layout Transformer for Structured Document Understan…
☆366Oct 31, 2022Updated 3 years ago
large-ocr-model / large-ocr-model.github.io
View on GitHub
☆189Feb 27, 2024Updated 2 years ago
NormXU / DocParser-Pytorch
View on GitHub
An unofficial Implementation of DocParser: End-to-end OCR-free Information Extraction from Visually Rich Documents
☆38Sep 9, 2023Updated 2 years ago
guoxy25 / Ocean-OCR
View on GitHub
☆48Feb 7, 2025Updated last year
RichSu95 / Document_Binarization_Collection
View on GitHub
This repository is a concise collection of well known deep learning based document binarization models.
☆30Dec 24, 2022Updated 3 years ago
amazon-science / glass-text-spotting
View on GitHub
Official implementation for "GLASS: Global to Local Attention for Scene-Text Spotting" (ECCV'22)
☆102Jun 28, 2024Updated 2 years ago
doc-analysis / ReadingBank
View on GitHub
ReadingBank: A Benchmark Dataset for Reading Order Detection
☆117Aug 26, 2024Updated last year
buptlihang / CDLA
View on GitHub
CDLA: A Chinese document layout analysis (CDLA) dataset
☆294Sep 13, 2021Updated 4 years ago
IBM / ICDAR2021-SLP
View on GitHub
ICDAR 2021 Competition on Scientific Literature Parsing
☆35Aug 20, 2020Updated 5 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
gwxie / Distorted-Image-With-Flow
View on GitHub
☆43Mar 26, 2022Updated 4 years ago
wangwen-whu / WTW-Dataset
View on GitHub
This is an official implementation for the WTW Dataset in "Parsing Table Structures in the Wild " on table detection and table structure …
☆184Sep 15, 2021Updated 4 years ago
castorini / dhr
View on GitHub
Dense hybrid representations for text retrieval
☆65Apr 3, 2023Updated 3 years ago
JiaquanYe / TableMASTER-mmocr
View on GitHub
2nd solution of ICDAR 2021 Competition on Scientific Literature Parsing, Task B.
☆470Jul 4, 2022Updated 4 years ago
doc-analysis / XFUND
View on GitHub
XFUND: A Multilingual Form Understanding Benchmark
☆223Jul 15, 2022Updated 4 years ago
microsoft / table-transformer
View on GitHub
Table Transformer (TATR) is a deep learning model for extracting tables from unstructured documents (PDFs and images). This is also the o…
☆2,930Jun 24, 2024Updated 2 years ago
mxin262 / ESTextSpotter
View on GitHub
(ICCV 2023) ESTextSpotter: Towards Better Scene Text Spotting with Explicit Synergy in Transformer
☆78Apr 9, 2024Updated 2 years ago
clovaai / bros
View on GitHub
☆163Dec 27, 2022Updated 3 years ago
NormXU / Layout2Graph
View on GitHub
An official implementation of paper "Paragraph2Graph: A Language-independent GNN-based framework for layout analysis"
☆82Oct 14, 2023Updated 2 years ago
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
Chuhanxx / FontAdaptor
View on GitHub
Data and implementation of ECCV2020 paper 'Adaptive Text Recognition through Visual Matching'
☆124Nov 22, 2022Updated 3 years ago
wenwenyu / MASTER-pytorch
View on GitHub
Code for the paper "MASTER: Multi-Aspect Non-local Network for Scene Text Recognition" (Pattern Recognition 2021)
☆281Dec 26, 2021Updated 4 years ago
tingyaohsu / SciCap
View on GitHub
SciCap Dataset
☆59Nov 5, 2021Updated 4 years ago
ZeningLin / ViBERTgrid-PyTorch
View on GitHub
An unofficial PyTorch implementation of "Lin et al. ViBERTgrid: A Jointly Trained Multi-Modal 2D Document Representation for Key Informat…
☆53Jan 9, 2024Updated 2 years ago
clovaai / spade
View on GitHub
☆82Jun 12, 2023Updated 3 years ago
ibm-aur-nlp / PubLayNet
View on GitHub
☆1,052Jul 9, 2025Updated last year
Mountchicken / Union14M
View on GitHub
[ICCV 2023] Code base for Revisiting Scene Text Recognition: A Data Perspective
☆206Nov 1, 2023Updated 2 years ago