herobd/dessurt

Official implementation for Dessurt: Document end-to-end self-supervised understanding and recognition transformer

☆62

Alternatives and similar repositories for dessurt

Users who are interested in dessurt are comparing it to the libraries listed below.

rossumai / docile
DocILE: Document Information Localization and Extraction Benchmark
☆149 · Jun 17, 2026 · Updated last month
WenjinW / LATIN-Prompt
☆52 · May 28, 2024 · Updated 2 years ago
herobd / FUDGE
Code for the ICDAR2021 paper "Visual FUDGE: Form Understanding via Dynamic Graph Editing"
☆33 · Mar 4, 2022 · Updated 4 years ago
jfkuang / CFAM
Contrast-guided Feature Adjustment Module for Visual Information Extraction
☆30 · May 23, 2023 · Updated 3 years ago
microsoft / UDOP
☆250 · Jan 22, 2023 · Updated 3 years ago
due-benchmark / baselines
The code related to the baselines from NeurIPS 2021 paper "DUE: End-to-End Document Understanding Benchmark."
☆36 · Mar 2, 2023 · Updated 3 years ago
huggingface / docmatix
A huge dataset for Document Visual Question Answering
☆24 · Jul 29, 2024 · Updated last year
naver-ai / cream
Visually-Situated Natural Language Understanding with Contrastive Reading Model and Frozen Large Language Models, EMNLP 2023
☆46 · Jun 11, 2024 · Updated 2 years ago
NormXU / DocParser-Pytorch
An unofficial Implementation of DocParser: End-to-end OCR-free Information Extraction from Visually Rich Documents
☆38 · Sep 9, 2023 · Updated 2 years ago
jpWang / LiLT
Official PyTorch implementation of LiLT: A Simple yet Effective Language-Independent Layout Transformer for Structured Document Understan…
☆366 · Oct 31, 2022 · Updated 3 years ago
wanghaisheng / ocr-arxiv-daily
☆19 · Jun 7, 2023 · Updated 3 years ago
furkanbiten / idl_data
OCR Annotations from Amazon Textract for Industry Documents Library
☆103 · Aug 20, 2022 · Updated 3 years ago
allanj / LayoutLMv3-DocVQA
Example codebase for fine-tuning layoutLMv3 on DocVQA
☆53 · Sep 19, 2022 · Updated 3 years ago
AILab-UniFI / cte-dataset
CTE: Contextualized Table Extraction Dataset
☆17 · Feb 23, 2023 · Updated 3 years ago
andreagemelli / doc2graph
Doc2Graph transforms documents into graphs and exploits a GNN to solve several tasks.
☆139 · Oct 18, 2025 · Updated 9 months ago
Ucas-HaoranWei / Vary-family
☆57 · Jan 23, 2024 · Updated 2 years ago
usydnlp / vdoc
☆15 · Sep 7, 2022 · Updated 3 years ago
littletomatodonkey / Augment-XY-CUT
Unofficial code for augment-XY-CUT in XYLayoutLM
☆30 · Jul 12, 2022 · Updated 4 years ago
google-research / pix2struct
☆685 · Jul 8, 2026 · Updated 2 weeks ago
FactoDeepLearning / LinePytorchOCR
☆17 · Feb 16, 2023 · Updated 3 years ago
FactoDeepLearning / DAN
☆12 · Jun 13, 2025 · Updated last year
ZZR8066 / GraphDoc
☆45 · Jul 18, 2022 · Updated 4 years ago
MAEHCM / AET
Code for AAAI 2023 Paper: "Alignment-Enriched Tuning for Patch-Level Pre-trained Document Image Models"
☆18 · Dec 6, 2022 · Updated 3 years ago
SCUT-DLVCLab / Document-AI-Recommendations
Algorithms, papers, datasets, performance comparisons for Document AI.
☆209 · Mar 1, 2025 · Updated last year
clovaai / units
☆78 · Aug 7, 2023 · Updated 2 years ago
ai-forever / ScrabbleGAN
Handwritten Text Generation
☆17 · Oct 17, 2022 · Updated 3 years ago
salesforce / QVR-SimpleDLM
Pytorch Implementation of Value Retrieval with Arbitrary Queries for Form-like Documents.
☆16 · May 1, 2025 · Updated last year
lulia0228 / Document_IE
GCN used for semi-structured document information extraction.
☆21 · Aug 5, 2023 · Updated 2 years ago
clovaai / synthtiger
Official Implementation of SynthTIGER (Synthetic Text Image Generator), ICDAR 2021
☆578 · Jun 14, 2024 · Updated 2 years ago
clovaai / spade
☆82 · Jun 12, 2023 · Updated 3 years ago
winter1203 / vllm_GOT2_OCR
Accelerating GOT-OCRv2 with VLLM
☆10 · Nov 15, 2024 · Updated last year
Dawars / DocMAE
Unofficial implementation of DocMAE (WIP): Document Image Rectification via Self-supervised Representation Learning
☆20 · Dec 20, 2023 · Updated 2 years ago
GoodNotes / GNHK-dataset
☆19 · Mar 28, 2022 · Updated 4 years ago
NiteshMethani / PlotQA
Dataset introduced in PlotQA: Reasoning over Scientific Plots
☆83 · Jun 20, 2023 · Updated 3 years ago
big-o / transvec
Translate word embeddings across models
☆10 · Aug 17, 2020 · Updated 5 years ago
ayanban011 / SwinDocSegmenter
[ICDAR 2023] (Oral) An End-to-End Unified Domain Adaptive Transformer for Document Instance Segmentation
☆74 · Sep 12, 2024 · Updated last year
tstanislawek / awesome-document-understanding
A curated list of resources on Document Understanding (DU)
☆1,525 · Jun 2, 2023 · Updated 3 years ago
thanhhau097 / chargrid2d
☆16 · Aug 12, 2021 · Updated 4 years ago
ucaslcl / Fox
Official code for "Fox: Focus Anywhere for Fine-grained Multi-page Document Understanding"
☆196 · May 31, 2024 · Updated 2 years ago