NormXU/DocParser-Pytorch

An unofficial implementation of DocParser: End-to-end OCR-free Information Extraction from Visually Rich Documents

☆38

Alternatives and similar repositories for DocParser-Pytorch

Users who are interested in DocParser-Pytorch are comparing it to the libraries listed below.

mosecorg / numbin
An efficient binary serialization format for numerical data.
☆18 · Nov 3, 2025 · Updated 8 months ago
GuangtaoLyu / PSSTRNet
☆13 · Jul 28, 2024 · Updated last year
vis-nlp / OpenCQA
☆13 · Jun 20, 2023 · Updated 3 years ago
NormXU / ERNIE-Layout-Pytorch
An unofficial Pytorch implementation of ERNIE-Layout which is originally released through PaddleNLP.
☆107 · Nov 15, 2023 · Updated 2 years ago
NormXU / Layout2Graph
An official implementation of paper "Paragraph2Graph: A Language-independent GNN-based framework for layout analysis"
☆82 · Oct 14, 2023 · Updated 2 years ago
SCUT-DLVCLab / Document-AI-Recommendations
Algorithms, papers, datasets, performance comparisons for Document AI.
☆209 · Mar 1, 2025 · Updated last year
Weifeng2Wu / ICDAR-2023-DTT-in-Images-1
☆12 · Mar 20, 2023 · Updated 3 years ago
huggingface / docmatix
A huge dataset for Document Visual Question Answering
☆24 · Jul 29, 2024 · Updated last year
entropy2333 / awesome-key-information-extraction
A curated list of papers about key information extraction.
☆107 · Jul 8, 2026 · Updated last week
clovaai / bros
☆163 · Dec 27, 2022 · Updated 3 years ago
tanguymagne / UVDoc
Code for the paper "UVDoc: Neural Grid-based Document Unwarping"
☆222 · Jul 28, 2024 · Updated last year
cklapperich / DocumentContextExtractor
☆12 · Jan 25, 2025 · Updated last year
dswang2011 / DocLLM
DocLLM: A layout-aware generative language model for multimodal document understanding
☆142 · Jan 3, 2024 · Updated 2 years ago
Rapisurazurite / FFDN
Implementation for Enhancing Tampered Text Detection Through Frequency Feature Fusion and Decomposition
☆30 · Feb 26, 2025 · Updated last year
HCIILAB / LAST
Read Ten Lines at One Glance: Line-Aware Semi-Autoregressive Transformer for Multi-Line Handwritten Mathematical Expression Recognition
☆28 · Aug 29, 2023 · Updated 2 years ago
GilhanPark / Korean_license_plate_recognition
Recognition KLP using Yolov4 + LPRnet🔥🔥
☆11 · Jan 5, 2022 · Updated 4 years ago
simplify23 / Light-STR-Competition-No.5
5th place on the final leaderboard of the Lightweight Text Recognition Technology Innovation Competition
☆15 · Jul 15, 2021 · Updated 5 years ago
namtuanly / WikiTableSet
WikiTableSet: A largest publicly available image-based table recognition dataset in three languages built from Wikipedia
☆32 · Jun 12, 2025 · Updated last year
bilal-rachik / Information-extraction-from-document
Graph Key Information Extraction: GKIE
☆11 · Sep 15, 2022 · Updated 3 years ago
MaxKinny / TabRecSet
A large scale camera-taken table detection and recognition dataset.
☆150 · Apr 9, 2026 · Updated 3 months ago
furkanbiten / idl_data
OCR Annotations from Amazon Textract for Industry Documents Library
☆103 · Aug 20, 2022 · Updated 3 years ago
google-research-datasets / vrdu
We identify the desiderata for a comprehensive benchmark and propose Visually Rich Document Understanding (VRDU). VRDU contains two datas…
☆83 · Feb 8, 2023 · Updated 3 years ago
IBM / SynthTabNet
Dataset of PNG images from synthetically generated table layouts with annotations in JSONL files
☆154 · Sep 17, 2025 · Updated 10 months ago
ZeningLin / ViBERTgrid-PyTorch
An unofficial PyTorch implementation of "Lin et al. ViBERTgrid: A Jointly Trained Multi-Modal 2D Document Representation for Key Informat…
☆53 · Jan 9, 2024 · Updated 2 years ago
NormXU / nougat-latex-ocr
Codebase for fine-tuning / evaluating nougat-based image2latex generation models
☆160 · Sep 25, 2024 · Updated last year
shannanyinxiang / ViTEraser
Official implementation of ViTEraser: Harnessing the Power of Vision Transformers for Scene Text Removal with SegMIM Pretraining (AAAI 20…
☆66 · Jul 4, 2024 · Updated 2 years ago
YukunLi99 / CoLeCLIP
CoLeCLIP: Open-Domain Continual Learning via Joint Task Prompt and Vocabulary Learning
☆17 · Mar 21, 2024 · Updated 2 years ago
RamonKaspar / MathPrompter
MathPrompter Implementation: This repository hosts an implementation based on the 'MathPrompter: Mathematical Reasoning Using Large Langu…
☆16 · Apr 12, 2025 · Updated last year
PRITHIVSAKTHIUR / OCR-ReportLab-Notebooks
A dedicated Colab notebooks to experiment (Nanonets OCR, Monkey OCR, OCRFlux 3B, Typhoo OCR 3B & more..) On T4 GPU - free tier
☆25 · Feb 12, 2026 · Updated 5 months ago
sufenlp / MiLoRA
[NAACL 2025] MiLoRA: Harnessing Minor Singular Components for Parameter-Efficient LLM Finetuning
☆21 · May 31, 2025 · Updated last year
wzx99 / CLIPOCR
☆38 · Oct 20, 2023 · Updated 2 years ago
dennislamcv1 / IBMDEXCELR
IBM Data Analytics with Excel and R Professional Certificate
☆10 · Mar 6, 2026 · Updated 4 months ago
namtuanly / MTL-TabNet
MTL-TabNet: Multi-task Learning based Model for Image-based Table Recognition
☆103 · May 30, 2024 · Updated 2 years ago
Noba1anc3 / Document-Analysis-Recognition
☆17 · Jan 23, 2021 · Updated 5 years ago
KahimWong / ADCD-Net
[ICCV'25] ADCD-Net: Robust Document Image Forgery Localization via Adaptive DCT Feature and Hierarchical Content Disentanglement
☆26 · Mar 29, 2026 · Updated 3 months ago
jinpeng0528 / STAR
Code release for "Saving 100x Storage: Prototype Replay for Reconstructing Training Sample Distribution in Class-Incremental Semantic Seg…
☆20 · Mar 19, 2025 · Updated last year
jfkuang / CFAM
Contrast-guided Feature Adjustment Module for Visual Information Extraction
☆30 · May 23, 2023 · Updated 3 years ago
fidler-lab / hila
Official PyTorch code for HILA
☆28 · Nov 1, 2022 · Updated 3 years ago
jiwoogit / DCP-GAN
[CVPR 2024] Diversity-aware Channel Pruning for StyleGAN Compression
☆26 · Jul 23, 2025 · Updated 11 months ago