ZeningLin/PEneo

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/ZeningLin/PEneo)

ZeningLin / PEneo

[MM'2024] PEneo, an effective algorithm for key-value pair extraction from form-like documents, designed for real-world applications.

☆41

Alternatives and similar repositories for PEneo

Users that are interested in PEneo are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

chongzhangFDU / TPP
View on GitHub
This is the official repository of the EMNLP 2023 paper Reading Order Matters: Information Extraction from Visually-rich Documents by Tok…
☆18Mar 15, 2024Updated 2 years ago
whlscut / DocLayLLM
View on GitHub
[CVPR 2025] DocLayLLM: An Efficient Multi-modal Extension of Large Language Models for Text-rich Document Understanding
☆30Dec 18, 2025Updated 7 months ago
JG1VPP / MuTabNet
View on GitHub
ICDAR 2024/2026 Table OCR Model
☆39Jun 16, 2026Updated last month
lcy0604 / QT-TextSR
View on GitHub
This repository is the implementation of "QT-TextSR: Enhancing scene text image super-resolution via efficient interaction with text reco…
☆20Jul 9, 2025Updated last year
SCUT-DLVCLab / Document-AI-Recommendations
View on GitHub
Algorithms, papers, datasets, performance comparisons for Document AI.
☆209Mar 1, 2025Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
SCUT-DLVCLab / RFUND
View on GitHub
[MM'2024] Official release of RFUND introduced in the MM'2024 paper "PEneo: Unifying Line Extraction, Line Grouping, and Entity Linking f…
☆21Dec 4, 2024Updated last year
poloclub / tsr-convstem
View on GitHub
High-Performance Transformers for Table Structure Recognition Need Early Convolutions
☆45Apr 21, 2026Updated 3 months ago
TenMilesLotus / DTSM
View on GitHub
Code and data for the paper: DTSM: Toward Dense Table Structure Recognition with Text Query Encoder and Adjacent Feature Aggregator
☆13Apr 28, 2024Updated 2 years ago
gayecolakoglu / LayIE-LLM
View on GitHub
☆15Jan 15, 2026Updated 6 months ago
Line-Kite / GraphLayoutLM
View on GitHub
☆14Sep 6, 2024Updated last year
yeungchenwa / HDR
View on GitHub
[AAAI2025 Oral] Predicting the Original Appearance of Damaged Historical Documents
☆111Jun 28, 2026Updated 3 weeks ago
HCIILAB / M5HisDoc
View on GitHub
☆34Dec 18, 2025Updated 7 months ago
adlnlp / doc_gcn
View on GitHub
☆19May 30, 2023Updated 3 years ago
GreatV / DocTrPP
View on GitHub
DocTr++ in PaddlePaddle
☆57Jul 24, 2024Updated 2 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
Sanster / OhMyTable
View on GitHub
Table Structure Recognition
☆28Jul 25, 2024Updated last year
RylonW / DocNLC
View on GitHub
Official code for DocNLC: A Document Image Enhancement Framework with Normalized and Latent Contrastive Representation for Multiple Degra…
☆44Mar 20, 2026Updated 4 months ago
tianchiguaixia / layoutlmv3-chinese
View on GitHub
该项目是为了使用layoutlmv3针对中文图片训练和推理。其中主要解决三个问题： 1.数据标准化成可以的训练数据集格式 2.layoutlmv3-base-chinese 分词修改 2.超过512长度的文本切分和滑窗操作
☆63Sep 6, 2024Updated last year
chenxn2020 / GOSE
View on GitHub
[Paper] Code for the EMNLP2023 (Findings) paper "Global Structure Knowledge-Guided Relation Extraction Method for Visually-Rich Document"
☆17Dec 1, 2023Updated 2 years ago
ZZZHANG-jx / DocAligner
View on GitHub
[PR 2025] DocAligner: Automating the Annotation of Photographed Documents Through Real-virtual Alignment
☆110Aug 4, 2025Updated 11 months ago
ZZZHANG-jx / Recommendations-Document-Image-Processing
View on GitHub
This repository contains a paper collection of the methods for document image processing, including appearance enhancement, deshadowing, …
☆394Jun 1, 2026Updated last month
huridocs / pdf-reading-order
View on GitHub
☆16Apr 26, 2024Updated 2 years ago
HCIILAB / LAST
View on GitHub
Read Ten Lines at One Glance: Line-Aware Semi-Autoregressive Transformer for Multi-Line Handwritten Mathematical Expression Recognition
☆28Aug 29, 2023Updated 2 years ago
WinterShiver / Token-Path-Prediction
View on GitHub
This is an unofficial implementation to the EMNLP 2023 paper: Reading Order Matters: Information Extraction from Visually-rich Documents …
☆16May 29, 2024Updated 2 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
shannanyinxiang / UPOCR
View on GitHub
Official implementation of UPOCR: Towards unified pixel-level OCR interface (ICML 2024)
☆69Jun 6, 2024Updated 2 years ago
FutureRising007 / Table_Structure_Recognition
View on GitHub
Table Structure Recognition
☆83Mar 11, 2023Updated 3 years ago
SCUT-DLVCLab / GPT-4V_OCR
View on GitHub
Evaluation of the Optical Character Recognition (OCR) capabilities of GPT-4V(ision)
☆128Nov 13, 2023Updated 2 years ago
LukeForeverYoung / UReader
View on GitHub
☆142Feb 13, 2024Updated 2 years ago
lizhh268 / ShadowMaskFormer
View on GitHub
[TAI 2025] Official implementation of TAI-accepted paper: ShadowMaskFormer: Mask Augmented Patch Embedding for Shadow Removal
☆15May 8, 2025Updated last year
shi-yx / URaG
View on GitHub
Official implementation of URaG: Unified Retrieval and Generation in Multimodal LLMs for Efficient Long Document Understanding (AAAI 2026…
☆43Feb 4, 2026Updated 5 months ago
ZeningLin / ViBERTgrid-PyTorch
View on GitHub
An unofficial PyTorch implementation of "Lin et al. ViBERTgrid: A Jointly Trained Multi-Modal 2D Document Representation for Key Informat…
☆53Jan 9, 2024Updated 2 years ago
HCIILAB / M6Doc
View on GitHub
☆164May 8, 2025Updated last year
mxin262 / ESTextSpotter
View on GitHub
(ICCV 2023) ESTextSpotter: Towards Better Scene Text Spotting with Explicit Synergy in Transformer
☆78Apr 9, 2024Updated 2 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
ZZZHANG-jx / DocRes
View on GitHub
[CVPR 2024] DocRes: A Generalist Model Toward Unifying Document Image Restoration Tasks
☆628Aug 3, 2025Updated 11 months ago
ZZZHANG-jx / DocKylin
View on GitHub
[AAAI 2025] DocKylin: A Large Multimodal Model for Visual Document Understanding with Efficient Visual Slimming
☆36Jun 1, 2025Updated last year
NormXU / Layout2Graph
View on GitHub
An official implementation of paper "Paragraph2Graph: A Language-independent GNN-based framework for layout analysis"
☆82Oct 14, 2023Updated 2 years ago
SCUT-DLVCLab / MegaHan97K
View on GitHub
[PR 2025] The official GitHub page of "MegaHan97K: A Large-Scale Dataset for Mega-Category Chinese Character Recognition with over 97K Ca…
☆84May 18, 2026Updated 2 months ago
Mountchicken / Union14M
View on GitHub
[ICCV 2023] Code base for Revisiting Scene Text Recognition: A Data Perspective
☆206Nov 1, 2023Updated 2 years ago
FreeOCR-AI / layoutreader
View on GitHub
A Faster LayoutReader Model based on LayoutLMv3, Sort OCR bboxes to reading order.
☆322Aug 15, 2025Updated 11 months ago
irisXcoding / DocReal
View on GitHub
DocReal: Robust Document Dewarping of Real-Life Images via Attention-Enhanced Control Point Prediction
☆30Jun 28, 2023Updated 3 years ago