xhli-git/DocSAM

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/xhli-git/DocSAM)

xhli-git / DocSAM

☆33

Alternatives and similar repositories for DocSAM

Users that are interested in DocSAM are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

thanhnghiadk / syntactic_HME_generation
View on GitHub
This project aims to generate syntactichandwritten mathematical expression. The dataset is generated from the CROHME 2014 training set.
☆14Feb 24, 2022Updated 4 years ago
whlscut / DocLayLLM
View on GitHub
[CVPR 2025] DocLayLLM: An Efficient Multi-modal Extension of Large Language Models for Text-rich Document Understanding
☆30Dec 18, 2025Updated 7 months ago
TenMilesLotus / DTSM
View on GitHub
Code and data for the paper: DTSM: Toward Dense Table Structure Recognition with Text Query Encoder and Adjacent Feature Aggregator
☆13Apr 28, 2024Updated 2 years ago
HCIILAB / LAST
View on GitHub
Read Ten Lines at One Glance: Line-Aware Semi-Autoregressive Transformer for Multi-Line Handwritten Mathematical Expression Recognition
☆28Aug 29, 2023Updated 2 years ago
felix-schmitt / MathNet
View on GitHub
MathNet: A Data-Centric Approach, Dataset and Benchmark Model to Advance Mathematical Expression Recognition
☆10Mar 19, 2025Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
buaacxf / VIPTR
View on GitHub
☆44Jul 9, 2024Updated 2 years ago
Howrunz / SSAN
View on GitHub
Official implementation for AAAI 2025 paper: SSAN: A Symbol Spatial-Aware Network for Handwritten Mathematical Expression Recognition
☆16Jan 21, 2025Updated last year
EDM-Research / VATr-pp
View on GitHub
☆18Jul 9, 2024Updated 2 years ago
RylonW / DocNLC
View on GitHub
Official code for DocNLC: A Document Image Enhancement Framework with Normalized and Latent Contrastive Representation for Multiple Degra…
☆44Mar 20, 2026Updated 4 months ago
tanguymagne / UVDoc-Dataset
View on GitHub
Code for the paper "UVDoc: Neural Grid-based Document Unwarping" - Dataset capture and creation
☆35May 27, 2024Updated 2 years ago
DCGM / SoftCTC
View on GitHub
This repository contains source codes for SoftCTC. Original paper can be found here: https://arxiv.org/abs/2212.02135
☆19Mar 7, 2023Updated 3 years ago
yufanchen96 / GraphDoc
View on GitHub
Graph-based Document Structure Analysis
☆18Mar 26, 2025Updated last year
FelixHertlein / doc-matcher
View on GitHub
Inference, training and evaluation code for our paper "DocMatcher: Document Image Dewarping via Structural and Textual Line Matching" (WA…
☆55Jul 1, 2025Updated last year
irisXcoding / DocReal
View on GitHub
DocReal: Robust Document Dewarping of Real-Life Images via Attention-Enhanced Control Point Prediction
☆30Jun 28, 2023Updated 3 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
shannanyinxiang / UPOCR
View on GitHub
Official implementation of UPOCR: Towards unified pixel-level OCR interface (ICML 2024)
☆69Jun 6, 2024Updated 2 years ago
IITB-LEAP-OCR / SPRINT
View on GitHub
SPRINT: Script-agnostic Structure Recognition in Tables
☆16Mar 26, 2025Updated last year
ZZZHANG-jx / DocAligner
View on GitHub
[PR 2025] DocAligner: Automating the Annotation of Photographed Documents Through Real-virtual Alignment
☆110Aug 4, 2025Updated 11 months ago
AiArt-Gao / HMEG
View on GitHub
[CVPR'24] Handwritten Mathematical Expressions Generation (HMEG)
☆34Jun 3, 2024Updated 2 years ago
shuyansy / Visual-Text-Processing-survey
View on GitHub
The official project of paper "Visual Text Processing: A Comprehensive Review and Unified Evaluation""
☆103Oct 20, 2025Updated 9 months ago
ZZZHANG-jx / DocKylin
View on GitHub
[AAAI 2025] DocKylin: A Large Multimodal Model for Visual Document Understanding with Efficient Visual Slimming
☆36Jun 1, 2025Updated last year
MaxKinny / TabRecSet
View on GitHub
A large scale camera-taken table detection and recognition dataset.
☆150Apr 9, 2026Updated 3 months ago
aimagelab / Emuru-autoregressive-text-img
View on GitHub
Official PyTorch implementation for "Zero-Shot Styled Text Image Generation, but Make It Autoregressive" (CVPR25)
☆29Jul 31, 2025Updated 11 months ago
RaymondMcGuire / BOOK-CONTENT-SEGMENTATION-AND-DEWARPING
View on GitHub
Using FCN to segment the book's content and background, then dewarping the pages,
☆21Oct 9, 2021Updated 4 years ago
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
wenwenyu / MASTER-pytorch
View on GitHub
Code for the paper "MASTER: Multi-Aspect Non-local Network for Scene Text Recognition" (Pattern Recognition 2021)
☆281Dec 26, 2021Updated 4 years ago
JacobSRPage / super-res-dynamical
View on GitHub
☆12Dec 13, 2024Updated last year
caipeng328 / ForCenNet
View on GitHub
☆81Jul 31, 2025Updated 11 months ago
CU-DitecT / ECML-PKDD22-TrafficFlowGAN
View on GitHub
☆11Dec 23, 2024Updated last year
qingzhenduyu / TAMER
View on GitHub
Official implementation for AAAI 2025 paper: TAMER: Tree-Aware Transformer for Handwritten Mathematical Expression Recognition
☆37Jul 28, 2025Updated 11 months ago
SCUT-DLVCLab / OCR-Reasoning
View on GitHub
[ICLR 2026] OCR-Reasoning Benchmark: Unveiling the True Capabilities of MLLMs in Complex Text-Rich Image Reasoning
☆76May 26, 2026Updated last month
xdxie / WAS_WordArt-Segmentation
View on GitHub
The official codes and datasets for Artistic Text Segmentation (ECCV 2024).
☆30Sep 24, 2025Updated 9 months ago
MAEHCM / ICL-D3IE
View on GitHub
Code for ICCV 2023 Paper : “ICL-D3IE: In-Context Learning with Diverse Demonstrations Updating for Document Information Extraction”
☆54Aug 8, 2023Updated 2 years ago
yeungchenwa / HDR
View on GitHub
[AAAI2025 Oral] Predicting the Original Appearance of Damaged Historical Documents
☆111Jun 28, 2026Updated 3 weeks ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
justliulong / OGHFYOLO
View on GitHub
The official code for "OG-HFYOLO :Orientation Gradient Guidance and Heterogeneous Feature Fusion For Deformation Table Cell Instance Segm…
☆13Jul 28, 2025Updated 11 months ago
SCUT-DLVCLab / RFUND
View on GitHub
[MM'2024] Official release of RFUND introduced in the MM'2024 paper "PEneo: Unifying Line Extraction, Line Grouping, and Entity Linking f…
☆21Dec 4, 2024Updated last year
ZZZHANG-jx / Marior
View on GitHub
[ACM MM 2022] Marior: Margin Removal and Iterative Content Rectification for Document Dewarping in the Wild
☆26Aug 12, 2022Updated 3 years ago
ispamm / NAF-DPM
View on GitHub
NAF-DPM: A Nonlinear Activation-Free Diffusion Probabilistic Model for Document Enhancement
☆54Aug 5, 2024Updated last year
koninik / awesome-handwritten-text-generation
View on GitHub
This repo contains a curated list of research papers and resources focusing on Handwritten Text Generation (HTG)
☆24Jan 20, 2026Updated 6 months ago
LayTextLLM / LayTextLLM
View on GitHub
☆103Dec 23, 2024Updated last year
qingzhenduyu / ICAL
View on GitHub
Official implementation for ICDAR 2024 Oral paper "ICAL: Implicit Character-Aided Learning for Enhanced Handwritten Mathematical Expressi…
☆29Aug 16, 2024Updated last year