TenMilesLotus/DTSM

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/TenMilesLotus/DTSM)

TenMilesLotus / DTSM

Code and data for the paper: DTSM: Toward Dense Table Structure Recognition with Text Query Encoder and Adjacent Feature Aggregator

☆13

Alternatives and similar repositories for DTSM

Users that are interested in DTSM are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

lcy0604 / CTRNet-plus
View on GitHub
The official implement of CTRNet++.
☆15Dec 30, 2024Updated last year
ZZR8066 / SEM
View on GitHub
☆19Mar 10, 2023Updated 3 years ago
SCUT-DLVCLab / RFUND
View on GitHub
[MM'2024] Official release of RFUND introduced in the MM'2024 paper "PEneo: Unifying Line Extraction, Line Grouping, and Entity Linking f…
☆21Dec 4, 2024Updated last year
muhd-umer / pyramidtabnet
View on GitHub
Official PyTorch implementation of PyramidTabNet: Transformer-based Table Recognition in Image-based Documents
☆28Jun 8, 2026Updated last month
lcy0604 / QT-TextSR
View on GitHub
This repository is the implementation of "QT-TextSR: Enhancing scene text image super-resolution via efficient interaction with text reco…
☆20Jul 9, 2025Updated last year
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
IBM / KVP10k
View on GitHub
Repository for the KVP10k dataset
☆23Sep 18, 2025Updated 10 months ago
whlscut / DocLayLLM
View on GitHub
[CVPR 2025] DocLayLLM: An Efficient Multi-modal Extension of Large Language Models for Text-rich Document Understanding
☆30Dec 18, 2025Updated 7 months ago
Chunchunwumu / SEMv3
View on GitHub
The official PyTorch implementation of SEMv3.
☆53May 26, 2024Updated 2 years ago
ZZZHANG-jx / DocKylin
View on GitHub
[AAAI 2025] DocKylin: A Large Multimodal Model for Visual Document Understanding with Efficient Visual Slimming
☆36Jun 1, 2025Updated last year
IITB-LEAP-OCR / SPRINT
View on GitHub
SPRINT: Script-agnostic Structure Recognition in Tables
☆16Mar 26, 2025Updated last year
HCIILAB / LAST
View on GitHub
Read Ten Lines at One Glance: Line-Aware Semi-Autoregressive Transformer for Multi-Line Handwritten Mathematical Expression Recognition
☆28Aug 29, 2023Updated 2 years ago
SCUT-DLVCLab / OCR-Reasoning
View on GitHub
[ICLR 2026] OCR-Reasoning Benchmark: Unveiling the True Capabilities of MLLMs in Complex Text-Rich Image Reasoning
☆76May 26, 2026Updated last month
L597383845 / row-col-table-recognition
View on GitHub
time-series row column classification
☆14Jan 7, 2022Updated 4 years ago
justliulong / OGHFYOLO
View on GitHub
The official code for "OG-HFYOLO :Orientation Gradient Guidance and Heterogeneous Feature Fusion For Deformation Table Cell Instance Segm…
☆13Jul 28, 2025Updated 11 months ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
wangyuxin87 / Tampered-IC13
View on GitHub
-
☆24Oct 25, 2022Updated 3 years ago
sakura2233565548 / TabPedia
View on GitHub
This repository is the codebase of TabPedia: Towards Comprehensive Visual Table Understanding with Concept Synergy
☆51Oct 16, 2024Updated last year
SCUT-DLVCLab / SCUT-EnsExam
View on GitHub
SCUT-EnsExam is a real-world handwritten text erasure dataset for examination paper scenarios, which consists of 545 examination paper im…
☆21Updated this week
adlnlp / mmvqa
View on GitHub
☆19Sep 11, 2024Updated last year
Token-family / TokenFD
View on GitHub
[ICCV2025] A Token-level Text Image Foundation Model for Document Understanding
☆135Aug 27, 2025Updated 10 months ago
ZZZHANG-jx / WMeter-Reader
View on GitHub
[TIM 2025] Towards Accurate Readings of Water Meters by Eliminating Transition Error: New Dataset and Effective Solution
☆19Mar 5, 2025Updated last year
xhli-git / DocSAM
View on GitHub
☆33Apr 8, 2025Updated last year
naver-ai / trace
View on GitHub
TRACE: Table Reconstruction Aligned to Corner and Edges (ICDAR 2023)
☆32Mar 13, 2024Updated 2 years ago
ZZZHANG-jx / Marior
View on GitHub
[ACM MM 2022] Marior: Margin Removal and Iterative Content Rectification for Document Dewarping in the Wild
☆26Aug 12, 2022Updated 3 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
SCUT-DLVCLab / GPT-4V_OCR
View on GitHub
Evaluation of the Optical Character Recognition (OCR) capabilities of GPT-4V(ision)
☆128Nov 13, 2023Updated 2 years ago
MaxKinny / TabRecSet
View on GitHub
A large scale camera-taken table detection and recognition dataset.
☆150Apr 9, 2026Updated 3 months ago
GreatV / DocTrPP
View on GitHub
DocTr++ in PaddlePaddle
☆56Jul 24, 2024Updated last year
ZeningLin / PEneo
View on GitHub
[MM'2024] PEneo, an effective algorithm for key-value pair extraction from form-like documents, designed for real-world applications.
☆41Apr 7, 2025Updated last year
JG1VPP / MuTabNet
View on GitHub
ICDAR 2024/2026 Table OCR Model
☆39Jun 16, 2026Updated last month
BunnySoCrazy / LA-DocFlatten
View on GitHub
Code and Dataset for our paper: Layout-Aware Single-Image Document Flattening
☆24Dec 16, 2024Updated last year
johnning2333 / M2Doc
View on GitHub
☆43Jun 15, 2024Updated 2 years ago
YuHengsss / Q-Zoom
View on GitHub
☆15Apr 15, 2026Updated 3 months ago
shengfly / writer-identification
View on GitHub
☆11Jun 3, 2025Updated last year
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
shi-yx / URaG
View on GitHub
Official implementation of URaG: Unified Retrieval and Generation in Multimodal LLMs for Efficient Long Document Understanding (AAAI 2026…
☆43Feb 4, 2026Updated 5 months ago
HollyLee2000 / SeBoW-paddle
View on GitHub
This is the paddle code for SeBoW(Self-Born wiring for neural trees), a kind of neural tree born form a large search space
☆11Dec 10, 2021Updated 4 years ago
ZZZHANG-jx / GCDRNet
View on GitHub
[TAI 2023] Appearance Enhancement for Camera-captured Document Images in the Wild
☆58Aug 28, 2025Updated 10 months ago
yuyq96 / TextHawk
View on GitHub
Exploring Efficient Fine-Grained Perception of Multimodal Large Language Models
☆68Nov 1, 2024Updated last year
ZeningLin / ViBERTgrid-PyTorch
View on GitHub
An unofficial PyTorch implementation of "Lin et al. ViBERTgrid: A Jointly Trained Multi-Modal 2D Document Representation for Key Informat…
☆53Jan 9, 2024Updated 2 years ago
shannanyinxiang / ViTEraser
View on GitHub
Official implementation of ViTEraser: Harnessing the Power of Vision Transformers for Scene Text Removal with SegMIM Pretraining (AAAI 20…
☆66Jul 4, 2024Updated 2 years ago
yeungchenwa / HDR
View on GitHub
[AAAI2025 Oral] Predicting the Original Appearance of Damaged Historical Documents
☆111Jun 28, 2026Updated 3 weeks ago