MaitySubhajit/SelfDocSeg

[ICDAR 2023] SelfDocSeg: A self-supervised vision-based approach towards Document Segmentation (Oral)

☆43

Alternatives and similar repositories for SelfDocSeg

Users interested in SelfDocSeg are comparing it to the repositories listed below.

ayanban011 / SwinDocSegmenter
[ICDAR 2023] (Oral) An End-to-End Unified Domain Adaptive Transformer for Document Instance Segmentation
☆74 · Sep 12, 2024 · Updated last year
johnning2333 / M2Doc
☆43 · Jun 15, 2024 · Updated 2 years ago
poloclub / tsr-convstem
High-Performance Transformers for Table Structure Recognition Need Early Convolutions
☆45 · Apr 21, 2026 · Updated 3 months ago
naver-ai / trace
TRACE: Table Reconstruction Aligned to Corner and Edges (ICDAR 2023)
☆32 · Mar 13, 2024 · Updated 2 years ago
Weifeng2Wu / ICDAR-2023-DTT-in-Images-1
☆12 · Mar 20, 2023 · Updated 3 years ago
ayanban011 / GraphKD
[ICDAR 2024] (Best Student Paper🏆) Exploring Knowledge Distillation Towards Document Object Detection with Structured Graph Creation
☆16 · Sep 6, 2024 · Updated last year
NormXU / Layout2Graph
An official implementation of the paper "Paragraph2Graph: A Language-independent GNN-based framework for layout analysis"
☆82 · Oct 14, 2023 · Updated 2 years ago
ayanban011 / SVGCraft
[WACV 2026 Round 1] Beyond Single Object Text-to-SVG Synthesis with Comprehensive Canvas Layout
☆24 · Oct 11, 2025 · Updated 9 months ago
adlnlp / doc_gcn
☆19 · May 30, 2023 · Updated 3 years ago
muhd-umer / pyramidtabnet
Official PyTorch implementation of PyramidTabNet: Transformer-based Table Recognition in Image-based Documents
☆28 · Jun 8, 2026 · Updated last month
JG1VPP / MuTabNet
ICDAR 2024/2026 Table OCR Model
☆39 · Jun 16, 2026 · Updated last month
samakos / Document-AI-
☆14 · Aug 31, 2023 · Updated 2 years ago
justliulong / OGHFYOLO
The official code for "OG-HFYOLO: Orientation Gradient Guidance and Heterogeneous Feature Fusion For Deformation Table Cell Instance Segm…
☆13 · Jul 28, 2025 · Updated 11 months ago
FutureRising007 / Table_Structure_Recognition
Table Structure Recognition
☆83 · Mar 11, 2023 · Updated 3 years ago
L597383845 / row-col-table-recognition
time-series row column classification
☆14 · Jan 7, 2022 · Updated 4 years ago
namtuanly / MTL-TabNet
MTL-TabNet: Multi-task Learning based Model for Image-based Table Recognition
☆103 · May 30, 2024 · Updated 2 years ago
ganji15 / HiGANplus
☆46 · Feb 7, 2023 · Updated 3 years ago
yufanchen96 / RoDLA
RoDLA: Benchmarking the Robustness of Document Layout Analysis Models
☆39 · Mar 26, 2025 · Updated last year
ZeningLin / PEneo
[MM'2024] PEneo, an effective algorithm for key-value pair extraction from form-like documents, designed for real-world applications.
☆41 · Apr 7, 2025 · Updated last year
opendatalab / TRivia
(CVPR 2026) TRivia: Self-supervised Fine-tuning of Vision-Language Models for Table Recognition
☆35 · Jul 14, 2026 · Updated last week
IBM / SynthTabNet
Dataset of PNG images from synthetically generated table layouts with annotations in JSONL files
☆154 · Sep 17, 2025 · Updated 10 months ago
microsoft / CompHRDoc
Datasets and Evaluation Scripts for CompHRDoc
☆59 · Feb 25, 2025 · Updated last year
tianchiguaixia / layoutlmv3-chinese
This project uses layoutlmv3 for training and inference on Chinese images. It mainly addresses three problems: 1. standardizing data into a usable training dataset format; 2. modifying the layoutlmv3-base-chinese tokenization; 3. splitting text that exceeds a length of 512 and applying a sliding window.
☆63 · Sep 6, 2024 · Updated last year
AtsuMiyai / rethinking_rotation
[WACV2023] This is the official PyTorch implementation of our paper "[Rethinking Rotation in Self-Supervised Contrastive Learning: Adapt…
☆12 · Feb 24, 2023 · Updated 3 years ago
Yuliang-Zou / InstCal-Pano
[ECCV 2022] Learning Instance-Specific Adaptation for Cross-Domain Segmentation
☆14 · Jul 17, 2022 · Updated 4 years ago
TenMilesLotus / DTSM
Code and data for the paper: DTSM: Toward Dense Table Structure Recognition with Text Query Encoder and Adjacent Feature Aggregator
☆13 · Apr 28, 2024 · Updated 2 years ago
RunpeiDong / DGMS
[ICML 2022 Spotlight] Finding the Task-Optimal Low-Bit Sub-Distribution in Deep Neural Networks
☆11 · May 21, 2023 · Updated 3 years ago
Kompakkt / Viewer
Kompakkt - the web based 3D viewer and 3D annotation system.
☆17 · Jul 14, 2026 · Updated last week
ajjimeno / icdar-task-b
Repo
☆13 · Mar 7, 2022 · Updated 4 years ago
Vrroom / segment-anything-comic
The repository provides code for training the SegmentAnything Model (SAM) for predicting frame polygons in comic books
☆58 · Mar 14, 2024 · Updated 2 years ago
koninik / WordStylist
Official PyTorch Implementation of "WordStylist: Styled Verbatim Handwritten Text Generation with Latent Diffusion Models" - ICDAR 2023
☆82 · Jun 25, 2024 · Updated 2 years ago
rosettatype / mehraban-book-pahlavi
Mehraban Book Pahlavi typeface by Amir Mahdi Moslehi
☆11 · Mar 2, 2026 · Updated 4 months ago
baulbo / Diard
From document (PDF) or document images to analysis ready semi-structured data.
☆20 · Nov 4, 2022 · Updated 3 years ago
minhtannguyen / FourierFormer_NeurIPS
☆13 · Oct 15, 2022 · Updated 3 years ago
LynnHaDo / Document-Layout-Analysis
Object Detection Model for Scanned Documents
☆95 · Mar 6, 2025 · Updated last year
ljklonepiece / PushNet
a deep neural network for planar pushing
☆13 · Aug 13, 2018 · Updated 7 years ago
Sanster / OhMyTable
Table Structure Recognition
☆28 · Jul 25, 2024 · Updated last year
EkType / Gotu
Rooted in calligraphy, Gotu is a modulated display typeface in Devanagari and Latin, with large loops and voluminous counters.
☆11 · Jan 10, 2020 · Updated 6 years ago
arpane4c5 / ActivityNet
This is my attempt at the ActivityNet Challenge 2017. Thanks to the organizers for providing the boilerplate code and annotated datasets.…
☆10 · Jul 19, 2017 · Updated 9 years ago