ayanban011/SwinDocSegmenter

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/ayanban011/SwinDocSegmenter)

ayanban011 / SwinDocSegmenter

[ICDAR 2023] (Oral) An End-to-End Unified Domain Adaptive Transformer for Document Instance Segmentation

☆74

Alternatives and similar repositories for SwinDocSegmenter

Users that are interested in SwinDocSegmenter are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

MaitySubhajit / SelfDocSeg
View on GitHub
[ICDAR 2023] SelfDocSeg: A self-supervised vision-based approach towards Document Segmentation (Oral)
☆43Oct 6, 2023Updated 2 years ago
namtuanly / MTL-TabNet
View on GitHub
MTL-TabNet: Multi-task Learning based Model for Image-based Table Recognition
☆103May 30, 2024Updated 2 years ago
ayanban011 / SVGCraft
View on GitHub
[WACV 2026 Round 1] Beyond Single Object Text-to-SVG Synthesis with Comprehensive Canvas Layout
☆24Oct 11, 2025Updated 9 months ago
AILab-UniFI / cte-dataset
View on GitHub
CTE: Contextualized Table Extraction Dataset
☆17Feb 23, 2023Updated 3 years ago
dali92002 / DocEnTR
View on GitHub
DocEnTr: An end-to-end document image enhancement transformer - ICPR 2022
☆190Jan 17, 2025Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
whn09 / table_structure_recognition
View on GitHub
Table detection (TD) and table structure recognition (TSR) using Yolov5/Yolov8, and you can get the same (even better) result compared wi…
☆52Jul 3, 2024Updated 2 years ago
yufanchen96 / RoDLA
View on GitHub
RoDLA: Benchmarking the Robustness of Document Layout Analysis Models
☆39Mar 26, 2025Updated last year
poloclub / tsr-convstem
View on GitHub
High-Performance Transformers for Table Structure Recognition Need Early Convolutions
☆45Apr 21, 2026Updated 3 months ago
HCIILAB / M6Doc
View on GitHub
☆164May 8, 2025Updated last year
L597383845 / row-col-table-recognition
View on GitHub
time-series row column classification
☆14Jan 7, 2022Updated 4 years ago
ZZR8066 / SEMv2
View on GitHub
☆71Jun 26, 2024Updated 2 years ago
biswassanket / DocSegTr
View on GitHub
A Bottom-Up Instance Segmentation Strategy for segmenting document instances using Transformers
☆59Sep 9, 2024Updated last year
Charlotte-CharMLab / Fibottention
View on GitHub
Official Repository of "Fibottention: Inceptive Visual Representation Learning with Diverse Attention Across Heads"
☆17Oct 6, 2025Updated 9 months ago
cv-small-snails / Awesome-Table-Recognition
View on GitHub
A curated list of resources dedicated to table recognition
☆404Dec 12, 2024Updated last year
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
JPLeoRX / detectron2-publaynet
View on GitHub
Trained Detectron2 object detection models for document layout analysis based on PubLayNet dataset
☆50Apr 16, 2023Updated 3 years ago
onealwj / MVLT
View on GitHub
PyTorch implementation of BMVC2022 paper Masked Vision-Language Transformers for Scene Text Recognition
☆28Nov 11, 2022Updated 3 years ago
jiangnanboy / table_structure_recognition
View on GitHub
利用Swin-Unet(Swin Transformer Unet)实现对文档图片里表格结构的识别，Swin-unet (Swin Transformer Unet) is used to identify the document table structure
☆27Feb 23, 2024Updated 2 years ago
jfkuang / CFAM
View on GitHub
Contrast-guided Feature Adjustment Module for Visual Information Extraction
☆30May 23, 2023Updated 3 years ago
facebookresearch / MultiplexedOCR
View on GitHub
Code for CVPR21 paper A Multiplexed Network for End-to-End, Multilingual OCR
☆80Dec 2, 2022Updated 3 years ago
qhnhynmm / ViOCRVQA-Dataset
View on GitHub
The largest VQA dataset for Vietnamese. Related to the text content in the image.
☆19Apr 9, 2025Updated last year
hikopensource / DAVAR-Lab-OCR
View on GitHub
OCR toolbox from Davar-Lab
☆762Jun 29, 2026Updated 3 weeks ago
ayanban011 / GraphKD
View on GitHub
[ICDAR 2024] (Best Student Paper🏆) Exploring Knowledge Distillation Towards Document Object Detection with Structured Graph Creation
☆16Sep 6, 2024Updated last year
johnning2333 / M2Doc
View on GitHub
☆43Jun 15, 2024Updated 2 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
Arenaa / Accelerated-Generation-Techniques
View on GitHub
This repository contains papers for a comprehensive survey on accelerated generation techniques in Large Language Models (LLMs).
☆11May 24, 2024Updated 2 years ago
NormXU / Layout2Graph
View on GitHub
An official implementation of paper "Paragraph2Graph: A Language-independent GNN-based framework for layout analysis"
☆82Oct 14, 2023Updated 2 years ago
rossumai / docile
View on GitHub
DocILE: Document Information Localization and Extraction Benchmark
☆149Jun 17, 2026Updated last month
biswassanket / synth_doc_generation
View on GitHub
Official PyTorch Implementation of DocSynth: A Layout Guided Approach for Controllable Document Image Synthesis - ICDAR 2021
☆93Jul 16, 2021Updated 5 years ago
shabie / docformer
View on GitHub
Implementation of DocFormer: End-to-End Transformer for Document Understanding, a multi-modal transformer based architecture for the task…
☆290Feb 13, 2023Updated 3 years ago
VinhLoiIT / vietnamese-htr
View on GitHub
Vietnamese handwritten text recognition system
☆18May 2, 2021Updated 5 years ago
saifullah3396 / docxclassifier
View on GitHub
☆17Jul 11, 2024Updated 2 years ago
CaseDrive / publaynet-models
View on GitHub
Trained Detectron2 object detection models for document layout analysis based on PubLayNet dataset
☆28Apr 16, 2023Updated 3 years ago
koninik / WordStylist
View on GitHub
Official PyTorch Implementation of "WordStylist: Styled Verbatim Handwritten Text Generation with Latent Diffusion Models" - ICDAR 2023
☆83Jun 25, 2024Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
doc-analysis / DocBank
View on GitHub
DocBank: A Benchmark Dataset for Document Layout Analysis
☆653Aug 12, 2024Updated last year
dali92002 / SSL-OCR
View on GitHub
Text-DIAE: A Self-Supervised Degradation Invariant Autoencoders for Text Recognition and Document Enhancement - AAAI 2023
☆30Jul 12, 2023Updated 3 years ago
jbaiter / archiscribe
View on GitHub
Web application for transcribing OCR ground truth from Archive.org
☆18Feb 22, 2018Updated 8 years ago
flairNLP / CleanCoNLL
View on GitHub
The CleanCoNLL dataset from our EMNLP 2023 paper where we corrected annotation errors and inconsistencies in CoNLL-03.
☆25Jul 2, 2024Updated 2 years ago
andreagemelli / doc2graph
View on GitHub
Doc2Graph transforms documents into graphs and exploit a GNN to solve several tasks.
☆139Oct 18, 2025Updated 9 months ago
qurator-spk / eynollah
View on GitHub
Document Layout Analysis
☆408Updated this week
ASVLeipzig / cor-asv-fst
View on GitHub
OCR-D post-correction module based on weighted finite-state transducers
☆11Jan 13, 2024Updated 2 years ago