[ICDAR 2023] (Oral) An End-to-End Unified Domain Adaptive Transformer for Document Instance Segmentation
☆75Sep 12, 2024Updated last year
Alternatives and similar repositories for SwinDocSegmenter
Users that are interested in SwinDocSegmenter are comparing it to the libraries listed below
Sorting:
- [ICDAR 2023] SelfDocSeg: A self-supervised vision-based approach towards Document Segmentation (Oral)☆42Oct 6, 2023Updated 2 years ago
- [WACV 2026 Round 1] Beyond Single Object Text-to-SVG Synthesis with Comprehensive Canvas Layout☆22Oct 11, 2025Updated 4 months ago
- MTL-TabNet: Multi-task Learning based Model for Image-based Table Recognition☆103May 30, 2024Updated last year
- High-Performance Transformers for Table Structure Recognition Need Early Convolutions☆44Apr 3, 2024Updated last year
- RoDLA: Benchmarking the Robustness of Document Layout Analysis Models☆39Mar 26, 2025Updated 11 months ago
- The largest VQA dataset for Vietnamese. Related to the text content in the image.☆19Apr 9, 2025Updated 10 months ago
- DocEnTr: An end-to-end document image enhancement transformer - ICPR 2022☆186Jan 17, 2025Updated last year
- Repo for "TableParser: Automatic Table Parsing with Weak Supervision from Spreadsheets" at SDU@AAAI-22☆14Aug 3, 2023Updated 2 years ago
- CTE: Contextualized Table Extraction Dataset☆17Feb 23, 2023Updated 3 years ago
- time-series row column classification☆14Jan 7, 2022Updated 4 years ago
- [ICDAR 2024] (Best Student Paper🏆) Exploring Knowledge Distillation Towards Document Object Detection with Structured Graph Creation☆15Sep 6, 2024Updated last year
- Table detection (TD) and table structure recognition (TSR) using Yolov5/Yolov8, and you can get the same (even better) result compared wi…☆51Jul 3, 2024Updated last year
- An official implementation of paper "Paragraph2Graph: A Language-independent GNN-based framework for layout analysis"☆81Oct 14, 2023Updated 2 years ago
- ☆17Jul 11, 2024Updated last year
- ☆156May 8, 2025Updated 9 months ago
- A curated list of resources dedicated to table recognition☆406Dec 12, 2024Updated last year
- ☆38Oct 20, 2023Updated 2 years ago
- DocILE: Document Information Localization and Extraction Benchmark☆142May 15, 2024Updated last year
- ☆70Jun 26, 2024Updated last year
- 利用Swin-Unet(Swin Transformer Unet)实现对文档图片里表格结构的识别,Swin-unet (Swin Transformer Unet) is used to identify the document table structure☆28Feb 23, 2024Updated 2 years ago
- ☆40Jun 15, 2024Updated last year
- (ICCV 2023) ESTextSpotter: Towards Better Scene Text Spotting with Explicit Synergy in Transformer☆78Apr 9, 2024Updated last year
- PyTorch implementation of BMVC2022 paper Masked Vision-Language Transformers for Scene Text Recognition☆29Nov 11, 2022Updated 3 years ago
- Implementation of DocFormer: End-to-End Transformer for Document Understanding, a multi-modal transformer based architecture for the task…☆288Feb 13, 2023Updated 3 years ago
- Code for CVPR21 paper A Multiplexed Network for End-to-End, Multilingual OCR☆80Dec 2, 2022Updated 3 years ago
- ☆45Jul 18, 2022Updated 3 years ago
- EAST-inspired Tensorflow-based Text Detector☆11Feb 18, 2021Updated 5 years ago
- Graph Key Information Extraction: GKIE☆11Sep 15, 2022Updated 3 years ago
- Contrast-guided Feature Adjustment Module for Visual Information Extraction☆30May 23, 2023Updated 2 years ago
- An unofficial Pytorch implementation of ERNIE-Layout which is originally released through PaddleNLP.☆107Nov 15, 2023Updated 2 years ago
- Dataset of PNG images from synthetically generated table layouts with annotations in JSONL files☆152Sep 17, 2025Updated 5 months ago
- The official code of Linguistic More: Taking a Further Step toward Efficient and Accurate Scene Text Recognition (IJCAI2023)☆27Sep 3, 2023Updated 2 years ago
- Text-DIAE: A Self-Supervised Degradation Invariant Autoencoders for Text Recognition and Document Enhancement - AAAI 2023☆28Jul 12, 2023Updated 2 years ago
- ☆16Jan 30, 2022Updated 4 years ago
- This repository contains papers for a comprehensive survey on accelerated generation techniques in Large Language Models (LLMs).☆11May 24, 2024Updated last year
- 轻量级文字识别技术创新大赛终榜第5名☆15Jul 15, 2021Updated 4 years ago
- Trained Detectron2 object detection models for document layout analysis based on PubLayNet dataset☆50Apr 16, 2023Updated 2 years ago
- A comprehensive list [Hi-SAM@TPAMI'24, GoMatching@NeurIPS'24, DeepSolo(++)@ CVPR'23, DPText-DETR@AAAI'23, I3CL@IJCV'22] of our research w…☆93Nov 12, 2024Updated last year
- Code for the paper "UVDoc: Neural Grid-based Document Unwarping"☆198Jul 28, 2024Updated last year