[ICDAR 2023] SelfDocSeg: A self-supervised vision-based approach towards Document Segmentation (Oral)
☆42Oct 6, 2023Updated 2 years ago
Alternatives and similar repositories for SelfDocSeg
Users that are interested in SelfDocSeg are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [ICDAR 2023] (Oral) An End-to-End Unified Domain Adaptive Transformer for Document Instance Segmentation☆74Sep 12, 2024Updated last year
- ☆41Jun 15, 2024Updated last year
- A Bottom-Up Instance Segmentation Strategy for segmenting document instances using Transformers☆59Sep 9, 2024Updated last year
- High-Performance Transformers for Table Structure Recognition Need Early Convolutions☆45Apr 3, 2024Updated 2 years ago
- TRACE: Table Reconstruction Aligned to Corner and Edges (ICDAR 2023)☆31Mar 13, 2024Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- ☆12Mar 20, 2023Updated 3 years ago
- The code and data for "Summary-Oriented Vision Modeling for Multimodal Abstractive Summarization"☆11May 16, 2023Updated 2 years ago
- 利用Swin-Unet(Swin Transformer Unet)实现对文档图片里表格结构的识别,Swin-unet (Swin Transformer Unet) is used to identify the document table structure☆27Feb 23, 2024Updated 2 years ago
- ☆18May 30, 2023Updated 2 years ago
- [ICDAR 2024] (Best Student Paper🏆) Exploring Knowledge Distillation Towards Document Object Detection with Structured Graph Creation☆15Sep 6, 2024Updated last year
- [WACV 2026 Round 1] Beyond Single Object Text-to-SVG Synthesis with Comprehensive Canvas Layout☆23Oct 11, 2025Updated 5 months ago
- TAT-DQA: Towards Complex Document Understanding By Discrete Reasoning☆23Sep 17, 2024Updated last year
- ICDAR 2024 Table OCR Model☆39Feb 25, 2026Updated last month
- Official Repository of "Fibottention: Inceptive Visual Representation Learning with Diverse Attention Across Heads"☆17Oct 6, 2025Updated 6 months ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- The official code for "OG-HFYOLO :Orientation Gradient Guidance and Heterogeneous Feature Fusion For Deformation Table Cell Instance Segm…☆13Jul 28, 2025Updated 8 months ago
- Table Structure Recognition☆82Mar 11, 2023Updated 3 years ago
- time-series row column classification☆14Jan 7, 2022Updated 4 years ago
- This is an official implementation for the WTW Dataset in "Parsing Table Structures in the Wild " on table detection and table structure …☆183Sep 15, 2021Updated 4 years ago
- MTL-TabNet: Multi-task Learning based Model for Image-based Table Recognition☆104May 30, 2024Updated last year
- RoDLA: Benchmarking the Robustness of Document Layout Analysis Models☆39Mar 26, 2025Updated last year
- ☆14Aug 31, 2023Updated 2 years ago
- [MM'2024] PEneo, an effective algorithm for key-value pair extraction from form-like documents, designed for real-world applications.☆41Apr 7, 2025Updated last year
- A curated list of resources dedicated to table recognition☆404Dec 12, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- Dataset of PNG images from synthetically generated table layouts with annotations in JSONL files☆152Sep 17, 2025Updated 6 months ago
- Ongoing research training transformer language models at scale, including: BERT & GPT-2☆19Jul 20, 2023Updated 2 years ago
- [ECCV 2022] Learning Instance-Specific Adaptation for Cross-Domain Segmentation☆14Jul 17, 2022Updated 3 years ago
- [WACV2023] This is the official PyTorch impelementation of our paper "[Rethinking Rotation in Self-Supervised Contrastive Learning: Adapt…☆12Feb 24, 2023Updated 3 years ago
- [AAAI 2026]RAG, Benchmark, robust RAG generation☆34Nov 20, 2025Updated 4 months ago
- 该项目是为了使用layoutlmv3针对中文图片训练和推理。 其中主要解决三个问题: 1.数据标准化成可以的训练数据集格式 2.layoutlmv3-base-chinese 分词修改 2.超过512长度的文本切分和滑窗操作☆63Sep 6, 2024Updated last year
- Table Structure Recognition☆28Jul 25, 2024Updated last year
- an unofficial code for augment-XY-CUT in XYLayoutLM☆30Jul 12, 2022Updated 3 years ago
- [ICML 2022 Spotlight] Finding the Task-Optimal Low-Bit Sub-Distribution in Deep Neural Networks☆11May 21, 2023Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Code and data for the paper: DTSM: Toward Dense Table Structure Recognition with Text Query Encoder and Adjacent Feature Aggregator☆12Apr 28, 2024Updated last year
- Repo☆12Mar 7, 2022Updated 4 years ago
- Trained Detectron2 object detection models for document layout analysis based on PubLayNet dataset☆29Apr 16, 2023Updated 2 years ago
- Official PyTorch Implementation of "WordStylist: Styled Verbatim Handwritten Text Generation with Latent Diffusion Models" - ICDAR 2023☆83Jun 25, 2024Updated last year
- Object Detection Model for Scanned Documents☆94Mar 6, 2025Updated last year
- ☆11Feb 23, 2024Updated 2 years ago
- Image to Latex using Encoder-Decoder architecture☆19May 21, 2025Updated 10 months ago