biswassanket/DocSegTr

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/biswassanket/DocSegTr)

biswassanket / DocSegTr

A Bottom-Up Instance Segmentation Strategy for segmenting document instances using Transformers

☆59

Alternatives and similar repositories for DocSegTr

Users that are interested in DocSegTr are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

AndresPMD / semantic_adaptive_margin
View on GitHub
WACV 2022 Paper - Is An Image Worth Five Sentences? A New Look into Semantics for Image-Text Matching
☆16Dec 10, 2021Updated 4 years ago
furkanbiten / object-bias
View on GitHub
Let there be clock in the beach - WACV 2022
☆15Nov 15, 2021Updated 4 years ago
AndresPMD / Fine_Grained_Clf
View on GitHub
Based on the WACV 2020 paper - Fine Grained Classification and Retrieval by Combining Visual and Locally Pooled Textual Features
☆25Nov 15, 2021Updated 4 years ago
AndresPMD / Pytorch-yolo-phoc
View on GitHub
Implementation on pytorch of the code from the ECCV 2018 paper - Single Shot Scene Text Retrieval
☆13Dec 15, 2021Updated 4 years ago
dali92002 / HTRbyMatching
View on GitHub
Hadwritten Text Recognition in Few-shot Scenario
☆22Mar 25, 2023Updated 3 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
MaitySubhajit / SelfDocSeg
View on GitHub
[ICDAR 2023] SelfDocSeg: A self-supervised vision-based approach towards Document Segmentation (Oral)
☆43Oct 6, 2023Updated 2 years ago
AndresPMD / Clip_CMR
View on GitHub
CLIP-based simple image-text matching baseline for COCO and F30K
☆15Sep 16, 2021Updated 4 years ago
AndresPMD / StacMR
View on GitHub
Scene Text Aware Cross Modal Retrieval (StacMR)
☆24Sep 3, 2021Updated 4 years ago
dali92002 / DocEnTR
View on GitHub
DocEnTr: An end-to-end document image enhancement transformer - ICPR 2022
☆190Jan 17, 2025Updated last year
biswassanket / synth_doc_generation
View on GitHub
Official PyTorch Implementation of DocSynth: A Layout Guided Approach for Controllable Document Image Synthesis - ICDAR 2021
☆93Jul 16, 2021Updated 5 years ago
ayanban011 / GraphKD
View on GitHub
[ICDAR 2024] (Best Student Paper🏆) Exploring Knowledge Distillation Towards Document Object Detection with Structured Graph Creation
☆16Sep 6, 2024Updated last year
furkanbiten / stvqa_amazon_ocr
View on GitHub
STVQA and TextVQA OCR results from Amazon Text in Image pipeline
☆12Jul 18, 2022Updated 4 years ago
furkanbiten / SelectiveTextStyleTransfer
View on GitHub
ICDAR 2019
☆25Aug 2, 2019Updated 6 years ago
dali92002 / DE-GAN
View on GitHub
Document Image Enhancement with GANs - TPAMI journal
☆222Mar 24, 2023Updated 3 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
furkanbiten / idl_data
View on GitHub
OCR Annotations from Amazon Textract for Industry Documents Library
☆103Aug 20, 2022Updated 3 years ago
AndresPMD / GCN_classification
View on GitHub
Multi-Modal Reasoning Graph for Scene-Text Based Fine-Grained Image Classification and Retrieval
☆65Dec 1, 2022Updated 3 years ago
wangkai930418 / HCV_IIRC
View on GitHub
code for our BMVC 2021 paper "HCV: Hierarchy-Consistency Verification for Incremental Implicitly-Refined Classification"
☆15Oct 28, 2022Updated 3 years ago
sounakdey / SigNet
View on GitHub
SigNet: Convolutional Siamese Network for Writer Independent Offline Signature Verification
☆80Oct 24, 2017Updated 8 years ago
emanuelevivoli / awesome-comics-understanding
View on GitHub
The official repo of the Comics Survey: "A missing piece in Vision and Language: A Survey on Comics Understanding"
☆139Jan 2, 2025Updated last year
lcy0604 / CTRNet-plus
View on GitHub
The official implement of CTRNet++.
☆15Dec 30, 2024Updated last year
ayanban011 / SwinDocSegmenter
View on GitHub
[ICDAR 2023] (Oral) An End-to-End Unified Domain Adaptive Transformer for Document Instance Segmentation
☆74Sep 12, 2024Updated last year
AILab-UniFI / cte-dataset
View on GitHub
CTE: Contextualized Table Extraction Dataset
☆17Feb 23, 2023Updated 3 years ago
allanj / LayoutLMv3-DocVQA
View on GitHub
Example codebase for fine-tuning layoutLMv3 on DocVQA
☆53Sep 19, 2022Updated 3 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
FelixHertlein / inv3d
View on GitHub
Project page for the ICDAR 2023 Paper "Inv3D: a high-resolution 3D invoice dataset for template-guided single-image document unwarping".
☆13Dec 21, 2023Updated 2 years ago
ExplainableML / sketch-primitives
View on GitHub
ECCV 2022: Abstracting Sketches through Simple Primitives
☆27Jan 19, 2024Updated 2 years ago
RylonW / DocNLC
View on GitHub
Official code for DocNLC: A Document Image Enhancement Framework with Normalized and Latent Contrastive Representation for Multiple Degra…
☆44Mar 20, 2026Updated 4 months ago
samakos / Document-AI-
View on GitHub
☆14Aug 31, 2023Updated 2 years ago
FaisalAlamri0 / ViT-ZSL
View on GitHub
Multi-Headed Self-Attention via Vision Transformer for Zero-Shot Learning (ViT-ZSL)
☆33Aug 3, 2021Updated 4 years ago
RaymondMcGuire / BOOK-CONTENT-SEGMENTATION-AND-DEWARPING
View on GitHub
Using FCN to segment the book's content and background, then dewarping the pages,
☆21Oct 9, 2021Updated 4 years ago
qurator-spk / eynollah
View on GitHub
Document Layout Analysis
☆408Updated this week
dali92002 / SSL-OCR
View on GitHub
Text-DIAE: A Self-Supervised Degradation Invariant Autoencoders for Text Recognition and Document Enhancement - AAAI 2023
☆30Jul 12, 2023Updated 3 years ago
MCLAB-OCR / KnowledgeMiningWithSceneText
View on GitHub
☆38Feb 4, 2023Updated 3 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
xiaoyu258 / DocProj
View on GitHub
Document Rectification and Illumination Correction using a Patch-based CNN
☆397Sep 28, 2022Updated 3 years ago
herobd / FUDGE
View on GitHub
Code for the ICDAR2021 paper "Visual FUDGE: Form Understanding via Dynamic Graph Editing"
☆33Mar 4, 2022Updated 4 years ago
HorizonParadox / DRCCBI
View on GitHub
☆34Jan 13, 2025Updated last year
shabie / docformer
View on GitHub
Implementation of DocFormer: End-to-End Transformer for Document Understanding, a multi-modal transformer based architecture for the task…
☆290Feb 13, 2023Updated 3 years ago
Oksitaine / RealVisXL-v4.0
View on GitHub
Photorealism model use RealVisXL v4.0
☆12Feb 20, 2024Updated 2 years ago
fh2019ustc / DocTr-Plus
View on GitHub
The official code for “Deep Unrestricted Document Image Rectification”, TMM, 2023.
☆528Feb 1, 2026Updated 5 months ago
lluisgomez / TextTopicNet
View on GitHub
Self-supervised learning of visual features through embedding images into text topic spaces
☆95Aug 20, 2022Updated 3 years ago