This repository is a concise collection of well-known deep-learning-based document binarization models.
☆27 · Dec 24, 2022 · Updated 3 years ago
Alternatives and similar repositories for Document_Binarization_Collection
Users who are interested in Document_Binarization_Collection are comparing it to the repositories listed below.
- Contrast-guided Feature Adjustment Module for Visual Information Extraction ☆30 · May 23, 2023 · Updated 2 years ago
- ☆17 · Nov 21, 2019 · Updated 6 years ago
- Text-DIAE: A Self-Supervised Degradation Invariant Autoencoders for Text Recognition and Document Enhancement - AAAI 2023 ☆28 · Jul 12, 2023 · Updated 2 years ago
- Code and data for the paper: DTSM: Toward Dense Table Structure Recognition with Text Query Encoder and Adjacent Feature Aggregator ☆12 · Apr 28, 2024 · Updated last year
- [CVPR 2025] DocLayLLM: An Efficient Multi-modal Extension of Large Language Models for Text-rich Document Understanding ☆27 · Dec 18, 2025 · Updated 2 months ago
- STIRER: A Unified Model for Low-Resolution Scene Text Image Recovery and Recognition -- ACMMM 2023 ☆14 · Dec 2, 2024 · Updated last year
- Official PyTorch implementation of `[ACMMM 2023]Relational Contrastive Learning for Scene Text Recognition` ☆17 · Sep 22, 2023 · Updated 2 years ago
- [MM2023] An official implement of the paper "One-stage Low-resolution Text Recognition with High-resolution Knowledge Transfer" ☆16 · Nov 3, 2023 · Updated 2 years ago
- [MM'2024] Official release of RFUND introduced in the MM'2024 paper "PEneo: Unifying Line Extraction, Line Grouping, and Entity Linking f… ☆20 · Dec 4, 2024 · Updated last year
- a dataset for camera-based table detection ☆16 · Jul 30, 2021 · Updated 4 years ago
- Repository for the KVP10k dataset ☆22 · Sep 18, 2025 · Updated 5 months ago
- Code for AAAI 2023 Paper : “Alignment-Enriched Tuning for Patch-Level Pre-trained Document Image Models” ☆18 · Dec 6, 2022 · Updated 3 years ago
- [ACM MM 2020] Exploring Font-independent Features for Scene Text Recognition ☆44 · Nov 30, 2020 · Updated 5 years ago
- My second year project under advisor Prof. Changick Kim (2015.09~2016.03). The technique is for removing illumination distortions for cam… ☆37 · Jun 1, 2020 · Updated 5 years ago
- Blender rendering codes for doc3D-dataset (Paper: https://www3.cs.stonybrook.edu/~cvl/projects/dewarpnet/storage/paper.pdf) ☆125 · Feb 2, 2022 · Updated 4 years ago
- Code and Dataset for our paper: Layout-Aware Single-Image Document Flattening ☆23 · Dec 16, 2024 · Updated last year
- ☆18 · Apr 11, 2023 · Updated 2 years ago
- Document Image Enhancement with GANs - TPAMI journal ☆214 · Mar 24, 2023 · Updated 2 years ago
- [ACM MM 2022] Marior: Margin Removal and Iterative Content Rectification for Document Dewarping in the Wild ☆25 · Aug 12, 2022 · Updated 3 years ago
- Joint Semantic Segmentation and Boundary Detection using Iterative Pyramid Contexts (CVPR2020) ☆22 · Apr 29, 2020 · Updated 5 years ago
- Official PyTorch implementation for ACM MM22 "UDoc-GAN: Unpaired Document Illumination Correction with Background Light Prior" ☆25 · Aug 5, 2024 · Updated last year
- Pyramid Mask Text Detector designed by SenseTime Video Intelligence Research team. ☆48 · Aug 1, 2019 · Updated 6 years ago
- BoundaryNet - A Semi-Automatic Layout Annotation Tool ☆24 · Dec 11, 2021 · Updated 4 years ago
- Document Rectification and Illumination Correction using a Patch-based CNN ☆396 · Sep 28, 2022 · Updated 3 years ago
- ☆60 · May 23, 2022 · Updated 3 years ago
- ☆36 · Oct 7, 2023 · Updated 2 years ago
- [AAAI 2025] DocKylin: A Large Multimodal Model for Visual Document Understanding with Efficient Visual Slimming ☆36 · Jun 1, 2025 · Updated 9 months ago
- Advanced inference pipeline using NVIDIA Triton Inference Server for CRAFT Text detection (Pytorch), included converter from Pytorch -> O… ☆33 · Aug 18, 2021 · Updated 4 years ago
- Update the latest text-related papers from top conferences ☆27 · Mar 12, 2025 · Updated 11 months ago
- Unofficial implementation of ''BEDSR-Net: A Deep Shadow Removal from a Single Document Image'' with PyTorch ☆63 · Nov 13, 2021 · Updated 4 years ago
- Official code for DocNLC: A Document Image Enhancement Framework with Normalized and Latent Contrastive Representation for Multiple Degra… ☆39 · May 28, 2025 · Updated 9 months ago
- Code for the paper "UVDoc: Neural Grid-based Document Unwarping" - Dataset capture and creation ☆31 · May 27, 2024 · Updated last year
- This repository contains a paper collection of the methods for document image processing, including appearance enhancement, deshadowing, … ☆348 · Feb 4, 2026 · Updated 3 weeks ago
- Evaluation of the Optical Character Recognition (OCR) capabilities of GPT-4V(ision) ☆126 · Nov 13, 2023 · Updated 2 years ago
- Basic HTR concepts/modules to boost performance ☆39 · Nov 30, 2024 · Updated last year
- Inference, training and evaluation code for our paper "DocMatcher: Document Image Dewarping via Structural and Textual Line Matching" (WA… ☆50 · Jul 1, 2025 · Updated 8 months ago
- We identify the desiderata for a comprehensive benchmark and propose Visually Rich Document Understanding (VRDU). VRDU contains two datas… ☆80 · Feb 8, 2023 · Updated 3 years ago
- ☆42 · Feb 7, 2023 · Updated 3 years ago
- A large scale camera-taken table detection and recognition dataset. ☆149 · Jul 21, 2025 · Updated 7 months ago