ACM Multimedia 2023: DocDiff: Document Enhancement via Residual Diffusion Models. Also contains 1597 red seals in Chinese scenes, along with their corresponding binary masks.
☆335Aug 22, 2024Updated last year
Alternatives and similar repositories for DocDiff
Users that are interested in DocDiff are comparing it to the libraries listed below
Sorting:
- NAF-DPM: A Nonlinear Activation-Free Diffusion Probabilistic Model for Document Enhancement☆51Aug 5, 2024Updated last year
- Official code for DocNLC: A Document Image Enhancement Framework with Normalized and Latent Contrastive Representation for Multiple Degra…☆39May 28, 2025Updated 9 months ago
- This repository contains a paper collection of the methods for document image processing, including appearance enhancement, deshadowing, …☆348Feb 4, 2026Updated 3 weeks ago
- Document Image Enhancement with GANs - TPAMI journal☆214Mar 24, 2023Updated 2 years ago
- [CVPR 2024] DocRes: A Generalist Model Toward Unifying Document Image Restoration Tasks☆566Aug 3, 2025Updated 6 months ago
- Read Ten Lines at One Glance: Line-Aware Semi-Autoregressive Transformer for Multi-Line Handwritten Mathematical Expression Recognition☆28Aug 29, 2023Updated 2 years ago
- DocReal: Robust Document Dewarping of Real-Life Images via Attention-Enhanced Control Point Prediction☆27Jun 28, 2023Updated 2 years ago
- [TAI 2023] Appearance Enhancement for Camera-captured Document Images in the Wild☆51Aug 28, 2025Updated 6 months ago
- PR2024 GDB: Gated convolutions-based Document Binarization. This repository comprehensively collects the datasets that may be used in do…☆16Nov 27, 2023Updated 2 years ago
- SOTA Document Image Enhancement - T2T-BinFormer: Effective Document Image Enhancement Using tokens-to-token Transformer Network☆24Dec 9, 2023Updated 2 years ago
- DocEnTr: An end-to-end document image enhancement transformer - ICPR 2022☆186Jan 17, 2025Updated last year
- A comprehensive list of awesome document image rectification papers.☆523Feb 1, 2026Updated 3 weeks ago
- Code for the paper "UVDoc: Neural Grid-based Document Unwarping"☆198Jul 28, 2024Updated last year
- The code and the DIW dataset for "Learning From Documents in the Wild to Improve Document Unwarping" (SIGGRAPH 2022)☆136Jul 28, 2024Updated last year
- DocTr++ in PaddlePaddle☆58Jul 24, 2024Updated last year
- RoDLA: Benchmarking the Robustness of Document Layout Analysis Models☆39Mar 26, 2025Updated 11 months ago
- Official implementation of UPOCR: Towards unified pixel-level OCR interface (ICML 2024)☆67Jun 6, 2024Updated last year
- Modeling Stroke Mask for End-to-End Text Erasing☆19Feb 9, 2023Updated 3 years ago
- Official code implementation of " TextDiff: Mask-Guided Residual Diffusion Models for Scene Text Image " in Pattern Recognition☆24Apr 24, 2024Updated last year
- [IJCAI2023] Your text images can be clearer!☆58Nov 18, 2025Updated 3 months ago
- The official code for “Deep Unrestricted Document Image Rectification”, TMM, 2023.☆505Feb 1, 2026Updated 3 weeks ago
- [CVPR2024] Diffusion-based Blind Text Image Super-Resolution (Official)☆188Nov 15, 2025Updated 3 months ago
- The official code for the CVPR 2024 paper: Multi-modal In-Context Learning Makes an Ego-evolving Scene Text Recognizer☆55Jun 14, 2024Updated last year
- DocBank 文档图像增强数据集,此数据集用于文档图像增强,具体任务包括以下内容:Seal detection & Removal 印章检测 & 移除 ;Watermark detection & Removal 水印检测 & 移除;Document deblurrin…☆44Oct 22, 2024Updated last year
- The official project of paper "Visual Text Processing: A Comprehensive Review and Unified Evaluation""☆96Oct 20, 2025Updated 4 months ago
- Code for the paper "UVDoc: Neural Grid-based Document Unwarping" - Dataset capture and creation☆31May 27, 2024Updated last year
- A curated list of resources dedicated to table recognition☆406Dec 12, 2024Updated last year
- The official code for “Geometric Representation Learning for Document Image Rectification”, ECCV, 2022.☆89Jun 18, 2025Updated 8 months ago
- 文档图像处理工具(Document image processing tool),包括漂白 / 文字方向矫正 / 清晰增强 / 笔记去噪美化 / 去阴影 / 扭曲矫正 / 切边增强(DocBleach / TextOrientationCorrection / DocSha…☆119Aug 27, 2024Updated last year
- [AAAI2025 Oral] Predicting the Original Appearance of Damaged Historical Documents☆105Jul 15, 2025Updated 7 months ago
- [WACV 2025] High-Fidelity Document Stain Removal via A Large-Scale Real-World Dataset and A Memory-Augmented Transformer☆21Jan 14, 2026Updated last month
- Official implementation of ViTEraser: Harnessing the Power of Vision Transformers for Scene Text Removal with SegMIM Pretraining (AAAI 20…☆62Jul 4, 2024Updated last year
- 基于transformer的ocr识别,在公章(印章识别, seal recognition)拓展应用☆283Oct 24, 2025Updated 4 months ago
- A paper collection of recent diffusion models for text-image generation tasks, e,g., visual text generation, font generation, text remova…☆271Dec 19, 2024Updated last year
- [PR 2025] DocAligner: Automating the Annotation of Photographed Documents Through Real-virtual Alignment☆102Aug 4, 2025Updated 6 months ago
- PyTorch implementation of BMVC2022 paper Masked Vision-Language Transformers for Scene Text Recognition☆29Nov 11, 2022Updated 3 years ago
- Inference, training and evaluation code for our models from the paper "Inv3D: a high-resolution 3D invoice dataset for template-guided si…☆58Feb 7, 2024Updated 2 years ago
- ☆60May 23, 2022Updated 3 years ago
- The official code of Linguistic More: Taking a Further Step toward Efficient and Accurate Scene Text Recognition (IJCAI2023)☆27Sep 3, 2023Updated 2 years ago