NAF-DPM: A Nonlinear Activation-Free Diffusion Probabilistic Model for Document Enhancement
☆51Aug 5, 2024Updated last year
Alternatives and similar repositories for NAF-DPM
Users who are interested in NAF-DPM are comparing it to the libraries listed below
- ACM Multimedia 2023: DocDiff: Document Enhancement via Residual Diffusion Models. Also contains 1597 red seals in Chinese scenes, along w…☆336Aug 22, 2024Updated last year
- This repository contains a paper collection of the methods for document image processing, including appearance enhancement, deshadowing, …☆348Feb 4, 2026Updated last month
- SOTA Document Image Enhancement - T2T-BinFormer: Effective Document Image Enhancement Using tokens-to-token Transformer Network☆24Dec 9, 2023Updated 2 years ago
- Official code implementation of "TextDiff: Mask-Guided Residual Diffusion Models for Scene Text Image" in Pattern Recognition☆24Apr 24, 2024Updated last year
- [CVPR 2024] DocRes: A Generalist Model Toward Unifying Document Image Restoration Tasks☆566Aug 3, 2025Updated 7 months ago
- Official code for DocNLC: A Document Image Enhancement Framework with Normalized and Latent Contrastive Representation for Multiple Degra…☆39May 28, 2025Updated 9 months ago
- The largest VQA dataset for Vietnamese. Related to the text content in the image.☆19Apr 9, 2025Updated 10 months ago
- Document Image Enhancement with GANs - TPAMI journal☆214Mar 24, 2023Updated 2 years ago
- Three-stage binarization of color document images based on discrete wavelet transform and generative adversarial networks☆12Aug 12, 2025Updated 6 months ago
- [PR 2025] DocAligner: Automating the Annotation of Photographed Documents Through Real-virtual Alignment☆102Aug 4, 2025Updated 7 months ago
- ☆70Jun 26, 2024Updated last year
- ☆102Dec 23, 2024Updated last year
- [TAI 2023] Appearance Enhancement for Camera-captured Document Images in the Wild☆51Aug 28, 2025Updated 6 months ago
- DocReal: Robust Document Dewarping of Real-Life Images via Attention-Enhanced Control Point Prediction☆27Jun 28, 2023Updated 2 years ago
- The official project of the paper "Visual Text Processing: A Comprehensive Review and Unified Evaluation"☆97Oct 20, 2025Updated 4 months ago
- PR2024 GDB: Gated convolutions-based Document Binarization. This repository comprehensively collects the datasets that may be used in do…☆16Nov 27, 2023Updated 2 years ago
- DocTr++ in PaddlePaddle☆58Jul 24, 2024Updated last year
- Reproducing the Past: A Dataset for Benchmarking Inscription Restoration (ACM MM'24)☆13Oct 15, 2025Updated 4 months ago
- ✨ Beautiful OCR Project Team Code by Team DKT☆12Jun 23, 2021Updated 4 years ago
- [CVPR 2025] DocLayLLM: An Efficient Multi-modal Extension of Large Language Models for Text-rich Document Understanding☆27Dec 18, 2025Updated 2 months ago
- ☆41Nov 13, 2023Updated 2 years ago
- [AAAI2024] FontDiffuser: One-Shot Font Generation via Denoising Diffusion with Multi-Scale Content Aggregation and Style Contrastive Lear…☆501Mar 14, 2024Updated last year
- [MM'2024] PEneo, an effective algorithm for key-value pair extraction from form-like documents, designed for real-world applications.☆41Apr 7, 2025Updated 10 months ago
- ☆17Nov 21, 2019Updated 6 years ago
- Code for the paper "UVDoc: Neural Grid-based Document Unwarping"☆198Jul 28, 2024Updated last year
- A comprehensive list of awesome document image rectification papers.☆526Feb 1, 2026Updated last month
- [ICDAR 2024] Coarse-to-Fine Document Image Registration for Dewarping☆24Jul 15, 2024Updated last year
- ☆31Apr 8, 2025Updated 10 months ago
- ODM: A Text-Image Further Alignment Pre-training Approach for Scene Text Detection and Spotting☆45Apr 11, 2025Updated 10 months ago
- [arXiv 2024] PromptRR: Diffusion Models as Prompt Generators for Single Image Reflection Removal☆17Feb 8, 2024Updated 2 years ago
- [2024-NeurIPS] TextCtrl: Diffusion-based Scene Text Editing with Prior Guidance Control☆97Mar 16, 2025Updated 11 months ago
- [PR 2025] The official GitHub page of "MegaHan97K: A Large-Scale Dataset for Mega-Category Chinese Character Recognition with over 97K Ca…☆75Dec 22, 2025Updated 2 months ago
- [ICCV 2025] FiVE-Bench: A Fine-grained Video Editing Benchmark for Evaluating Emerging Diffusion and Rectified Flow Models☆19Aug 26, 2025Updated 6 months ago
- The code and the DIW dataset for "Learning From Documents in the Wild to Improve Document Unwarping" (SIGGRAPH 2022)☆136Jul 28, 2024Updated last year
- [ICLR 2026] OCR-Reasoning Benchmark: Unveiling the True Capabilities of MLLMs in Complex Text-Rich Image Reasoning☆73Dec 17, 2025Updated 2 months ago
- Table recognition in images + OCR recognition☆25Mar 8, 2024Updated last year
- Unsupervised Training Data Generation of Handwritten Formulas using Generative Adversarial Networks with Self-Attention☆17Aug 16, 2023Updated 2 years ago
- [IEEE TPAMI] Hi-SAM: Marrying Segment Anything Model for Hierarchical Text Segmentation☆347May 30, 2025Updated 9 months ago
- Code for the ICCV 2023 paper "ICL-D3IE: In-Context Learning with Diverse Demonstrations Updating for Document Information Extraction"☆54Aug 8, 2023Updated 2 years ago