Official PyTorch implementation for ACM MM22 "UDoc-GAN: Unpaired Document Illumination Correction with Background Light Prior"
☆25Aug 5, 2024Updated last year
Alternatives and similar repositories for UDoc-GAN
Users that are interested in UDoc-GAN are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Unofficial implementation of DocMAE (WIP): Document Image Rectification via Self-supervised Representation Learning☆20Dec 20, 2023Updated 2 years ago
- Code and Dataset for our paper: Layout-Aware Single-Image Document Flattening☆24Dec 16, 2024Updated last year
- "Towards Improving Document Understanding: An Exploration on Text-Grounding via MLLMs" 2023☆16Nov 28, 2024Updated last year
- [ACM TOMM] Official implementation of "TextCoT: Zoom-In for Enhanced Multimodal Text-Rich Image Understanding"☆45Feb 27, 2026Updated 2 months ago
- Official implementation for Multi-Modal Interaction Graph Convolutional Network for Temporal Language Localization in Videos☆16May 23, 2023Updated 2 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- A comprehensive list of awesome document image rectification papers.☆540Apr 15, 2026Updated 2 weeks ago
- The code and the DIW dataset for "Learning From Documents in the Wild to Improve Document Unwarping" (SIGGRAPH 2022)☆136Jul 28, 2024Updated last year
- Inference, training and evaluation code for our paper "DocMatcher: Document Image Dewarping via Structural and Textual Line Matching" (WA…☆53Jul 1, 2025Updated 10 months ago
- The official code for “DeepEraser: Deep Iterative Context Mining for Generic Text Eraser”, TMM, 2024.☆52Aug 26, 2024Updated last year
- The official code for “Geometric Representation Learning for Document Image Rectification”, ECCV, 2022.☆92Jun 18, 2025Updated 10 months ago
- The official code for “DocTr: Document Image Transformer for Geometric Unwarping and Illumination Correction”, ACM MM, Oral Paper, 2021.☆428Jun 18, 2025Updated 10 months ago
- Code from our paper "Template-guided Illumination Correction for Document Images with Imperfect Geometric Reconstruction " (ICCVW) 2023.☆28Feb 7, 2024Updated 2 years ago
- The official code for “SimFIR: A Simple Framework for Fisheye Image Rectification with Self-supervised Representation Learning”, ICCV, 20…☆33Jul 21, 2024Updated last year
- Blender rendering codes for doc3D-dataset (Paper: https://www3.cs.stonybrook.edu/~cvl/projects/dewarpnet/storage/paper.pdf)☆127Feb 2, 2022Updated 4 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- A hybrid dataset for document unwarping (Paper: https://www3.cs.stonybrook.edu/~cvl/projects/dewarpnet/storage/paper.pdf)☆198Dec 18, 2025Updated 4 months ago
- python opencv 文档照片与证件照片的仿射变换的矫正☆11Nov 3, 2020Updated 5 years ago
- OCR Document image deformation correction.复现阿里OCR皱巴巴文档图像形变矫正☆93Nov 27, 2020Updated 5 years ago
- Code and data for the paper: DTSM: Toward Dense Table Structure Recognition with Text Query Encoder and Adjacent Feature Aggregator☆12Apr 28, 2024Updated 2 years ago
- The official code for “Deep Unrestricted Document Image Rectification”, TMM, 2023.☆519Feb 1, 2026Updated 3 months ago
- Document Dewarping with Control Points☆198Oct 7, 2022Updated 3 years ago
- Code for the paper "DewarpNet: Single-Image Document Unwarping With Stacked 3D and 2D Regression Networks" (ICCV '19)☆610Nov 10, 2024Updated last year
- [ACM MM 2022] Marior: Margin Removal and Iterative Content Rectification for Document Dewarping in the Wild☆25Aug 12, 2022Updated 3 years ago
- ☆80Jul 31, 2025Updated 9 months ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- The official repo for “WildDoc: How Far Are We from Achieving Comprehensive and Robust Document Understanding in the Wild?“☆73May 19, 2025Updated 11 months ago
- Document Rectification and Illumination Correction using a Patch-based CNN☆396Sep 28, 2022Updated 3 years ago
- Repository for Intrinsic Decomposition of Document Images In-the-Wild (BMVC '20)☆50May 14, 2023Updated 2 years ago
- ☆67Nov 30, 2023Updated 2 years ago
- [MM'2024] Official release of RFUND introduced in the MM'2024 paper "PEneo: Unifying Line Extraction, Line Grouping, and Entity Linking f…☆21Dec 4, 2024Updated last year
- [CVPR 2025] DocLayLLM: An Efficient Multi-modal Extension of Large Language Models for Text-rich Document Understanding☆29Dec 18, 2025Updated 4 months ago
- ☆27Nov 8, 2024Updated last year
- Official code for DocNLC: A Document Image Enhancement Framework with Normalized and Latent Contrastive Representation for Multiple Degra…☆43Mar 20, 2026Updated last month
- ☆102Dec 23, 2024Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- DocTr++ in PaddlePaddle☆57Jul 24, 2024Updated last year
- Document Image Dewarping☆413Sep 30, 2019Updated 6 years ago
- [ICLR 2026] OCR-Reasoning Benchmark: Unveiling the True Capabilities of MLLMs in Complex Text-Rich Image Reasoning☆74Dec 17, 2025Updated 4 months ago
- This repository is a concise collection of well known deep learning based document binarization models.☆29Dec 24, 2022Updated 3 years ago
- [AAAI 2025] DocKylin: A Large Multimodal Model for Visual Document Understanding with Efficient Visual Slimming☆36Jun 1, 2025Updated 11 months ago
- Synthesize image datasets of documents in natural scenes with Python+Blender3D☆60Aug 7, 2022Updated 3 years ago
- ☆43Mar 26, 2022Updated 4 years ago