liuyifan6613/DocBank-Document-Enhancement-Dataset

liuyifan6613 / DocBank-Document-Enhancement-Dataset

DocBank document image enhancement dataset. The dataset targets document image enhancement and covers the following tasks: seal detection and removal; watermark detection and removal; document deblurring; document shadow removal; document super-resolution; document low-light enhancement.

☆48

Alternatives and similar repositories for DocBank-Document-Enhancement-Dataset

Users who are interested in DocBank-Document-Enhancement-Dataset are comparing it to the repositories listed below.

CXH-Research / StainRestorer
[WACV 2025] High-Fidelity Document Stain Removal via A Large-Scale Real-World Dataset and A Memory-Augmented Transformer
☆23 · Jan 14, 2026 · Updated 6 months ago
OmarSamirz / ImageFromTextGenerator
IFTG (ImageFromTextGenerator) is a Python package that simplifies creating robust datasets for OCR models. Generate images from text, app…
☆21 · Nov 7, 2025 · Updated 8 months ago
Gmgge / TrOCR-Seal-Recognition
Transformer-based OCR, extended to official seal (stamp) recognition
☆297 · Oct 24, 2025 · Updated 9 months ago
Royalvice / DocDiff
ACM Multimedia 2023: DocDiff: Document Enhancement via Residual Diffusion Models. Also contains 1597 red seals in Chinese scenes, along w…
☆350 · Aug 22, 2024 · Updated last year
JianqiangWan / VLPT-STD
Vision-Language Pre-Training for Boosting Scene Text Detectors (CVPR2022)
☆12 · Mar 21, 2022 · Updated 4 years ago
bcmi / Awesome-Visible-Watermark-Removal
☆119 · Sep 30, 2022 · Updated 3 years ago
Correr-Zhou / MagicTailor
[IJCAI 2025 (Oral)] Official implementation of the paper "MagicTailor: Component-Controllable Personalization in Text-to-Image Diffusion …
☆98 · Jan 18, 2026 · Updated 6 months ago
yingqichao / robustly_hiding_images_into_images
☆18 · Jul 23, 2022 · Updated 4 years ago
philexohf / Practical_AI_with_PyTorch
Practical AI with PyTorch 2024
☆33 · Jan 5, 2025 · Updated last year
ZZZHANG-jx / DocRes
[CVPR 2024] DocRes: A Generalist Model Toward Unifying Document Image Restoration Tasks
☆628 · Aug 3, 2025 · Updated 11 months ago
Helen-Cheung / Baidu-AI-Challenge-Scene-Text-Removal
☆15 · Feb 28, 2022 · Updated 4 years ago
Topdu / DocPTBench
Benchmarking End-to-End Photographed Document Parsing and Translation
☆17 · Dec 4, 2025 · Updated 7 months ago
Alvin-YCHEN / document-sharing
☆16 · Feb 29, 2020 · Updated 6 years ago
jiangnanboy / Doc-Image-Tool
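Document image processing tool, including bleaching / text orientation correction / sharpness enhancement / note denoising and beautification / shadow removal / dewarping / edge cropping and enhancement (DocBleach / TextOrientationCorrection / DocSha…
☆136 · Aug 27, 2024 · Updated last year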
SWHL / ChineseDocumentPDF
PDF data for Chinese academic papers, securities documents, and financial reports
☆41 · Jun 13, 2024 · Updated 2 years ago
Zzz512 / TSD
A dataset for tooth structured instance segmentation of dental panoramic X-ray.
☆15 · May 17, 2024 · Updated 2 years ago
Marvin0724 / Face_bone_transform
☆11 · Jul 28, 2025 · Updated last year
ChenyuGAO-CS / SMA
The imdb files with SBD-Trans OCR for the TextVQA dataset.
☆11 · Nov 30, 2021 · Updated 4 years ago
jiangnanboy / Image_KIE_LLM
Key information extraction from images of cards, certificates, and receipts with an LLM (large language model). Basically, it can extract key information from …
☆15 · Jul 22, 2024 · Updated 2 years ago
taolusi / SECURE
ACL'2024-Main: Synergetic Event Understanding: A Collaborative Approach to Cross-Document Event Coreference Resolution with Large Languag…
☆12 · Sep 19, 2025 · Updated 10 months ago
RylonW / DocNLC
Official code for DocNLC: A Document Image Enhancement Framework with Normalized and Latent Contrastive Representation for Multiple Degra…
☆44 · Mar 20, 2026 · Updated 4 months ago
guoxy25 / Ocean-OCR
☆48 · Feb 7, 2025 · Updated last year
yingqichao / From-Image-to-Imuge-Immunized-Image-Generation
From Image to Imuge: Immunized Image Generation, official code, implemented in PyTorch, ACMMM 2021 paper
☆21 · Apr 2, 2022 · Updated 4 years ago
ibaiGorordo / ONNX-msg_chn_wacv20-depth-completion
Python script for performing depth completion from sparse depth and RGB images using the msg_chn_wacv20 model in ONNX
☆25 · Oct 4, 2021 · Updated 4 years ago
Alore111 / Monocular_Camera_Speed_Measurement
YOLO + DeepSort + DepthAnything: distance and speed measurement with a monocular camera
☆59 · Apr 24, 2025 · Updated last year
ChengxuLiu / FrDiff
[ICCV'25] Frequency Domain-Based Diffusion Model for Unpaired Image Dehazing
☆28 · Sep 9, 2025 · Updated 10 months ago
pufeiyang / CT-Lung-segment
Algorithms for lung contour segmentation, lung trachea segmentation and pulmonary blood vessel segmentation based on CT images.
☆15 · Nov 18, 2021 · Updated 4 years ago
ziyangyeh / iMeshSegNet
iMeshSegNet implementations.
☆19 · Sep 5, 2023 · Updated 2 years ago
Attendfov163com / chinese-layoutlm-v2
Multimodal language model for Chinese document understanding; supports multimodal document information extraction and document embedding
☆12 · Jun 26, 2022 · Updated 4 years ago
5chen / C2SSM
(CVPR2026) Scan Clusters, Not Pixels: A Cluster-Centric Paradigm for Efficient Ultra-high-definition Image Restoration
☆27 · Apr 16, 2026 · Updated 3 months ago
DocTron-hub / OCRVerse
OCRVerse: Towards Holistic OCR in End-to-End Vision-Language Models
☆30 · Feb 4, 2026 · Updated 5 months ago
goponycn / ponyexam
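ponyexam (小马考试通) open-source edition is a free, open-source online exam system built in PHP for quickly setting up an online exam platform. It supports multiple question types, including single-choice, multiple-choice, true/false, fill-in-the-blank, and essay questions. It supports online exams with both automatic and manual grading, and suits schools, training organizations, corporate training, and similar settings.
☆15 · Sep 30, 2022 · Updated 3 years ago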
Shef-AIRE / llms_post-ocr_correction
Leveraging LLMs for Post-OCR Correction of Historical Newspapers
☆18 · May 12, 2026 · Updated 2 months ago
CV-Reimplementation / TraditionalDocumentShadowRemoval
Several traditional methods for document shadow removal
☆10 · Apr 29, 2024 · Updated 2 years ago
HANDS-FREE / facenet_demo
A ROS node that uses face_net for face recognition
☆12 · Jul 27, 2016 · Updated 10 years ago
Toon-nooT / notebooks
☆17 · Updated this week
ming053l / PhaSR
PhaSR: Generalized Image Shadow Removal with Physically Aligned Priors (CVPR 2026)
☆36 · Jun 24, 2026 · Updated last month
hzauzxb / guidance-ocr
For visual information extraction tasks: uses OCR recognition results to constrain the answers of a multimodal large model
☆44 · Dec 31, 2024 · Updated last year
KahimWong / ADCD-Net
[ICCV'25] ADCD-Net: Robust Document Image Forgery Localization via Adaptive DCT Feature and Hierarchical Content Disentanglement
☆26 · Mar 29, 2026 · Updated 4 months ago