DocBank 文档图像增强数据集,此数据集用于文档图像增强,具体任务包括以下内容:Seal detection & Removal 印章检测 & 移除 ;Watermark detection & Removal 水印检测 & 移除;Document deblurring 文档去模糊;Document shadow removal 文档去阴影;Document super-resolution 文档超分;Document Low-Light Enhancement 文档低光增强
☆44Oct 22, 2024Updated last year
Alternatives and similar repositories for DocBank-Document-Enhancement-Dataset
Users that are interested in DocBank-Document-Enhancement-Dataset are comparing it to the libraries listed below
Sorting:
- ☆19Sep 14, 2024Updated last year
- IFTG (ImageFromTextGenerator) is a Python package that simplifies creating robust datasets for OCR models. Generate images from text, app…☆20Nov 7, 2025Updated 4 months ago
- 表格结构识别LGPMA推理☆25Nov 17, 2022Updated 3 years ago
- ACM Multimedia 2023: DocDiff: Document Enhancement via Residual Diffusion Models. Also contains 1597 red seals in Chinese scenes, along w…☆338Aug 22, 2024Updated last year
- Practical AI with PyTorch 2024☆33Jan 5, 2025Updated last year
- This repository contains a paper collection of the methods for document image processing, including appearance enhancement, deshadowing, …☆348Feb 4, 2026Updated last month
- 机器学习使用过的API中文版及机器学习的理论知识☆13Jun 8, 2025Updated 9 months ago
- ☆48Feb 7, 2025Updated last year
- 用Paddle复现论文ChineseBERT: Chinese Pretraining Enhanced by Glyph and Pinyin Information(ACL2021)☆10Nov 15, 2021Updated 4 years ago
- 视觉信息抽取任务中,使用OCR识别 结果规范多模态大模型的回答☆44Dec 31, 2024Updated last year
- 夸克网盘转存工具☆11Aug 27, 2024Updated last year
- Parses a document (scanned or phone captured) and returns the underlying question - answer layout structured capture by LayoutXLM model☆10Jun 14, 2021Updated 4 years ago
- 中文文档理解多模态语言模型,支持多模态文档信息抽取,文档embedding☆12Jun 26, 2022Updated 3 years ago
- Emotion classification of speech using GMMHMMs☆10Jul 1, 2016Updated 9 years ago
- ACL'2024-Main: Synergetic Event Understanding: A Collaborative Approach to Cross-Document Event Coreference Resolution with Large Languag…☆12Sep 19, 2025Updated 5 months ago
- Gemini API for OCR☆15Nov 17, 2025Updated 3 months ago
- Used in M4C feature extraction script: https://github.com/facebookresearch/mmf/blob/project/m4c/projects/M4C/scripts/extract_ocr_frcn_fea…☆13Jan 30, 2020Updated 6 years ago
- CUDA code with exact k-NN algorithm for multiple GPU system.☆12Jul 5, 2024Updated last year
- Mirror of the C++ ZXing library☆19Jul 21, 2020Updated 5 years ago
- Personal blog website, will not update anymore. https://shixiangwang.github.io is new site☆10Aug 13, 2019Updated 6 years ago
- ☆14Sep 6, 2024Updated last year
- 基于NtChat项目的HTTPapi☆10Nov 17, 2022Updated 3 years ago
- ☆75Jul 31, 2025Updated 7 months ago
- chinese document classification of layoutlmv3 and layoutxlm☆46Oct 25, 2022Updated 3 years ago
- ☆11Jul 28, 2025Updated 7 months ago
- 根据夏曹俊老师的课程,整理出来的demo☆10Aug 19, 2019Updated 6 years ago
- SWIS: Self-Supervised Representation Learning For Writer Independent Offline Signature Verification", ICIP 2022 (Oral)☆11Feb 17, 2023Updated 3 years ago
- ☆12Aug 24, 2020Updated 5 years ago
- a ros node using face_net do face_recognition☆12Jul 27, 2016Updated 9 years ago
- TensorRT☆11Sep 22, 2020Updated 5 years ago
- Ultimate NLP Toolkit for GPUs: RAPIDS-AI, PyTorch, NeMo, Tensorboard, TensorRT, CUDA 10.1☆10Mar 19, 2020Updated 5 years ago
- Graph Key Information Extraction: GKIE☆11Sep 15, 2022Updated 3 years ago
- The source code repository for the paper.☆21Sep 8, 2025Updated 6 months ago
- 🤡 An up-to-date & curated list of awesome KBQA papers, methods & resources.☆10Jul 14, 2022Updated 3 years ago
- Recent OCR and related works on PaddlePaddle 2.0☆12May 15, 2021Updated 4 years ago
- [EMNLP 2022] Distillation-Resistant Watermarking (DRW) for Model Protection in NLP☆13Aug 17, 2023Updated 2 years ago
- For easier and more readable tensorflow codes☆13Sep 1, 2019Updated 6 years ago
- AI Infrastructure Engineer Learning Track - Production ML infrastructure curriculum (2-4 years experience)☆39Nov 3, 2025Updated 4 months ago
- A dataset for tooth structured instance segmentation of dental panoramic X-ray.☆15May 17, 2024Updated last year