DocBank 文档图像增强数据集,此数据集用于文档图像增强,具体任务包括以下内容:Seal detection & Removal 印章检测 & 移除 ;Watermark detection & Removal 水印检测 & 移除;Document deblurring 文档去模糊;Document shadow removal 文档去阴影;Document super-resolution 文档超分;Document Low-Light Enhancement 文档低光增强
☆48Oct 22, 2024Updated last year
Alternatives and similar repositories for DocBank-Document-Enhancement-Dataset
Users that are interested in DocBank-Document-Enhancement-Dataset are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆19Sep 14, 2024Updated last year
- IFTG (ImageFromTextGenerator) is a Python package that simplifies creating robust datasets for OCR models. Generate images from text, app…☆21Nov 7, 2025Updated 6 months ago
- Finetune Stable Video Diffusion with Lora☆20Feb 3, 2024Updated 2 years ago
- 基于transformer的ocr识别,在公章(印章识别, seal recognition)拓展应用☆293Oct 24, 2025Updated 6 months ago
- [IJCAI 2025 (Oral)] Offical implementation of the paper "MagicTailor: Component-Controllable Personalization in Text-to-Image Diffusion …☆101Jan 18, 2026Updated 3 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- This repository contains a paper collection of the methods for document image processing, including appearance enhancement, deshadowing, …☆373Mar 10, 2026Updated last month
- 表格结构识别LGPMA推理☆25Nov 17, 2022Updated 3 years ago
- 基于NtChat项目的HTTPapi☆10Nov 17, 2022Updated 3 years ago
- 印章检测和印章文字识别☆22Mar 29, 2024Updated 2 years ago
- A function that takes as input a cropped text line image, and outputs the dewarped image.☆21Sep 2, 2025Updated 8 months ago
- A fast method for real face morphing (一个可以快速部署实现 的人脸变形方法)☆11May 31, 2022Updated 3 years ago
- Used in M4C feature extraction script: https://github.com/facebookresearch/mmf/blob/project/m4c/projects/M4C/scripts/extract_ocr_frcn_fea…☆13Jan 30, 2020Updated 6 years ago
- ☆80Jul 31, 2025Updated 9 months ago
- 中文论文、证券类、财报类PDF数据☆39Jun 13, 2024Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- The imdb files with SBD-Trans OCR for TextVQA dataset.☆11Nov 30, 2021Updated 4 years ago
- 利用llm大语言模型提取卡证票据关键信息。Key Information Extraction from Image with LLM(large language model).Basically, it can extract key information from …☆16Jul 22, 2024Updated last year
- ☆48Feb 7, 2025Updated last year
- College project about article http://www.cs.ust.hk/~quan/publications/yuan-deblur-siggraph07.pdf☆10Jan 25, 2013Updated 13 years ago
- 测试桌面端ncnn c++算法☆17Jun 15, 2025Updated 10 months ago
- 中文文档理解多模态语言模型,支持多模态文档信息抽取,文档embedding☆12Jun 26, 2022Updated 3 years ago
- ThinkGen: Generalized Thinking for Visual Generation☆53Dec 30, 2025Updated 4 months ago
- A PyTorch Implementation of PlaNet: A Deep Planning Network for Reinforcement Learning☆12Aug 31, 2020Updated 5 years ago
- a ros node using face_net do face_recognition☆12Jul 27, 2016Updated 9 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- PyTorch implementation of InstructAny2Pix: Flexible Visual Editing via Multimodal Instruction Following☆32Jan 24, 2025Updated last year
- ☆17Jul 30, 2024Updated last year
- 🎉🎨 This repository contains a reading list of papers on Embodied AI, including LLM/MLLM/VLA.☆13Aug 18, 2025Updated 8 months ago
- Mirror of the C++ ZXing library☆19Jul 21, 2020Updated 5 years ago
- 视觉信息抽取任务中,使用OCR识别结果规范多模态大模型的回答☆44Dec 31, 2024Updated last year
- Semantic-Guided Diffusion Model for Single-Step Image Super-Resolution☆22Jun 10, 2025Updated 10 months ago
- 文档图像处理工具(Document image processing tool),包括漂白 / 文字方向矫正 / 清晰增强 / 笔记去噪美化 / 去阴影 / 扭曲矫正 / 切边增强(DocBleach / TextOrientationCorrection / DocSha…☆130Aug 27, 2024Updated last year
- chinese document classification of layoutlmv3 and layoutxlm☆45Oct 25, 2022Updated 3 years ago
- Recognition of Various Common Seal Scans in Complex Environments☆48May 28, 2024Updated last year
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- Compute benchmark of table structure recognition.☆28Dec 2, 2025Updated 5 months ago
- ☆13Nov 9, 2014Updated 11 years ago
- 🔥Char detection base on crnn 字符(单字)检测基于CRNN☆89May 16, 2023Updated 2 years ago
- Recent OCR and related works on PaddlePaddle 2.0☆12May 15, 2021Updated 4 years ago
- ☆44Jul 23, 2023Updated 2 years ago
- the code for CT MAR☆26Jan 31, 2024Updated 2 years ago
- A Chinese characters recognition repository with tensorrt format supported based on CRNN_Chinese_Characters_Rec and TensorRTx.☆18Mar 11, 2021Updated 5 years ago