文档图像处理工具(Document image processing tool),包括漂白 / 文字方向矫正 / 清晰增强 / 笔记去噪美化 / 去阴影 / 扭曲矫正 / 切边增强(DocBleach / TextOrientationCorrection / DocSharpening / HandwritingDenoisingBeautifying / DocShadowRemoval / document_image_dewarping / DocTrimmingEnhancement)。
☆120Aug 27, 2024Updated last year
Alternatives and similar repositories for Doc-Image-Tool
Users that are interested in Doc-Image-Tool are comparing it to the libraries listed below
Sorting:
- ☆41Nov 13, 2023Updated 2 years ago
- Unofficial implementation of DocMAE (WIP): Document Image Rectification via Self-supervised Representation Learning☆20Dec 20, 2023Updated 2 years ago
- DocTr++ in PaddlePaddle☆58Jul 24, 2024Updated last year
- Project page for the ICDAR 2023 Paper "Inv3D: a high-resolution 3D invoice dataset for template-guided single-image document unwarping".☆14Dec 21, 2023Updated 2 years ago
- 修正文档扭曲/模糊/阴影等情况,使用onnx模型简单轻量部署,未来持续跟进最新最好的文档矫正方案和模型,Correct document distortion using a lightweight ONNX model for easy deployment. We wi…☆95Dec 17, 2025Updated 2 months ago
- ☆14Jun 10, 2025Updated 8 months ago
- Inference, training and evaluation code for our paper "DocMatcher: Document Image Dewarping via Structural and Textual Line Matching" (WA…☆50Jul 1, 2025Updated 8 months ago
- pdf invoice parser,pdf-ofd发票解析。☆40Jul 15, 2024Updated last year
- 利用llm大语言模型提取卡证票据关键信息。Key Information Extraction from Image with LLM(large language model).Basically, it can extract key information from …☆16Jul 22, 2024Updated last year
- [TAI 2023] Appearance Enhancement for Camera-captured Document Images in the Wild☆51Aug 28, 2025Updated 6 months ago
- The official code for “Deep Unrestricted Document Image Rectification”, TMM, 2023.☆505Feb 1, 2026Updated last month
- Official code for DocNLC: A Document Image Enhancement Framework with Normalized and Latent Contrastive Representation for Multiple Degra…☆39May 28, 2025Updated 9 months ago
- [CVPR 2024] DocRes: A Generalist Model Toward Unifying Document Image Restoration Tasks☆566Aug 3, 2025Updated 7 months ago
- 【间隙·树·排序算法】 对OCR结果或PDF提取的文本进行版面分析,按人类阅读顺序进行排序。☆162Feb 28, 2024Updated 2 years ago
- 基于pdfium的pdf/ofd双引擎解析渲染引擎☆14Oct 15, 2024Updated last year
- The official code for “DeepEraser: Deep Iterative Context Mining for Generic Text Eraser”, TMM, 2024.☆48Aug 26, 2024Updated last year
- 表格结构识别LGPMA推理☆25Nov 17, 2022Updated 3 years ago
- ☆102Dec 23, 2024Updated last year
- Official PyTorch implementation for ACM MM22 "UDoc-GAN: Unpaired Document Illumination Correction with Background Light Prior"☆25Aug 5, 2024Updated last year
- 这里将paddle中的ocr等模型转为onnx格式,并利用java版深度框架djl加载这些onnx模型进行推理预测尝试。☆13Nov 15, 2022Updated 3 years ago
- ☆14Sep 6, 2024Updated last year
- ☆75Jul 31, 2025Updated 7 months ago
- Ultimate NLP Toolkit for GPUs: RAPIDS-AI, PyTorch, NeMo, Tensorboard, TensorRT, CUDA 10.1☆10Mar 19, 2020Updated 5 years ago
- Graph Key Information Extraction: GKIE☆11Sep 15, 2022Updated 3 years ago
- DocReal: Robust Document Dewarping of Real-Life Images via Attention-Enhanced Control Point Prediction☆27Jun 28, 2023Updated 2 years ago
- The official repo for “WildDoc: How Far Are We from Achieving Comprehensive and Robust Document Understanding in the Wild?“☆71May 19, 2025Updated 9 months ago
- 检测透视图像中的矩形文档并对其进行矫正☆31Sep 16, 2022Updated 3 years ago
- [AAAI 2025] DocKylin: A Large Multimodal Model for Visual Document Understanding with Efficient Visual Slimming☆36Jun 1, 2025Updated 9 months ago
- [CVPR2023] Towards Robust Tampered Text Detection in Document Image: New Dataset and New Solution☆189Feb 4, 2026Updated last month
- DocBank 文档图像增强数据集,此数据集用于文档图像增强,具体任务包括以下内容:Seal detection & Removal 印章检测 & 移除 ;Watermark detection & Removal 水印检测 & 移除;Document deblurrin…☆44Oct 22, 2024Updated last year
- Official implementation of UPOCR: Towards unified pixel-level OCR interface (ICML 2024)☆67Jun 6, 2024Updated last year
- 利用大模型LLM对中文文本、图片以及pdf中的非结构化文本内容进行分析,并提取主-谓-宾(SPO)三元组的知识形式,以及将这些关系可视化为知识图谱。The large LLM model is used to analyze the unstructured text co…☆23Apr 16, 2025Updated 10 months ago
- Document Dewarping with Control Points☆196Oct 7, 2022Updated 3 years ago
- 通过浏览器渲染生成表格图像☆236Apr 10, 2024Updated last year
- 多格式(word/excel/ppt转pdf/ofd, pdf/ofd相互转换)文档转换系统☆17Apr 27, 2024Updated last year
- IFTG (ImageFromTextGenerator) is a Python package that simplifies creating robust datasets for OCR models. Generate images from text, app…☆20Nov 7, 2025Updated 3 months ago
- 卡证和文档检测和矫正☆79Sep 18, 2024Updated last year
- 轻量模型的图像分析web服务,包括倾斜矫正OCR,公章(印章)检测+识别,车牌识别。api方案使用FastAPI+Gunicorn,提供gradio展示。☆102Apr 30, 2024Updated last year
- ☆13Oct 28, 2023Updated 2 years ago