中文论文、证券类、财报类PDF数据
☆39Jun 13, 2024Updated last year
Alternatives and similar repositories for ChineseDocumentPDF
Users that are interested in ChineseDocumentPDF are comparing it to the libraries listed below
Sorting:
- 360LayoutAnaylsis, a series Document Analysis Models and Datasets deleveped by 360 AI Research Institute☆307Sep 10, 2024Updated last year
- 基于TrOCR + UniMER-1M数据集,训练一个小而美的公式识别模型☆29Updated this week
- CTE: Contextualized Table Extraction Dataset☆17Feb 23, 2023Updated 3 years ago
- 使用onnxruntime部署MOWA:多合一图像扭曲模型,能处理6种图像扭曲任务,依然是包含C++和Python两个版本的程序☆34Jul 7, 2024Updated last year
- CycleCenternet based on MMDetection☆22Jun 28, 2023Updated 2 years ago
- 使用ONNXRuntime部署DeDoDe:"局部特征匹配:检测,不要描述——描述,不要检测"。依然是C++和Python两个版本的程序☆23Dec 22, 2023Updated 2 years ago
- ☆11Feb 23, 2024Updated 2 years ago
- ☆157May 8, 2025Updated 10 months ago
- 【间隙·树·排序算法】 对OCR结果或PDF提取的文本进行版面分析,按人类阅读顺序进行排序。☆162Feb 28, 2024Updated 2 years ago
- Using ch_PP-OCRv4 model with tensorrt☆16Oct 28, 2025Updated 4 months ago
- ☆17Feb 16, 2025Updated last year
- DocBank 文档图像增强数据集,此数据集用于文档图像增强,具体任务包括以下内容:Seal detection & Removal 印章检测 & 移除 ;Watermark detection & Removal 水印检测 & 移除;Document deblurrin…☆46Oct 22, 2024Updated last year
- CDLA: A Chinese document layout analysis (CDLA) dataset☆289Sep 13, 2021Updated 4 years ago
- [ACM'MM 2024 Oral] Official code for "OneChart: Purify the Chart Structural Extraction via One Auxiliary Token"☆261Apr 14, 2025Updated 11 months ago
- A object detection-then-grasping framework base on Libtorch、NCNN、Realsense camera、Kinova Jaco2。☆11Jan 13, 2022Updated 4 years ago
- 检测和提取各种场景图片中的表格区域,并纠正透视和旋转问题 Detect and extract table regions from images in various scenarios, and correct perspective and rotation i…☆117Dec 10, 2024Updated last year
- ☆28Oct 14, 2024Updated last year
- Detects scene change or cuts in a video file☆11Oct 23, 2017Updated 8 years ago
- Analysis of Chinese and English layouts 中英文版面分析☆269Mar 6, 2026Updated 2 weeks ago
- ONNX models of YOLO-World (an open-vocabulary object detection).☆25Jun 29, 2024Updated last year
- Dataset of PNG images from synthetically generated table layouts with annotations in JSONL files☆152Sep 17, 2025Updated 6 months ago
- The tampered text detection dataset☆22Aug 23, 2023Updated 2 years ago
- [NO MAINTAINED] Delphi SDK for LeanCloud BaaS demo☆12Dec 6, 2019Updated 6 years ago
- What Is a Good Caption? A Comprehensive Visual Caption Benchmark for Evaluating Both Correctness and Thoroughness☆26May 16, 2025Updated 10 months ago
- 修正文档扭曲/模糊/阴影等情况,使用onnx模型简单轻量部署,未来持续跟进最新最好的文档矫正方案和模型,Correct document distortion using a lightweight ONNX model for easy deployment. We wi…☆98Dec 17, 2025Updated 3 months ago
- This repository summaries publications on Recognition of Handwritten Mathematical Expressions☆15Oct 27, 2017Updated 8 years ago
- eSNN - Learning similarity measure from data☆12Nov 28, 2019Updated 6 years ago
- ☆143Feb 13, 2024Updated 2 years ago
- OCRVerse: Towards Holistic OCR in End-to-End Vision-Language Models☆29Feb 4, 2026Updated last month
- A playbook for systematically maximizing the performance of deep learning models.☆25Jun 15, 2024Updated last year
- 利用Swin-Unet(Swin Transformer Unet)实现对文档图片里表格结构的识别,Swin-unet (Swin Transformer Unet) is used to identify the document table structure☆28Feb 23, 2024Updated 2 years ago
- ☆108Feb 16, 2021Updated 5 years ago
- wails-terminal☆11Mar 7, 2023Updated 3 years ago
- XFUND: A Multilingual Form Understanding Benchmark☆217Jul 15, 2022Updated 3 years ago
- Repository for ACL2020 paper "Refer360° A Referring Expression Recognition Dataset in 360°Images"☆13Jun 26, 2021Updated 4 years ago
- an open high-performance Optical Character Recognition (OCR) toolkit☆306Jul 24, 2025Updated 7 months ago
- ☆11Jul 15, 2014Updated 11 years ago
- The YOLOv12 C++ TensorRT Project in C++ and optimized using NVIDIA TensorRT☆26Oct 22, 2025Updated 4 months ago
- 整理目前开源的最优表格识别模型,完善前后处理,模型转换为ONNX | Organize the currently open-source optimal table recognition models, improve pre-processing and post-…☆933Aug 3, 2025Updated 7 months ago