PDF data for Chinese academic papers, securities documents, and financial reports
☆40Jun 13, 2024Updated last year
Alternatives and similar repositories for ChineseDocumentPDF
Users that are interested in ChineseDocumentPDF are comparing it to the libraries listed below.
- 360LayoutAnalysis, a series of document analysis models and datasets developed by 360 AI Research Institute☆307Sep 10, 2024Updated last year
- Trains a small but capable formula recognition model based on TrOCR and the UniMER-1M dataset☆29Mar 17, 2026Updated 2 months ago
- CTE: Contextualized Table Extraction Dataset☆17Feb 23, 2023Updated 3 years ago
- ☆23Updated this week
- Deploys MOWA, an all-in-one image warping model that handles six image warping tasks, with onnxruntime; as usual, includes both C++ and Python versions☆34Jul 7, 2024Updated last year
- CycleCenternet based on MMDetection☆22Jun 28, 2023Updated 2 years ago
- Runnable Claude Code source code☆50Mar 31, 2026Updated last month
- Deploys DeDoDe ("Detect, Don't Describe; Describe, Don't Detect" for local feature matching) with ONNXRuntime; as usual, in both C++ and Python versions☆23Dec 22, 2023Updated 2 years ago
- [IJCAI-2024] The official code of Self-Supervised Pre-training with Symmetric Superimposition Modeling for Scene Text Recognition☆10Aug 10, 2025Updated 9 months ago
- onnx-java: loads ONNX models in Java and runs inference.☆21May 19, 2022Updated 4 years ago
- Demo: GitHub search with Manticore Search☆14Aug 16, 2025Updated 9 months ago
- Julia wrapper for AlexeyAB's fork of Darknet for YOLOV4/3/2 Object Detection☆16Nov 24, 2025Updated 5 months ago
- Youtu-Parsing: Perception, Structuring and Recognition via High-Parallelism Decoding☆69Feb 10, 2026Updated 3 months ago
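- 2024.06.19 This project uses Chinese-CLIP to build a text-to-image and image-to-image search page, aiming to help users get started quickly with cross-modal retrieval. The code uses about 190,000 images (189,585) from the MUGE dataset as the gallery data. The project provides feature extraction, retrieval, and UI code.☆23Jun 20, 2024Updated last year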
- [Gap · Tree · Sorting Algorithm] Performs layout analysis on OCR results or text extracted from PDFs and sorts it in human reading order.☆164Feb 28, 2024Updated 2 years ago
- ☆20Feb 16, 2025Updated last year
- A pure Python 3 library for writing LaTeX formulas to Word.☆15Jul 22, 2025Updated 9 months ago
- IEEE VCIP 2021: AnomalyHop: An SSL-based Image Anomaly Localization Method☆14Sep 18, 2021Updated 4 years ago
- DocBank document image enhancement dataset. This dataset is used for document image enhancement; specific tasks include: seal detection & removal; watermark detection & removal; document deblurrin…☆48Oct 22, 2024Updated last year
- CDLA: A Chinese document layout analysis (CDLA) dataset☆294Sep 13, 2021Updated 4 years ago
- [PR 2025] The official GitHub page of "MegaHan97K: A Large-Scale Dataset for Mega-Category Chinese Character Recognition with over 97K Ca…☆81Apr 13, 2026Updated last month
- ☆28Oct 14, 2024Updated last year
- [ACM'MM 2024 Oral] Official code for "OneChart: Purify the Chart Structural Extraction via One Auxiliary Token"☆265Apr 14, 2025Updated last year
- An object detection-then-grasping framework based on Libtorch, NCNN, a Realsense camera, and Kinova Jaco2.☆11Jan 13, 2022Updated 4 years ago
- Detect and extract table regions from images in various scenarios, and correct perspective and rotation i…☆117Dec 10, 2024Updated last year
- ☆25Jul 5, 2024Updated last year
- Analysis of Chinese and English layouts☆272Mar 24, 2026Updated last month
- The tampered text detection dataset☆23Aug 23, 2023Updated 2 years ago
- Dataset of PNG images from synthetically generated table layouts with annotations in JSONL files☆153Sep 17, 2025Updated 8 months ago
- Generates table images by rendering them in a browser☆239Apr 10, 2024Updated 2 years ago
- NAF-DPM: A Nonlinear Activation-Free Diffusion Probabilistic Model for Document Enhancement☆53Aug 5, 2024Updated last year
- shot_boundary_detection☆10Nov 26, 2019Updated 6 years ago
- ☆11Jan 27, 2020Updated 6 years ago
- What Is a Good Caption? A Comprehensive Visual Caption Benchmark for Evaluating Both Correctness and Thoroughness☆27May 16, 2025Updated last year
- This repository summarizes publications on the recognition of handwritten mathematical expressions☆15Oct 27, 2017Updated 8 years ago
- Corrects document distortion, blur, shadows, and similar defects using a lightweight ONNX model for easy deployment. We wi…☆102Dec 17, 2025Updated 5 months ago
- ☆142Feb 13, 2024Updated 2 years ago
- LambChat — A multi-tenant AI Agent Harness Platform. Skills + MCP dual-engine powered, built for scale and isolation. SSE real-time strea…☆131Updated this week
- 🛠️ DeepAgent: A General Reasoning Agent with Scalable Toolsets☆53Dec 31, 2025Updated 4 months ago