☆19Feb 5, 2026Updated 3 months ago
Alternatives and similar repositories for Document-AI
Users that are interested in Document-AI are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- This repository is for ExcelTableCNN project - open source automatic table detection on Excel sheets with computer vision☆15Jan 31, 2025Updated last year
- HTML in Python☆12Jul 19, 2024Updated last year
- Repository to use/train segmentation models for document layout analysis☆19Jan 13, 2022Updated 4 years ago
- ☆14Sep 6, 2024Updated last year
- This repo consists of the code as discussed in the Medium blog.☆17Sep 10, 2023Updated 2 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Binarizing Documents by Leveraging both Space and Frequency. (ICDAR 2024)☆16May 15, 2025Updated last year
- Faster access to Tesseract-OCR from Python☆13Jun 8, 2021Updated 4 years ago
- Automated Document Intelligence Workflow☆40Nov 18, 2025Updated 6 months ago
- 微调阿里开源的文字检测模型,利用合合识别返回的OCR结果作为初始训练数据,对模型进行优化训练,使其更加适应1万张图片的具体场景,提高文字识别的精度。☆10Dec 9, 2024Updated last year
- 百度网盘AI大赛——图像处理挑战赛:文档图像摩尔纹消除第2名方案☆43Nov 28, 2023Updated 2 years ago
- 通用版面分析 | 中文文档解析 |Document Layout Analysis | layout paser☆49Jun 13, 2024Updated last year
- 基于pdfium的pdf/ofd双引擎解析渲染引擎☆13Oct 15, 2024Updated last year
- 基于golang go语言(beego框架)下的ONLYOFFICE Document Server二次开发。 主要功能为文档的上传、预览、覆盖、回调等功能。☆10Oct 20, 2023Updated 2 years ago
- 电子病历标注工具DEMO☆13Jun 21, 2019Updated 6 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- ☆19Jul 7, 2025Updated 10 months ago
- 3D Slicer extension for SegmentAnyBone developed by Mazurowski Lab☆15Feb 25, 2026Updated 2 months ago
- Accelerating GOT-OCRv2 with VLLM☆10Nov 15, 2024Updated last year
- 在监控画质下实现对校园自行车的重识别,包含REID模型识别,向量数据库检索,UI展示☆11Feb 13, 2024Updated 2 years ago
- Use fastAPI to generate html web app that will serve a local directory or S3 bucket of images☆11Jan 18, 2021Updated 5 years ago
- LLM-driven automated knowledge graph construction from text using DSPy and Neo4j☆19Aug 19, 2024Updated last year
- Empowering RAG with a versatile model-driven data interface for all-purpose applications!☆17Sep 10, 2024Updated last year
- GPGPU on Android☆13Feb 16, 2023Updated 3 years ago
- This repo content all the dataset, the record and the config that were used in training a TensorFlow pedestrian detector model.☆15Jun 7, 2019Updated 6 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Tools for digitized document quality assessment.☆16Dec 8, 2022Updated 3 years ago
- Tutorial on how to use the SHAP library to explain the feature importance with Shapley values.☆19Sep 19, 2019Updated 6 years ago
- onlyoffice 破解版,支持x86和arm64架构,arm架构切换到arm分支即可☆15Dec 16, 2022Updated 3 years ago
- 🤗🧑🚀 A collection of tools to help you deploy, bundle HuggingFace Spaces and related assets with ease.☆21Apr 15, 2026Updated last month
- Complex data extraction and orchestration framework designed for processing unstructured documents. It integrates AI-powered document pip…☆89May 13, 2026Updated last week
- A live camera C++ example on a Raspberry Pi in OpenCV☆12Dec 7, 2021Updated 4 years ago
- The WordScape repository contains code for the WordScape pipeline to create datasets to train document understanding models.☆41Dec 7, 2023Updated 2 years ago
- Simple, Fast, Powerful and Easily extensible python package for extracting patterns from text, with over than 60 predefined Regular Expre…☆25Nov 26, 2022Updated 3 years ago
- 该项目是为了使用layoutlmv3针对中文图片训练和推理。 其中主要解决三个问题: 1.数据标准化成可以的训练数据集格式 2.layoutlmv3-base-chinese 分词修改 2.超过512长度的文本切分和滑窗操作☆64Sep 6, 2024Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆10May 25, 2022Updated 3 years ago
- ofd file view☆17Mar 24, 2025Updated last year
- 'ocr-evaluation-tools' from http://ancientgreekocr.org/. Tools to test OCR accuracy.☆23Feb 21, 2018Updated 8 years ago
- Computer Vision: Python OCR & Object Detection Quick Starter, by Packt Publishing☆13Dec 15, 2025Updated 5 months ago
- 一个完整的智能分诊系统实现☆21May 31, 2022Updated 3 years ago
- ☆18May 31, 2023Updated 2 years ago
- 大数据生态解决方案基础平台: 搜索系统、公共系统、任务管理系统、数据binlog采集、基础爬虫系统、数据传输系统、运维告警系统、APM、报表系统☆11Jan 25, 2021Updated 5 years ago