RapidAI/RapidDocEx

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/RapidAI/RapidDocEx)

RapidAI / RapidDocEx

📝 针对文档类图像做内容提取，将文档类图像一比一输出到Word或者Txt中，便于进一步使用或处理。后续计划支持输入PDF/图像，输出对应json格式、Txt格式、Word格式和Markdown格式。

☆208

Alternatives and similar repositories for RapidDocEx

Users that are interested in RapidDocEx are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

RapidAI / RapidLayout
View on GitHub
Analysis of Chinese and English layouts 中英文版面分析
☆275Mar 24, 2026Updated 3 months ago
RapidAI / RapidUnDistort
View on GitHub
修正文档扭曲/模糊/阴影等情况，使用onnx模型简单轻量部署，未来持续跟进最新最好的文档矫正方案和模型,Correct document distortion using a lightweight ONNX model for easy deployment. We wi…
☆105Dec 17, 2025Updated 7 months ago
RapidAI / TableStructureRec
View on GitHub
整理目前开源的最优表格识别模型，完善前后处理，模型转换为ONNX | Organize the currently open-source optimal table recognition models, improve pre-processing and post-…
☆954Aug 3, 2025Updated 11 months ago
RapidAI / RapidTable
View on GitHub
基于序列表格识别算法推理库，集成PP-Structure和modelscope等表格识别算法。
☆432Apr 23, 2026Updated 2 months ago
weavel-ai / Ape
View on GitHub
Your first AI prompt engineer
☆415Jul 1, 2025Updated last year
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
RapidAI / RapidOCRPDF
View on GitHub
Based on RapidOCR, extract the PDF content
☆191Mar 6, 2026Updated 4 months ago
RapidAI / RapidTableDetection
View on GitHub
检测和提取各种场景图片中的表格区域，并纠正透视和旋转问题 Detect and extract table regions from images in various scenarios, and correct perspective and rotation i…
☆119Dec 10, 2024Updated last year
opendatalab / DocLayout-YOLO
View on GitHub
DocLayout-YOLO: Enhancing Document Layout Analysis through Diverse Synthetic Data and Global-to-Local Adaptive Perception
☆2,232Apr 14, 2025Updated last year
360AILAB-NLP / 360LayoutAnalysis
View on GitHub
360LayoutAnaylsis, a series Document Analysis Models and Datasets deleveped by 360 AI Research Institute
☆305Sep 10, 2024Updated last year
SWHL / TrOCR-Formula-Rec
View on GitHub
基于TrOCR + UniMER-1M数据集，训练一个小而美的公式识别模型
☆30Mar 17, 2026Updated 4 months ago
amjadraza / pandasai-app-gradio
View on GitHub
☆54Oct 13, 2024Updated last year
RapidAI / RapidOCR
View on GitHub
📄 Awesome OCR multiple programing languages toolkits based on ONNX Runtime, OpenVINO, MNN, PaddlePaddle, TensorRT and PyTorch.
☆7,214Jul 9, 2026Updated last week
opendatalab / magic-html
View on GitHub
☆541May 13, 2026Updated 2 months ago
stephane-caron / matplotlive
View on GitHub
Stream live plots to a matplotlib figure
☆79Jul 3, 2026Updated 2 weeks ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
Gmgge / TrOCR-Seal-Recognition
View on GitHub
基于transformer的ocr识别，在公章(印章识别, seal recognition）拓展应用
☆297Oct 24, 2025Updated 8 months ago
ollama-webui / ollama-modelfiles
View on GitHub
Ollama Modelfiles - Discover more at OllamaHub
☆20Dec 2, 2023Updated 2 years ago
elpassion / buildel
View on GitHub
AI Automation for everybody
☆164May 21, 2025Updated last year
EHEWON / ezwork-ai-doc-translation
View on GitHub
EZ-Work AI文档翻译，人人可用的开源AI文档翻译助手，可以快速低成本调用OpenAI等大语言模型api，帮助您实现txt/markdown/word/csv/excel/pdf/ppt的文档翻译。
☆253Mar 27, 2025Updated last year
Ucas-HaoranWei / GOT-OCR2.0
View on GitHub
Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model
☆8,154Feb 10, 2025Updated last year
jingsongliujing / OnnxOCR
View on GitHub
基于PaddleOCR重构，并且脱离PaddlePaddle深度学习训练框架的轻量级OCR，推理速度超快 —— A lightweight OCR system based on PaddleOCR, decoupled from the PaddlePaddle d…
☆1,836Jun 11, 2026Updated last month
Theigrams / g1
View on GitHub
g1: Using GPT-4o to create o1-like reasoning chains
☆20Sep 17, 2024Updated last year
FreeOCR-AI / layoutreader
View on GitHub
A Faster LayoutReader Model based on LayoutLMv3, Sort OCR bboxes to reading order.
☆322Aug 15, 2025Updated 11 months ago
opendatalab / PDF-Extract-Kit
View on GitHub
A Comprehensive Toolkit for High-Quality PDF Content Extraction
☆9,796Jan 3, 2025Updated last year
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
Vaibhavs10 / llama-assistant
View on GitHub
☆171Aug 16, 2024Updated last year
ZHO-ZHO-ZHO / ComfyUI-Animated-optical-illusions
View on GitHub
Animated optical illusions in ComfyUI
☆21Jun 14, 2024Updated 2 years ago
bingqiang2021 / AIGC-Search
View on GitHub
☆11Aug 26, 2024Updated last year
shibing624 / ChatPilot
View on GitHub
ChatPilot: Chat Agent Web UI，实现Chat对话前端，支持Google搜索、文件网址对话（RAG）、代码解释器功能，复现了Kimi Chat(文件，拖进来；网址，发出来)。
☆600Jan 27, 2026Updated 5 months ago
CosmosShadow / gptpdf
View on GitHub
Using GPT to parse PDF
☆3,558Apr 17, 2025Updated last year
Sanster / OhMyTable
View on GitHub
Table Structure Recognition
☆28Jul 25, 2024Updated last year
NoEdgeAI / pdfdeal
View on GitHub
A python wrapper for the Doc2X API and comes with native texts processing (to improve PDF recall in RAG). | Doc2X API的python封装，同时附带本地的文本处…
☆286Mar 18, 2026Updated 4 months ago
upstash / wikipedia-semantic-search
View on GitHub
Semantic Search on Wikipedia with Upstash Vector
☆470Dec 12, 2025Updated 7 months ago
jiangnanboy / table_structure_recognition
View on GitHub
利用Swin-Unet(Swin Transformer Unet)实现对文档图片里表格结构的识别，Swin-unet (Swin Transformer Unet) is used to identify the document table structure
☆27Feb 23, 2024Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
Topdu / OpenOCR
View on GitHub
OpenOCR: An Open-Source Toolkit for General-OCR Research and Applications, integrates a unified training and evaluation benchmark, commer…
☆1,415May 20, 2026Updated 2 months ago
X-PLUG / mPLUG-DocOwl
View on GitHub
mPLUG-DocOwl: Modularized Multimodal Large Language Model for Document Understanding
☆2,408May 30, 2025Updated last year
ninehills / POINTS-Reader
View on GitHub
POINTS-Reader train
☆20Sep 20, 2025Updated 10 months ago
jjleng / sensei
View on GitHub
Yet another open source Perplexity
☆463Oct 20, 2024Updated last year
rahulnyk / knowledge_graph_maker
View on GitHub
☆257Aug 15, 2024Updated last year
docmee / aippt-api-python-demo
View on GitHub
Python 接入文多多AiPPT，通过主题/文件/网址等方式生成PPT，支持原生图表、动画、3D特效等复杂PPT的解析和渲染，支持用户自定义模板，支持智能添加动画。AI generates PowerPoint Presentation, Supports parsing…
☆32Nov 4, 2024Updated last year
Yusuke710 / nanoPerplexityAI
View on GitHub
The simplest open-source implementation of perplexity.ai
☆336Jan 24, 2025Updated last year