📄 Awesome multi-language OCR toolkits based on ONNX Runtime, OpenVINO, MNN, PaddlePaddle, TensorRT and PyTorch.
☆6,249Mar 19, 2026Updated 3 weeks ago
Alternatives and similar repositories for RapidOCR
Users that are interested in RapidOCR are comparing it to the libraries listed below.
- Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/…☆74,809Apr 2, 2026Updated last week
- rapidocr onnx cpp☆335Mar 25, 2025Updated last year
- Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model☆8,110Feb 10, 2025Updated last year
- 🔥🔥🔥 Java implementation for calling RapidOCR (based on PaddleOCR); works on Mac, Windows, and Linux and supports the latest PP-OCRv4☆558Jun 5, 2024Updated last year
- Document orientation classification☆221Feb 3, 2026Updated 2 months ago
- Collects the best currently open-source table recognition models, improves pre- and post-processing, and converts the models to ONNX☆942Aug 3, 2025Updated 8 months ago
- Free, open-source, offline OCR software. Supports screenshot capture and batch image import, PDF document recognition, exclusion of watermarks and headers/footers, and QR code scanning/generation. Includes built-in multilingual libraries.☆42,988Nov 20, 2025Updated 4 months ago
- A lightweight OCR system rebuilt from PaddleOCR and decoupled from the PaddlePaddle deep learning training framework, with very fast inference☆1,732Nov 1, 2025Updated 5 months ago
- Extracts PDF content using RapidOCR☆187Mar 6, 2026Updated last month
- Ultra-lightweight Chinese OCR with vertical text recognition and ncnn, mnn, and tnn inference (dbnet (1.8M) + crnn (2.5M) + anglenet (378KB)); total model size only 4.7M☆12,273Aug 14, 2023Updated 2 years ago
- Convert the model in PaddleOCR to ONNX format☆115Jul 15, 2025Updated 8 months ago
- OCR, layout analysis, reading order, table recognition in 90+ languages☆19,557Apr 3, 2026Updated last week
- Inference library for sequence-based table recognition, integrating table recognition algorithms such as PP-Structure and modelscope.☆415Sep 4, 2025Updated 7 months ago
- PaddleOCR inference in PyTorch. Converted from [PaddleOCR](https://github.com/PaddlePaddle/PaddleOCR)☆1,155Sep 11, 2025Updated 7 months ago
- Analysis of Chinese and English layouts☆268Mar 24, 2026Updated 2 weeks ago
- Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and …☆29,246Dec 5, 2025Updated 4 months ago
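- Offline OCR command-line program for Windows that recognizes text in images and outputs results as JSON strings for easy use by other programs. Provides APIs for various languages. Compiled from PaddleOCR C++.☆1,466Apr 7, 2025Updated last year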
- A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity…☆15,544Mar 17, 2026Updated 3 weeks ago
- Transforms complex documents like PDFs into LLM-ready markdown/JSON for your Agentic workflows.☆58,131Apr 3, 2026Updated last week
- PyTorch-based OCR toolkit supporting common text detection and recognition algorithms☆1,515Jan 4, 2026Updated 3 months ago
- CnOCR: Awesome Chinese/English OCR Python toolkits based on PyTorch. It comes with 20+ well-trained models for different application scen…☆3,742Feb 7, 2026Updated 2 months ago
- DocLayout-YOLO: Enhancing Document Layout Analysis through Diverse Synthetic Data and Global-to-Local Adaptive Perception☆2,099Apr 14, 2025Updated 11 months ago
- A collection of original, innovative ideas and algorithms towards Advanced Literate Machinery. This project is maintained by the OCR Team…☆1,825Mar 17, 2026Updated 3 weeks ago
- High-performance Inference and Deployment Toolkit for LLMs and VLMs based on PaddlePaddle☆3,671Updated this week
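- Open-source, easy-to-use offline Chinese OCR with recognition accuracy comparable to major vendors; provides an easy-to-use web page and web API for everyday use or for calls from other programs☆2,873Jun 14, 2023Updated 2 years ago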
- A Comprehensive Toolkit for High-Quality PDF Content Extraction☆9,562Jan 3, 2025Updated last year
- Langchain-Chatchat (formerly Langchain-ChatGLM): RAG and Agent application based on Langchain and language models such as ChatGLM, Qwen, and Llama☆37,738Nov 10, 2025Updated 5 months ago
- Question and Answer based on Anything.☆13,932Mar 24, 2025Updated last year
- Toolkit for linearizing PDFs for LLM datasets/training☆17,120Mar 25, 2026Updated 2 weeks ago
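- Offline OCR command-line program for Windows that recognizes text in images and outputs results as JSON strings for easy use by other programs. Based on RapidOcrOnnx.☆336Dec 29, 2023Updated 2 years ago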
- ONNX Model Exporter for PaddlePaddle☆911Mar 18, 2026Updated 3 weeks ago
- FastGPT is a knowledge-based platform built on LLMs, offering a comprehensive suite of out-of-the-box capabilities such as data process…☆27,631Updated this week
- RapidOcr onnxruntime inference for Android☆111Apr 17, 2025Updated 11 months ago
- Swap GPT for any LLM by changing a single line of code. Xinference lets you run open-source, speech, and multimodal models on cloud, on-p…☆9,206Updated this week
- A Gemini 2.5 Flash Level MLLM for Vision, Speech, and Full-Duplex Multimodal Live Streaming on Your Phone☆24,322Apr 1, 2026Updated last week
- 📣 Commercial-grade open-source automatic speech recognition library; works out of the box, supports all platforms, and recognizes mixed Chinese and English. A cross-platform implementation of ASR inference. It's based on ONNXRuntime and FunASR. We provide …☆605May 15, 2024Updated last year
- Free Offline OCR: offline Chinese text detection and recognition SDK☆1,377Jan 12, 2026Updated 2 months ago
- 360LayoutAnalysis, a series of document analysis models and datasets developed by 360 AI Research Institute☆306Sep 10, 2024Updated last year
- RAGFlow is a leading open-source Retrieval-Augmented Generation (RAG) engine that fuses cutting-edge RAG with Agent capabilities to creat…☆77,665Updated this week