📄 Awesome OCR toolkits for multiple programming languages, based on ONNX Runtime, OpenVINO, MNN, PaddlePaddle, TensorRT and PyTorch.
☆6,153Mar 19, 2026Updated this week
Alternatives and similar repositories for RapidOCR
Users who are interested in RapidOCR are comparing it to the libraries listed below.
- Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/…☆72,686Updated this week
- rapidocr onnx cpp☆331Mar 25, 2025Updated 11 months ago
- Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model☆8,104Feb 10, 2025Updated last year
- Document orientation classification☆222Feb 3, 2026Updated last month
- 🔥🔥🔥 Java implementation for calling RapidOCR (based on PaddleOCR); works on Mac, Windows, and Linux and supports the latest PP-OCRv4☆556Jun 5, 2024Updated last year
- A collection of the best currently open-source table recognition models, with improved pre-processing and post-processing and models converted to ONNX☆933Aug 3, 2025Updated 7 months ago
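- OCR software, free, open-source, and offline. Supports screenshots and batch image import, PDF document recognition, exclusion of watermarks and headers/footers, and scanning/generating QR codes. Includes built-in libraries for multiple languages.☆42,585Nov 20, 2025Updated 4 months ago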
- A lightweight OCR system rebuilt from PaddleOCR and decoupled from the PaddlePaddle deep learning training framework, with very fast inference☆1,687Nov 1, 2025Updated 4 months ago
- Based on RapidOCR, extract the PDF content☆186Mar 6, 2026Updated 2 weeks ago
- Ultra-lightweight Chinese OCR with vertical text recognition; supports ncnn, mnn, and tnn inference (dbnet (1.8M) + crnn (2.5M) + anglenet (378KB)); total model size only 4.7M☆12,275Aug 14, 2023Updated 2 years ago
- Convert the model in PaddleOCR to ONNX format☆113Jul 15, 2025Updated 8 months ago
- Inference library for sequence-based table recognition algorithms, integrating table recognition algorithms such as PP-Structure and modelscope.☆415Sep 4, 2025Updated 6 months ago
- OCR, layout analysis, reading order, table recognition in 90+ languages☆19,477Mar 1, 2026Updated 3 weeks ago
- PaddleOCR inference in PyTorch. Converted from [PaddleOCR](https://github.com/PaddlePaddle/PaddleOCR)☆1,153Sep 11, 2025Updated 6 months ago
- Analysis of Chinese and English layouts☆269Mar 6, 2026Updated 2 weeks ago
- Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and …☆29,093Dec 5, 2025Updated 3 months ago
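- Offline OCR command-line Windows program for recognizing text in images; outputs results as a JSON string so other programs can call it easily. Provides APIs for various languages. Compiled from PaddleOCR C++.☆1,446Apr 7, 2025Updated 11 months ago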
- Transforms complex documents like PDFs into LLM-ready markdown/JSON for your Agentic workflows.☆56,255Mar 7, 2026Updated 2 weeks ago
- A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity…☆15,243Mar 12, 2026Updated last week
- PyTorch-based OCR toolkit supporting common text detection and recognition algorithms☆1,515Jan 4, 2026Updated 2 months ago
- CnOCR: Awesome Chinese/English OCR Python toolkits based on PyTorch. It comes with 20+ well-trained models for different application scen…☆3,733Feb 7, 2026Updated last month
- DocLayout-YOLO: Enhancing Document Layout Analysis through Diverse Synthetic Data and Global-to-Local Adaptive Perception☆2,059Apr 14, 2025Updated 11 months ago
- A collection of original, innovative ideas and algorithms towards Advanced Literate Machinery. This project is maintained by the OCR Team…☆1,825Updated this week
- High-performance Inference and Deployment Toolkit for LLMs and VLMs based on PaddlePaddle☆3,661Updated this week
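- Open-source, easy-to-use offline Chinese OCR with recognition accuracy comparable to that of major vendors; also provides an easy-to-use web page and web API, convenient for people's everyday work or for calls from other programs☆2,872Jun 14, 2023Updated 2 years ago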
- A Comprehensive Toolkit for High-Quality PDF Content Extraction☆9,484Jan 3, 2025Updated last year
- Langchain-Chatchat (formerly Langchain-ChatGLM): RAG and Agent applications based on Langchain and language models such as ChatGLM, Qwen, and Llama☆37,581Nov 10, 2025Updated 4 months ago
- Question and Answer based on Anything.☆13,887Mar 24, 2025Updated 11 months ago
- Toolkit for linearizing PDFs for LLM datasets/training☆17,043Updated this week
- ONNX Model Exporter for PaddlePaddle☆905Jan 13, 2026Updated 2 months ago
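- Offline OCR command-line Windows program for recognizing text in images; outputs results as a JSON string so other programs can call it easily. Based on RapidOcrOnnx.☆328Dec 29, 2023Updated 2 years ago
- RapidOcr onnxruntime inference for Android☆108Apr 17, 2025Updated 11 months ago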
- FastGPT is a knowledge-based platform built on LLMs that offers a comprehensive suite of out-of-the-box capabilities such as data process…☆27,405Updated this week
- Swap GPT for any LLM by changing a single line of code. Xinference lets you run open-source, speech, and multimodal models on cloud, on-p…☆9,134Updated this week
- A Gemini 2.5 Flash Level MLLM for Vision, Speech, and Full-Duplex Multimodal Live Streaming on Your Phone☆24,144Mar 7, 2026Updated 2 weeks ago
- 📣 Commercial-grade open-source automatic speech recognition library: works out of the box, supports all platforms, and recognizes mixed Chinese and English speech. A cross-platform implementation of ASR inference. It's based on ONNXRuntime and FunASR. We provide …☆601May 15, 2024Updated last year
- Free Offline OCR: an offline Chinese text detection and recognition SDK☆1,377Jan 12, 2026Updated 2 months ago
- 360LayoutAnaylsis, a series of Document Analysis Models and Datasets developed by 360 AI Research Institute☆307Sep 10, 2024Updated last year
- RAGFlow is a leading open-source Retrieval-Augmented Generation (RAG) engine that fuses cutting-edge RAG with Agent capabilities to creat…☆75,590Updated this week