📄 Awesome OCR multiple programing languages toolkits based on ONNXRuntime, OpenVINO, MNN, PaddlePaddle and PyTorch.
☆5,980Feb 13, 2026Updated 2 weeks ago
Alternatives and similar repositories for RapidOCR
Users that are interested in RapidOCR are comparing it to the libraries listed below
Sorting:
- Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/…☆71,369Updated this week
- Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model☆8,084Feb 10, 2025Updated last year
- rapidocr onnx cpp☆323Mar 25, 2025Updated 11 months ago
- 整理目前开源的最优表格识别模型,完善前后处理,模型转换为ONNX | Organize the currently open-source optimal table recognition models, improve pre-processing and post-…☆924Aug 3, 2025Updated 6 months ago
- OCR software, free and offline. 开源、免费的离线OCR软件。支持截屏/批量导入图片,PDF文档识别,排除水印/页眉页脚,扫描/生成二维码。内置多国语言库。☆42,182Nov 20, 2025Updated 3 months ago
- 文档方向分类☆222Feb 3, 2026Updated 3 weeks ago
- 超轻量级中文ocr,支持竖排文字识别, 支持ncnn、mnn、tnn推理 ( dbnet(1.8M) + crnn(2.5M) + anglenet(378KB)) 总模型仅4.7M☆12,274Aug 14, 2023Updated 2 years ago
- 基于PaddleOCR重构,并且脱离PaddlePaddle深度学习训练框架的轻量级OCR,推理速度超快 —— A lightweight OCR system based on PaddleOCR, decoupled from the PaddlePaddle d…☆1,672Nov 1, 2025Updated 4 months ago
- OCR, layout analysis, reading order, table recognition in 90+ languages☆19,360Updated this week
- Based on RapidOCR, extract the PDF content☆185May 7, 2025Updated 9 months ago
- 🔥🔥🔥Java代码实现调用RapidOCR(基于PaddleOCR),适配Mac、Win、Linux,支持最新PP-OCRv4☆547Jun 5, 2024Updated last year
- PaddleOCR inference in PyTorch. Converted from [PaddleOCR](https://github.com/PaddlePaddle/PaddleOCR)☆1,140Sep 11, 2025Updated 5 months ago
- Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and …☆28,999Dec 5, 2025Updated 2 months ago
- A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity…☆14,956Feb 4, 2026Updated 3 weeks ago
- Transforms complex documents like PDFs into LLM-ready markdown/JSON for your Agentic workflows.☆54,870Updated this week
- Convert the model in PaddleOCR to ONNX format☆112Jul 15, 2025Updated 7 months ago
- 基于序列表格识别算法 推理库,集成PP-Structure和modelscope等表格识别算法。☆410Sep 4, 2025Updated 5 months ago
- 基于Pytorch的OCR工具库,支持常用的文字检测和识别算法☆1,513Jan 4, 2026Updated last month
- OCR离线图片文字识别命令行windows程序,以JSON字符串形式输出结果,方便别的程序调用。提供各种语言API。由 PaddleOCR C++ 编译。☆1,435Apr 7, 2025Updated 10 months ago
- CnOCR: Awesome Chinese/English OCR Python toolkits based on PyTorch. It comes with 20+ well-trained models for different application scen…☆3,729Feb 7, 2026Updated 3 weeks ago
- High-performance Inference and Deployment Toolkit for LLMs and VLMs based on PaddlePaddle☆3,653Updated this week
- Analysis of Chinese and English layouts 中英文版面分析☆267Updated this week
- A collection of original, innovative ideas and algorithms towards Advanced Literate Machinery. This project is maintained by the OCR Team…☆1,820Apr 9, 2025Updated 10 months ago
- A Comprehensive Toolkit for High-Quality PDF Content Extraction☆9,402Jan 3, 2025Updated last year
- Question and Answer based on Anything.☆13,859Mar 24, 2025Updated 11 months ago
- DocLayout-YOLO: Enhancing Document Layout Analysis through Diverse Synthetic Data and Global-to-Local Adaptive Perception☆2,017Apr 14, 2025Updated 10 months ago
- Langchain-Chatchat(原Langchain-ChatGLM)基于 Langchain 与 ChatGLM, Qwen 与 Llama 等语言模型的 RAG 与 Agent 应用 | Langchain-Chatchat (formerly langchain…☆37,352Nov 10, 2025Updated 3 months ago
- Toolkit for linearizing PDFs for LLM datasets/training☆16,947Feb 19, 2026Updated last week
- 开源易用的中文离线OCR,识别率媲美大厂,并且提供了易用的web页面及web的接口,方便人类日常工作使用或者其他程序来调用~☆2,861Jun 14, 2023Updated 2 years ago
- Swap GPT for any LLM by changing a single line of code. Xinference lets you run open-source, speech, and multimodal models on cloud, on-p…☆9,072Updated this week
- FastGPT is a knowledge-based platform built on the LLMs, offers a comprehensive suite of out-of-the-box capabilities such as data process…☆27,170Updated this week
- ONNX Model Exporter for PaddlePaddle☆901Jan 13, 2026Updated last month
- A Gemini 2.5 Flash Level MLLM for Vision, Speech, and Full-Duplex Multimodal Live Streaming on Your Phone☆23,942Feb 23, 2026Updated last week
- RAGFlow is a leading open-source Retrieval-Augmented Generation (RAG) engine that fuses cutting-edge RAG with Agent capabilities to creat…☆73,900Updated this week
- SOTA Open Source TTS☆24,983Feb 2, 2026Updated 3 weeks ago
- LLM API 管理 & 分发系统,支持 OpenAI、Azure、Anthropic Claude、Google Gemini、DeepSeek、字节豆包、ChatGLM、文心一言、讯飞星火、通义千问、360 智脑、腾讯混元等主流模型,统一 API 适配,可用于 key …☆29,842Jan 9, 2026Updated last month
- Convert PDF to markdown + JSON quickly with high accuracy☆31,857Feb 9, 2026Updated 2 weeks ago
- OpenMMLab Text Detection, Recognition and Understanding Toolbox☆4,718Nov 27, 2024Updated last year
- A generative speech model for daily dialogue.☆38,766Jan 18, 2026Updated last month