jingsongliujing/OnnxOCR

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/jingsongliujing/OnnxOCR)

jingsongliujing / OnnxOCR

基于PaddleOCR重构，并且脱离PaddlePaddle深度学习训练框架的轻量级OCR，推理速度超快 —— A lightweight OCR system based on PaddleOCR, decoupled from the PaddlePaddle deep learning training framework, with ultra-fast inference speed.

☆1,836

Alternatives and similar repositories for OnnxOCR

Users that are interested in OnnxOCR are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

RapidAI / RapidOCR
View on GitHub
📄 Awesome OCR multiple programing languages toolkits based on ONNX Runtime, OpenVINO, MNN, PaddlePaddle, TensorRT and PyTorch.
☆7,225Updated this week
Ucas-HaoranWei / GOT-OCR2.0
View on GitHub
Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model
☆8,155Feb 10, 2025Updated last year
CKboss / pp_onnx
View on GitHub
pp_ocr_v4's ONNX version
☆25Jun 26, 2024Updated 2 years ago
PaddlePaddle / PaddleOCR
View on GitHub
Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/…
☆85,960Updated this week
RapidAI / RapidLayout
View on GitHub
Analysis of Chinese and English layouts 中英文版面分析
☆275Mar 24, 2026Updated 3 months ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
opendatalab / DocLayout-YOLO
View on GitHub
DocLayout-YOLO: Enhancing Document Layout Analysis through Diverse Synthetic Data and Global-to-Local Adaptive Perception
☆2,233Apr 14, 2025Updated last year
PFCCLab / PPOCRLabel
View on GitHub
PPOCRLabelv3 is a semi-automatic graphic annotation tool suitable for OCR field, with built-in PP-OCR model to automatically detect and r…
☆430Apr 24, 2026Updated 2 months ago
hiroi-sora / Umi-OCR
View on GitHub
OCR software, free and offline. 开源、免费的离线OCR软件。支持截屏/批量导入图片，PDF文档识别，排除水印/页眉页脚，扫描/生成二维码。内置多国语言库。
☆46,176Nov 20, 2025Updated 8 months ago
oomol-lab / pdf-craft
View on GitHub
PDF craft can convert PDF files into various other formats. This project will focus on processing PDF files of scanned books.
☆6,010Jun 27, 2026Updated 3 weeks ago
Yuliang-Liu / MonkeyOCR
View on GitHub
A lightweight LMM-based Document Parsing Model
☆6,605Updated this week
datalab-to / surya
View on GitHub
OCR, layout analysis, reading order, table recognition in 90+ languages
☆21,130Updated this week
shibing624 / imgocr
View on GitHub
Python3 package for Chinese/English OCR,use paddleocr-v5 onnx model(~20MB), with ultra-fast inference speed. 基于ppocr-v5-onnx模型推理，中英文OCR开源…
☆134Apr 11, 2026Updated 3 months ago
hpc203 / PaddleOCR-v3-onnxrun-cpp-py
View on GitHub
使用ONNXRuntime部署PaddleOCR-v3, 包含C++和Python两个版本的程序
☆97Jun 19, 2023Updated 3 years ago
opendatalab / PDF-Extract-Kit
View on GitHub
A Comprehensive Toolkit for High-Quality PDF Content Extraction
☆9,797Jan 3, 2025Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
frotms / PaddleOCR2Pytorch
View on GitHub
PaddleOCR inference in PyTorch. Converted from [PaddleOCR](https://github.com/PaddlePaddle/PaddleOCR)
☆1,197Jul 10, 2026Updated last week
harry0703 / AudioNotes
View on GitHub
快速提取音视频内容，整理成一份结构化的markdown笔记
☆2,198Updated this week
RapidAI / PaddleOCRModelConvert
View on GitHub
Convert the model in PaddleOCR to ONNX format
☆120Jul 15, 2025Updated last year
RapidAI / RapidOcrOnnx
View on GitHub
rapidocr onnx cpp
☆362Mar 25, 2025Updated last year
puhuilab / phocr
View on GitHub
an open high-performance Optical Character Recognition (OCR) toolkit
☆304Jul 24, 2025Updated 11 months ago
WenmuZhou / PytorchOCR
View on GitHub
基于Pytorch的OCR工具库，支持常用的文字检测和识别算法
☆1,520Jan 4, 2026Updated 6 months ago
tabortao / OnnxOCR-UI
View on GitHub
OnnxOCR-UI 是基于 OnnxOCR 的高级批量图片/PDF OCR 识别工具，采用支持 Web、Windows、Linux、Macos 的 python 程序。打造，专为高效、易用和美观的批量文字识别场景设计。
☆16Dec 4, 2025Updated 7 months ago
RapidAI / TableStructureRec
View on GitHub
整理目前开源的最优表格识别模型，完善前后处理，模型转换为ONNX | Organize the currently open-source optimal table recognition models, improve pre-processing and post-…
☆954Aug 3, 2025Updated 11 months ago
opendatalab / MinerU
View on GitHub
Transforms complex documents like PDFs and Office docs into LLM-ready markdown/JSON for your Agentic workflows.
☆75,311Updated this week
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
chatdoc-com / OCRFlux
View on GitHub
OCRFlux is a lightweight yet powerful multimodal toolkit that significantly advances PDF-to-Markdown conversion, excelling in complex lay…
☆2,523Apr 14, 2026Updated 3 months ago
luckycucu / duguang-ocr-onnx
View on GitHub
读光中英文OCR onnx 版本模型使用 | Code for using the ONNX version of DuGuang OCR in both Chinese and English
☆59Nov 22, 2025Updated 7 months ago
nihui / ncnn-android-ppocrv5
View on GitHub
ncnn android paddle ocr v5
☆195May 27, 2026Updated last month
hiroi-sora / PaddleOCR-json
View on GitHub
OCR离线图片文字识别命令行windows程序，以JSON字符串形式输出结果，方便别的程序调用。提供各种语言API。由 PaddleOCR C++ 编译。
☆1,528Apr 7, 2025Updated last year
RapidAI / RapidUnDistort
View on GitHub
修正文档扭曲/模糊/阴影等情况，使用onnx模型简单轻量部署，未来持续跟进最新最好的文档矫正方案和模型,Correct document distortion using a lightweight ONNX model for easy deployment. We wi…
☆105Dec 17, 2025Updated 7 months ago
CosmosShadow / gptpdf
View on GitHub
Using GPT to parse PDF
☆3,559Apr 17, 2025Updated last year
allenai / olmocr
View on GitHub
Toolkit for linearizing PDFs for LLM datasets/training
☆19,151Mar 25, 2026Updated 3 months ago
CVHub520 / X-AnyLabeling
View on GitHub
Effortless data labeling with AI support from Segment Anything and other awesome models.
☆9,824Updated this week
infrost / DeeplxFile
View on GitHub
基于Deeplx和Playwright提供的简单易用，快速，免费，不限制文件大小，支持超长文本翻译，跨平台的文件翻译工具 / Easy-to-use, fast, free, unlimited file size and cross platform file trans…
☆1,069Feb 16, 2025Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
aehyok / video2blog
View on GitHub
视频转图文 AI跨平台客户端（win mac linux）
☆339Oct 11, 2024Updated last year
2noise / ChatTTS
View on GitHub
A generative speech model for daily dialogue.
☆39,652Apr 10, 2026Updated 3 months ago
modelscope / FunASR
View on GitHub
Open-source speech recognition toolkit for training, inference, streaming ASR, VAD, punctuation, speaker diarization pipelines, and OpenA…
☆19,387Updated this week
xxnuo / MTranServer
View on GitHub
Offline translation model server with low resource consumption, fast speed, and private deployment capability. 低资源占用速度快可私有部署的离线翻译模型服务器
☆4,626Mar 8, 2026Updated 4 months ago
NanoNets / docext
View on GitHub
An on-premises, OCR-free unstructured data extraction, markdown conversion and benchmarking toolkit. (https://idp-leaderboard.org/)
☆2,032Mar 17, 2026Updated 4 months ago
Zeyi-Lin / HivisionIDPhotos
View on GitHub
⚡️HivisionIDPhotos: a lightweight and efficient AI ID photos tools. 一个轻量级的AI证件照制作算法。
☆21,300Jul 3, 2026Updated 2 weeks ago
1Panel-dev / MaxKB
View on GitHub
🔥 MaxKB is an open-source platform for building enterprise-grade agents. 强大易用的开源企业级智能体平台。
☆22,164Updated this week