puhuilab/phocr

puhuilab / phocr

An open, high-performance Optical Character Recognition (OCR) toolkit

☆304

Alternatives and similar repositories for phocr

Users who are interested in phocr are comparing it to the libraries listed below.

hpc203 / document-undistort-onnxrun
Deploys document rectification with onnxruntime, covering warped, blurred, and shadowed documents. Includes both C++ and Python versions of the program.
☆16 · Jan 3, 2025 · Updated last year
RapidAI / RapidOCR
📄 Awesome OCR toolkits for multiple programming languages, based on ONNX Runtime, OpenVINO, MNN, PaddlePaddle, TensorRT and PyTorch.
☆7,225 · Updated this week
yyccR / test_ncnn
Tests ncnn C++ algorithms on desktop
☆17 · Jun 15, 2025 · Updated last year
hpc203 / CoupledTPS-opencv-dnn
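Deploys CoupledTPS with OpenCV, including three models: portrait correction, rectangling of images with irregular boundaries, and rotated-image correction. Includes both C++ and Python versions of the program.
☆21 · Jul 4, 2024 · Updated 2 years ago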
hpc203 / DeDoDe-onnxrun-cpp-py
Deploys DeDoDe with ONNXRuntime: "Local feature matching: detect, don't describe; describe, don't detect". Includes both C++ and Python versions of the program.
☆23 · Dec 22, 2023 · Updated 2 years ago
jingsongliujing / OnnxOCR
A lightweight OCR system rebuilt from PaddleOCR and decoupled from the PaddlePaddle deep learning training framework, with very fast inference.
☆1,836 · Jun 11, 2026 · Updated last month
zjd1988 / seetaface2_onnx_model
Contains only face detection, 5/81-point, and face recognition models
☆10 · Jul 9, 2020 · Updated 6 years ago
SWHL / ChineseDocumentPDF
PDF data for Chinese academic papers, securities documents, and financial reports
☆41 · Jun 13, 2024 · Updated 2 years ago
RapidAI / RapidOCRWeb
The web version of RapidOCR
☆25 · Feb 27, 2026 · Updated 4 months ago
hpc203 / MOWA-onnxrun
Deploys MOWA with onnxruntime: a multiple-in-one image warping model that handles 6 image warping tasks. Includes both C++ and Python versions of the program.
☆34 · Jul 7, 2024 · Updated 2 years ago
WorldEditor50 / v4l2camera
☆16 · Mar 24, 2025 · Updated last year
pengzhendong / speaker-diarization
Offline Speaker Diarization with SenseVoice by Sherpa ONNX.
☆15 · Dec 23, 2024 · Updated last year
opendatalab / MinerU
Transforms complex documents like PDFs and Office docs into LLM-ready markdown/JSON for your Agentic workflows.
☆75,311 · Updated this week
NanoNets / docext
An on-premises, OCR-free unstructured data extraction, markdown conversion and benchmarking toolkit. (https://idp-leaderboard.org/)
☆2,032 · Mar 17, 2026 · Updated 4 months ago
hong19860320 / PaddleLite-generic-demo
Paddle Lite classic demo for AI accelerators
☆11 · Feb 27, 2024 · Updated 2 years ago
Ucas-HaoranWei / GOT-OCR2.0
Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model
☆8,155 · Feb 10, 2025 · Updated last year
zjd1988 / seetaface6_onnx_model
☆30 · Mar 27, 2025 · Updated last year
RapidAI / TableStructureRec
Collects the best open-source table recognition models currently available, refines their pre- and post-processing, and converts the models to ONNX.
☆954 · Aug 3, 2025 · Updated 11 months ago
jiangnanboy / Image_KIE_LLM
Key information extraction from images of cards, certificates, and receipts with an LLM (large language model). Basically, it can extract key information from …
☆15 · Jul 22, 2024 · Updated last year
RapidAI / RapidTTS
A lightweight text-to-speech tool for fast local inference. A text-to-speech framework for fast and high-quality speech synthesis.
☆55 · May 29, 2026 · Updated last month
FeiGeChuanShu / ncnn_ppstructure
ppstructure deployment with ncnn
☆36 · Jul 16, 2024 · Updated 2 years ago
Mohamed5341 / opencv-image
This is an extension for Visual Studio Code to display OpenCV images while debugging
☆18 · Sep 5, 2023 · Updated 2 years ago
tanguymagne / UVDoc-Dataset
Code for the paper "UVDoc: Neural Grid-based Document Unwarping" - Dataset capture and creation
☆35 · May 27, 2024 · Updated 2 years ago
intsig-textin / xparse-sdk
To try TextIn document parsing, visit https://cc.co/16YSIy
☆16 · Mar 4, 2025 · Updated last year
thomaszheng / OcrLiteMnn
ChineseOcr Lite Mnn: an ultra-lightweight Chinese OCR PC demo that uses MNN for inference
☆28 · Mar 26, 2021 · Updated 5 years ago
SWHL / TrOCR-Formula-Rec
Trains a small but elegant formula recognition model based on TrOCR and the UniMER-1M dataset
☆30 · Mar 17, 2026 · Updated 4 months ago
AkiraHakuta / antlr4_tex2sym
antlr4_tex2sym parses LaTeX math expressions and converts them into the equivalent SymPy form by using antlr4.
☆11 · Oct 7, 2020 · Updated 5 years ago
BADBADBADBOY / baipiaoOCR
Convert paddleOCR to torchOCR, ppocr-v3, ppocr-v4, onnx, openvino
☆33 · Aug 16, 2023 · Updated 2 years ago
jdh-algo / JoyTTS
☆41 · Jul 15, 2025 · Updated last year
opendatalab / DocLayout-YOLO
DocLayout-YOLO: Enhancing Document Layout Analysis through Diverse Synthetic Data and Global-to-Local Adaptive Perception
☆2,233 · Apr 14, 2025 · Updated last year
carterwayneskhizeine / PaddleOCR-VL_CPU
PaddleOCR CPU version: Windows installation guide
☆30 · Nov 25, 2025 · Updated 7 months ago
Zzz512 / TSD
A dataset for tooth structured instance segmentation of dental panoramic X-ray.
☆15 · May 17, 2024 · Updated 2 years ago
babosa / EasyAACEncoder-arm
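Free component: EasyAACEncoder-arm is a commercial-grade library for transcoding audio to AAC. It currently supports transcoding from G711a/G711u/G726/PCM and other audio formats, and it is cross-platform, supporting Windows (32 & 64), Linux (32 & 64), and ARM. Compared with other ordinary audio transcoding libraries, …
☆21 · Aug 2, 2019 · Updated 6 years ago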
caipeng328 / ForCenNet
☆81 · Jul 31, 2025 · Updated 11 months ago
chatdoc-com / OCRFlux
OCRFlux is a lightweight yet powerful multimodal toolkit that significantly advances PDF-to-Markdown conversion, excelling in complex lay…
☆2,523 · Apr 14, 2026 · Updated 3 months ago
opendatalab / OHR-Bench
(ICCV 2025) OCR Hinders RAG: Evaluating the Cascading Impact of OCR on Retrieval-Augmented Generation
☆104 · Dec 3, 2025 · Updated 7 months ago
BeautyyuYanli / tooluser
Enable tool-use ability for any LLM model (DeepSeek V3/R1, etc.)
☆58 · May 27, 2025 · Updated last year
RapidAI / RapidTableDetection
Detects and extracts table regions from images in various scenarios, and corrects perspective and rotation issues.
☆119 · Dec 10, 2024 · Updated last year
alexw914 / RK_VideoPipe
☆121 · Aug 1, 2024 · Updated last year