PaddlePaddle / PaddleOCRLinks
Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.
☆68,262Updated last week
Alternatives and similar repositories for PaddleOCR
Users that are interested in PaddleOCR are comparing it to the libraries listed below
Sorting:
- 超轻量级中文ocr,支持竖排文字识别, 支持ncnn、mnn、tnn推理 ( dbnet(1.8M) + crnn(2.5M) + anglenet(378KB)) 总模型仅4.7M☆12,256Updated 2 years ago
- Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and …☆28,809Updated last month
- 📄 Awesome OCR multiple programing languages toolkits based on ONNXRuntime, OpenVINO, PaddlePaddle and PyTorch.☆5,676Updated last week
- CnOCR: Awesome Chinese/English OCR Python toolkits based on PyTorch. It comes with 20+ well-trained models for different application scen…☆3,724Updated 4 months ago
- yolo3+ocr☆6,116Updated 3 years ago
- OpenMMLab Text Detection, Recognition and Understanding Toolbox☆4,709Updated last year
- PaddleFormers is an easy-to-use library of pre-trained large language model zoo based on PaddlePaddle.☆12,947Updated this week
- A treasure chest for visual classification and recognition powered by PaddlePaddle☆5,776Updated 2 months ago
- User-friendly AI Interface (Supports Ollama, OpenAI API, ...)☆121,347Updated this week
- Label Studio is a multi-type data labeling and annotation tool with standardized output format☆26,208Updated this week
- Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model☆8,060Updated 11 months ago
- A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity…☆14,597Updated 2 weeks ago
- Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!☆41,415Updated this week
- Tesseract Open Source OCR Engine (main repository)☆72,037Updated 2 weeks ago
- Production-ready platform for agentic workflow development.☆126,441Updated this week
- PArallel Distributed Deep LEarning: Machine Learning Framework from Industrial Practice (『飞桨』核心框架,深度学习&机器学习高性能单机、分布式训练和跨平台部署)☆23,583Updated this week
- OCR, layout analysis, reading order, table recognition in 90+ languages☆19,159Updated 3 months ago
- The all-in-one Desktop & Docker AI application with built-in RAG, AI agents, No-code agent builder, MCP compatibility, and more.☆53,552Updated this week
- Object Detection toolkit based on PaddlePaddle. It supports object detection, instance segmentation, multiple object tracking and real-ti…☆14,033Updated 3 months ago
- Get up and running with OpenAI gpt-oss, DeepSeek-R1, Gemma 3 and other models.☆159,631Updated last week
- The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.☆20,213Updated last month
- 🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production☆44,305Updated last year
- All-in-One Development Tool based on PaddlePaddle☆5,988Updated this week
- 开源易用的中文离线OCR,识别率媲美大厂,并且提供了易用的web页面及web的接口,方便人类日常工作使用或者其他程序来调用~☆2,850Updated 2 years ago
- An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.☆39,375Updated 7 months ago
- Image Polygonal Annotation with Python (polygon, rectangle, circle, line, point and image-level flag annotation).☆15,501Updated this week
- Convert PDF to markdown + JSON quickly with high accuracy☆31,071Updated this week
- ncnn is a high-performance neural network inference framework optimized for the mobile platform☆22,647Updated this week
- Easy-to-use image segmentation library with awesome pre-trained model zoo, supporting wide-range of practical tasks in Semantic Segmentat…☆9,282Updated 2 months ago
- Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)☆65,942Updated last week