PaddlePaddle / PaddleOCRLinks
Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.
☆66,742Updated this week
Alternatives and similar repositories for PaddleOCR
Users that are interested in PaddleOCR are comparing it to the libraries listed below
Sorting:
- 超轻量级中文ocr,支持竖排文字识别, 支持ncnn、mnn、tnn推理 ( dbnet(1.8M) + crnn(2.5M) + anglenet(378KB)) 总模型仅4.7M☆12,243Updated 2 years ago
- Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and …☆28,588Updated 2 weeks ago
- 📄 Awesome OCR multiple programing languages toolkits based on ONNXRuntime, OpenVINO, PaddlePaddle and PyTorch.☆5,513Updated this week
- OpenMMLab Text Detection, Recognition and Understanding Toolbox☆4,697Updated last year
- PaddleFormers is an easy-to-use library of pre-trained large language model zoo based on PaddlePaddle.☆12,949Updated this week
- OCR, layout analysis, reading order, table recognition in 90+ languages☆18,994Updated 2 months ago
- 开源易用的中文离线OCR,识别率媲美大厂,并且提供了易用的web页面及web的接口,方便人类日常工作使用或者其他程序来调用~☆2,847Updated 2 years ago
- Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model☆8,041Updated 10 months ago
- SOTA Open Source TTS☆24,402Updated 3 weeks ago
- LLM API 管理 & 分发系统,支持 OpenAI、Azure、Anthropic Claude、Google Gemini、DeepSeek、字节豆包、ChatGLM、文心一言、讯飞星火、通义千问、360 智脑、腾讯混元等主流模型,统一 API 适配,可用于 key …☆28,592Updated last month
- All-in-One Development Tool based on PaddlePaddle☆5,946Updated this week
- A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity…☆14,035Updated last week
- Easy-to-use and powerful LLM and SLM library with awesome model zoo.☆12,884Updated last week
- Making large AI models cheaper, faster and more accessible☆41,300Updated 2 weeks ago
- The free and privacy-friendly screen recorder with no limits 🎥☆17,721Updated 2 weeks ago
- Object Detection toolkit based on PaddlePaddle. It supports object detection, instance segmentation, multiple object tracking and real-ti…☆13,984Updated 2 months ago
- Transforms complex documents like PDFs into LLM-ready markdown/JSON for your Agentic workflows.☆51,012Updated this week
- Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text fronten…☆12,453Updated 2 months ago
- A treasure chest for visual classification and recognition powered by PaddlePaddle☆5,767Updated last month
- FastGPT is a knowledge-based platform built on the LLMs, offers a comprehensive suite of out-of-the-box capabilities such as data process…☆26,627Updated this week
- Tesseract Open Source OCR Engine (main repository)☆71,496Updated last week
- Label Studio is a multi-type data labeling and annotation tool with standardized output format☆25,877Updated last week
- Production-ready platform for agentic workflow development.☆122,824Updated this week
- State-of-the-art 2D and 3D Face Analysis Project☆27,354Updated last month
- A Lightweight Face Recognition and Facial Attribute Analysis (Age, Gender, Emotion and Race) Library for Python☆21,336Updated this week
- A generative speech model for daily dialogue.☆38,359Updated 3 weeks ago
- OCR & Document Extraction using vision models☆11,992Updated 7 months ago
- A browser extension for automating your browser by connecting blocks☆20,829Updated 2 months ago
- 中英文敏感词、语言检测、中外手机/电话归属地/运营商查询、名字推断性别、手机号抽取、身份证抽取、邮箱抽取、中日文人名库、中文缩写库、拆字词典、词汇情感值、停用词、反动词表、暴恐词表、繁简体转换、英文模拟中文发音、汪峰歌词生成器、职业名称词库、同义词库、反义词库、否定词库、汽…☆77,962Updated last year
- 🔥🔥🔥AI-driven database tool and SQL client, The hottest GUI client, supporting MySQL, Oracle, PostgreSQL, DB2, SQL Server, DB2, SQLite,…☆24,839Updated this week