PaddlePaddle / PaddleOCRLinks
Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.
☆70,442Updated this week
Alternatives and similar repositories for PaddleOCR
Users that are interested in PaddleOCR are comparing it to the libraries listed below
Sorting:
- 超轻量级中文ocr,支持竖排文字识别, 支持ncnn、mnn、tnn推理 ( dbnet(1.8M) + crnn(2.5M) + anglenet(378KB)) 总模型仅4.7M☆12,266Updated 2 years ago
- Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and …☆28,903Updated 2 months ago
- 📄 Awesome OCR multiple programing languages toolkits based on ONNXRuntime, OpenVINO, PaddlePaddle and PyTorch.☆5,858Updated last week
- CnOCR: Awesome Chinese/English OCR Python toolkits based on PyTorch. It comes with 20+ well-trained models for different application scen…☆3,728Updated 4 months ago
- yolo3+ocr☆6,119Updated 3 years ago
- All-in-One Development Tool based on PaddlePaddle☆6,011Updated last week
- Transforms complex documents like PDFs into LLM-ready markdown/JSON for your Agentic workflows.☆53,776Updated last week
- OpenMMLab Text Detection, Recognition and Understanding Toolbox☆4,714Updated last year
- Object Detection toolkit based on PaddlePaddle. It supports object detection, instance segmentation, multiple object tracking and real-ti…☆14,062Updated 4 months ago
- LLM API 管理 & 分发系统,支持 OpenAI、Azure、Anthropic Claude、Google Gemini、DeepSeek、字节豆包、ChatGLM、文心一言、讯飞星火、通义千问、360 智脑、腾讯混元等主流模型,统一 API 适配,可用于 key …☆29,549Updated last month
- A visual no-code/code-free web crawler/spider易采集:一个可视化浏览器自动化测试/数据采集/爬虫软件,可以无代码图形化的设计和执行爬虫任务。别名:ServiceWrapper面向Web应用的智能化服务封装系统。☆44,042Updated last week
- 一款轻量级、高性能、功能强大的内网穿透代理服务器。支持tcp、udp、socks5、http等几乎所有流量转发,可用来访问内网网站、本地支付接口调试、ssh访问、远程桌面,内网dns解析、内网socks5代理等等……,并带有功能强大的web管理端。a lightweight…☆33,897Updated last year
- 🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming☆63,818Updated 2 weeks ago
- Tesseract Open Source OCR Engine (main repository)☆72,268Updated last month
- 中英文敏感词、语言检测、中外手机/电话归属地/运营商查询、名字推断性别、手机号抽取、身份证抽取、邮箱抽取、中日文人名库、中文缩写库、拆字词典、词汇情感值、停用词、反动词表、暴恐词表、繁简体转换、英文模拟中文发音、汪峰歌词生成器、职业名称词库、同义词库、反义词库、否定词库、汽…☆78,868Updated last year
- PaddleFormers is an easy-to-use library of pre-trained large language model zoo based on PaddlePaddle.☆12,951Updated last week
- GitHub中文排行榜,各语言分设「软 件 | 资料」榜单,精准定位中文好项目。各取所需,高效学习。☆105,982Updated last year
- Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model☆8,073Updated last year
- Milvus is a high-performance, cloud-native vector database built for scalable vector ANN search☆42,678Updated this week
- YOLOv5 🚀 in PyTorch > ONNX > CoreML > TFLite☆56,794Updated last week
- Mirror of https://git.ffmpeg.org/ffmpeg.git☆56,946Updated this week
- ⚡️HivisionIDPhotos: a lightweight and efficient AI ID photos tools. 一个轻量级的AI证件照制作算法。☆20,675Updated 6 months ago
- 🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal model…☆156,173Updated this week
- Label Studio is a multi-type data labeling and annotation tool with standardized output format☆26,353Updated this week
- FastGPT is a knowledge-based platform built on the LLMs, offers a comprehensive suite of out-of-the-box capabilities such as data process…☆27,076Updated this week
- 🔮 ChatGPT Desktop Application (Mac, Windows and Linux)☆54,393Updated last year
- AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus o…☆181,673Updated this week
- OCR software, free and offline. 开源、免费的离线OCR软件。支持截屏/批量导入图片,PDF文档识别,排除水印/页眉页脚,扫描/生成二维码。内置多国语言库。☆41,897Updated 2 months ago
- ✨ Light and Fast AI Assistant. Support: Web | iOS | MacOS | Android | Linux | Windows☆87,242Updated 2 months ago
- OCR, layout analysis, reading order, table recognition in 90+ languages☆19,228Updated this week