RapidAI / RapidOCR
Awesome OCR toolkits for multiple programming languages, based on ONNXRuntime, OpenVINO and PaddlePaddle. (Converts the PaddleOCR models and runs inference with ONNXRuntime, which is very fast.)
☆2,795 Updated last week
Related projects:
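- Offline command-line OCR program for Windows that recognizes text in images and outputs the results as a JSON string, making it easy for other programs to call. Provides APIs for various languages. Compiled from PaddleOCR C++. ☆897 Updated 3 weeks ago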
- CnOCR: Awesome Chinese/English OCR Python toolkits based on PyTorch. It comes with 20+ well-trained models for different application scen… ☆3,170 Updated 2 months ago
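- Free Offline OCR: an offline SDK for Chinese text detection and recognition. ☆1,258 Updated 3 months ago
- Open-source, easy-to-use offline Chinese OCR with recognition accuracy comparable to that of major vendors. It also provides an easy-to-use web page and web API, convenient for everyday use by people or for calls from other programs. ☆2,584 Updated last year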
- A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity… ☆5,939 Updated this week
- CnSTD: a Python 3 package based on PyTorch/MXNet for Chinese/English Scene Text Detection, Mathematical Formula Detection (MFD), and Layout Analysis. ☆672 Updated 2 months ago
- A PyTorch-based OCR toolkit that supports commonly used text detection and recognition algorithms. ☆1,364 Updated 2 weeks ago
- Collaboration with wangxupeng (https://github.com/wangxupeng) ☆1,793 Updated last week
- yolo3+ocr ☆5,916 Updated 2 years ago
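- Ultra-lightweight Chinese OCR that supports vertical text recognition and ncnn, mnn, and tnn inference (dbnet (1.8M) + crnn (2.5M) + anglenet (378KB)); the total model size is only 4.7M. ☆11,728 Updated last year
- Offline command-line OCR program for Windows that recognizes text in images and outputs the results as a JSON string, making it easy for other programs to call. Based on RapidOcrOnnx. ☆172 Updated 8 months ago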
- Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model ☆2,489 Updated this week
- Multilingual Voice Understanding Model ☆2,625 Updated 2 weeks ago
- darknet text detect and darknet cnn ocr ☆1,136 Updated 2 years ago
- PaddleOCR inference in PyTorch. Converted from [PaddleOCR](https://github.com/PaddlePaddle/PaddleOCR) ☆851 Updated 3 weeks ago
- A collection of original, innovative ideas and algorithms towards Advanced Literate Machinery. This project is maintained by the OCR Team… ☆1,352 Updated last week
- A cross-platform video structuring (video analysis) framework. If you find it helpful, please give it a star :) … ☆1,348 Updated 3 weeks ago
- [In closed beta] QPT: a Python-to-EXE tool (Python packaging) dedicated to helping open-source projects better reach the internet world. ☆726 Updated 4 months ago
- GLM-4 series: Open Multilingual Multimodal Chat LMs ☆4,614 Updated last week
- Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability. ☆4,768 Updated last week
- ⚡️An Easy-to-use and Fast Deep Learning Model Deployment Toolkit for ☁️Cloud 📱Mobile and 📹Edge. Including Image, Video, Text and Audio … ☆2,915 Updated 2 weeks ago
- rapidocr onnx cpp ☆153 Updated 2 weeks ago
- [ECCV 2024] Official code implementation of Vary: Scaling Up the Vision Vocabulary of Large Vision Language Models. ☆1,742 Updated 2 weeks ago
- ☆775 Updated 4 months ago
- Open-source, accurate and easy-to-use video speech recognition & clipping tool, with LLM-based AI clipping integrated. ☆3,267 Updated 3 weeks ago
- An ending and a new beginning ☆928 Updated 10 months ago
- A series of large language models developed by Baichuan Intelligent Technology