gumblex / tessdata_chi
Retrained Tesseract OCR model for Chinese
☆94Updated 2 years ago
Related projects: ⓘ
- Based on RapidOCR, extract the PDF content.☆126Updated 3 weeks ago
- 【间隙·树·排序算法】 对OCR结果或PDF提取的文本进行版面分析,按人类阅读顺序进行排序。☆91Updated 6 months ago
- 手写文字擦除第1名方案,水印智能消除赛第1名☆89Updated 4 months ago
- 版面分析 | 表格识别 | 文档方向分类☆185Updated last month
- Python bindings for WPS Office RPC (for Linux)☆219Updated last year
- CnSTD: 基于 PyTorch/MXNet 的 中文/英文 场景文字检测(Scene Text Detection)、数学公式检测(Mathematical Formula Detection, MFD)、篇章分析(Layout Analysis)的Python3 包☆672Updated 2 months ago
- PDF OCR Application, adds an OCR text layer to scanned PDF files, allowing them to be copied and searched.☆55Updated 9 months ago
- transformers ocr for chinese☆339Updated last year
- 整理目前开源的表格识别模型,完善前后处理,模型转换为ONNX☆149Updated this week
- Analysis of Chinese and English layouts 中英文版面分析☆94Updated 2 months ago
- 开源的中英文离线 OCR,使用 PaddleOCR 实现,提供了简单的 Web 页面及接口☆116Updated 2 years ago
- “Dive Into OCR” is a textbook developed by the PaddleOCR community that integrates OCR theory and practice.☆205Updated last year
- OCR离线图片文字识别命令行windows程序,以JSON字符串形式输出结果,方便别的程序调用。基于 RapidOcrOnnx 。☆172Updated 8 months ago
- 一个简易的mdx词典,支持中英文☆51Updated 4 months ago
- Convert the model in PaddleOCR to ONNX format☆57Updated 3 weeks ago
- rapidocr onnx cpp☆153Updated 3 weeks ago
- A simple way to deploy PaddleOCR based on FastAPI. (PaddleOCR 的 FastAPI 快速部署方案)☆77Updated 2 months ago
- A carefully-designed OCR pipeline for universal boarded table recognition and reconstruction.☆168Updated last year
- 通过paddle ocr实现pdf转markdown☆49Updated 3 months ago
- ☆564Updated 3 weeks ago
- 中文汉语拼音辞典,汉字拼音字典,词典,成语词典,常用字、多音字字典数据库☆442Updated 8 months ago
- 一个多语言支持、易使用的 OCR 项目。An easy-to-use OCR project with multilingual support.☆118Updated 2 years ago
- PPOCRLabel is a semi-automatic graphic annotation tool suitable for OCR field, with built-in PP-OCR model to automatically detect and re-…☆184Updated last year
- 关于本地离线翻译程序,支持文本翻译,下划线翻译,屏幕截图翻译,语音(音频文件)翻译,视频翻译,txt文件,PPT,Word,PDF,Excel,图片翻译。资源☆142Updated 7 months ago
- 基于transformer的ocr识别,在公章(印章识别, seal recognition)拓展应用☆123Updated 3 months ago
- PDF 批量翻译,翻译后的PDF格式基本不变。导出PDF和Docx。优化并精简了来自于QPromise 的 EasyTrans。优化了通过百度翻译API稳定进行长翻译!☆126Updated 7 months ago
- ✅Deploy PaddleOCR with flask | 利用Flask对PaddleOCR进行部署,方便调用☆39Updated 2 years ago
- Fast integer versions of trained LSTM models☆472Updated last month
- OCR离线图片文字识别命令行windows程序,以JSON字符串形式输出结果,方便别的程序调用。提供各 种语言API。由 PaddleOCR C++ 编译。☆897Updated 3 weeks ago
- 图片搜索引擎,很简单。三步构建属于你自己的图片搜索引擎,掌握向量数据库和以图搜图、文本搜索图片。☆104Updated 9 months ago