tesseract-ocr / tessdata
Trained models with fast variant of the "best" LSTM models + legacy models
☆7,262Updated last year
Alternatives and similar repositories for tessdata
Users who are interested in tessdata compare it to the libraries listed below.
- Best (most accurate) trained LSTM models.☆1,463Updated last year
- Source training data for Tesseract for lots of languages☆860Updated 7 months ago
- Tesseract Open Source OCR Engine (main repository)☆71,044Updated last month
- Fast integer versions of trained LSTM models☆580Updated last year
- Tesseract documentation☆2,225Updated 2 months ago
- Tesseract Open Source OCR Engine (main repository)☆3,901Updated last month
- A Python wrapper for Google Tesseract☆6,266Updated this week
- Leptonica is an open source library containing software that is broadly useful for image processing and image analysis applications. The …☆1,982Updated this week
- Links to awesome OCR projects☆3,065Updated last year
- Train Tesseract LSTM with make☆704Updated 7 months ago
- yolo3+ocr☆6,109Updated 3 years ago
- Various documents related to Tesseract OCR☆267Updated 4 years ago
- Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and …☆28,424Updated last year
- Line based ATR Engine based on OCRopy☆1,171Updated 6 months ago
- Box editor and trainer for Tesseract OCR☆248Updated 5 months ago
- Python-based tools for document analysis and OCR☆3,467Updated 4 years ago
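- Open-source, easy-to-use offline Chinese OCR with recognition accuracy comparable to major vendors; it provides an easy-to-use web page and web API for everyday use by people or for calls from other programs☆2,833Updated 2 years ago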
- A Gtk/Qt front-end to tesseract-ocr.☆1,868Updated 2 months ago
- ABBYY Cloud OCR SDK☆525Updated 2 years ago
- CnOCR: Awesome Chinese/English OCR Python toolkits based on PyTorch. It comes with 20+ well-trained models for different application scen…☆3,694Updated 2 months ago
- Make a better Chinese character recognition OCR than Tesseract☆1,514Updated 8 years ago
- A curated list of promising OCR resources☆1,693Updated 3 years ago
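- Free Offline OCR: offline Chinese text detection + recognition SDK☆1,371Updated 2 weeks ago
- OFD Reader & Writer: an open-source OFD processing library that supports document generation, digital signatures, document protection, document merging, conversion, export, and more. The document format follows GB/T 33190-2016, "Electronic files storage and exchange formats: Fixed layout documents".☆1,676Updated 2 weeks ago
- Ultra-lightweight Chinese OCR with vertical text recognition; supports ncnn, mnn, and tnn inference (dbnet (1.8M) + crnn (2.5M) + anglenet (378KB)); total model size is only 4.7M☆12,239Updated 2 years ago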
- Open source Python library for converting PDF to DOCX.☆3,184Updated 6 months ago
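- ID card front and back recognition, ID card scan recognition, and second-generation ID card OCR; high-speed OCR reads all information on both sides of the card. Works offline with no network connection and scans within seconds.☆1,100Updated 2 years ago
- ID card recognition OCR☆490Updated 2 years ago
- C-OCR is Ctrip's in-house OCR project, mainly covering recognition of travel-related documents and materials such as ID cards, passports, train tickets, and visas. The project has four parts: rejection, detection, recognition, and post-processing.☆2,463Updated last year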
- A python module that wraps the pdftoppm utility to convert PDF to PIL Image object☆1,911Updated last year