tleyden / open-ocr
Run your own OCR-as-a-Service using Tesseract and Docker
☆1,349Updated last year
Alternatives and similar repositories for open-ocr:
Users that are interested in open-ocr are comparing it to the libraries listed below
- Python-based tools for document analysis and OCR☆3,436Updated 3 years ago
- A small C++ implementation of LSTM networks, focused on OCR.☆824Updated 5 years ago
- Links to awesome OCR projects☆2,902Updated 7 months ago
- A post-processing tool for scanned sheets of paper.☆1,065Updated 7 months ago
- Text page dewarping using a "cubic sheet" model☆1,455Updated last year
- Drop-in replacement for wkhtmltopdf built on Go, Electron and Docker☆2,251Updated last year
- A supermarket receipt parser written in Python using tesseract OCR☆827Updated 5 months ago
- ABBYY Cloud OCR SDK☆510Updated last year
- Neural network OCR.☆1,128Updated 8 years ago
- OCR engine for all the languages☆791Updated this week
- Universal Office Converter - Convert between any document format supported by LibreOffice/OpenOffice.☆2,672Updated last year
- Tools for manipulating and evaluating the hOCR format for representing multi-lingual OCR results by embedding them into HTML.☆382Updated 6 months ago
- Line based ATR Engine based on OCRopy☆1,118Updated 3 months ago
- A simple OCR API server, seriously easy to be deployed by Docker, on Heroku as well☆721Updated 3 years ago
- Sync data between persistence engines, like ETL only not stodgy☆1,446Updated last year
- Various documents related to Tesseract OCR☆265Updated 3 years ago
- Detect text blocks and OCR poorly scanned PDFs in bulk. Python module available via pip.☆1,272Updated 4 years ago
- Job server in Go☆1,519Updated 6 years ago
- A curated list of promising OCR resources☆1,670Updated 2 years ago
- smartcrop finds good image crops for arbitrary crop sizes☆1,828Updated last year
- Detect and fix skew in images containing text☆263Updated 5 years ago
- Converts PDF, DOC, DOCX, XML, HTML, RTF, etc to plain text☆1,663Updated 7 months ago
- Golang Natural Language Processing☆829Updated last year
- Source training data for Tesseract for lots of languages☆845Updated 11 months ago
- A curated list of awesome projects to simplify and improve paper and document scanning.☆428Updated 2 weeks ago
- Mapping photos of Old New York☆288Updated 2 months ago
- Scalable reverse image search built on Kubernetes and Elasticsearch☆1,250Updated 4 years ago
- A simple, higher level interface for Go web scraping.☆1,510Updated 8 years ago
- 🤖 A Node queue API for generating PDFs using headless Chrome. Comes with a CLI, S3 storage and webhooks for notifying subscribers about …☆2,626Updated 11 months ago
- Polite, slim and concurrent web crawler.☆2,039Updated 3 years ago