tleyden / open-ocr
Run your own OCR-as-a-Service using Tesseract and Docker
☆1,342 · Updated last year
Related projects
Alternatives and complementary repositories for open-ocr
- Drop-in replacement for wkhtmltopdf built on Go, Electron and Docker ☆2,258 · Updated last year
- Python-based tools for document analysis and OCR ☆3,423 · Updated 3 years ago
- Tools for manipulating and evaluating the hOCR format for representing multi-lingual OCR results by embedding them into HTML. ☆371 · Updated 3 months ago
- A simple, higher level interface for Go web scraping. ☆1,513 · Updated 7 years ago
- Detect text blocks and OCR poorly scanned PDFs in bulk. Python module available via pip. ☆1,273 · Updated 3 years ago
- A post-processing tool for scanned sheets of paper. ☆1,038 · Updated 4 months ago
- Sync data between persistence engines, like ETL only not stodgy ☆1,451 · Updated last year
- Job server in Go ☆1,521 · Updated 6 years ago
- Uber tiny Docker images for all the things. ☆1,592 · Updated 2 years ago
- Scan, index, and archive all of your paper documents (acquired by Mayan EDMS) ☆2,559 · Updated 5 years ago
- Neural network OCR. ☆1,129 · Updated 8 years ago
- A simple python OCR engine using opencv ☆525 · Updated 9 months ago
- smartcrop finds good image crops for arbitrary crop sizes ☆1,820 · Updated last year
- 💎 GUI for Data Modeling with Elasticsearch ☆665 · Updated 7 years ago
- tools for working with streams of data ☆1,312 · Updated last year
- The open source PaaS for Kubernetes. ☆1,302 · Updated 4 years ago
- Personal document manager (Linux/Windows) -- Moved to Gnome's Gitlab ☆2,431 · Updated 6 years ago
- Self-hosted document converting service with HTTP API ☆251 · Updated 6 years ago
- ABBYY Cloud OCR SDK ☆504 · Updated last year
- Your own local SMS gateway in Go ☆1,452 · Updated 3 years ago
- A simple and flexible web crawler that follows the robots.txt policies and crawl delays. ☆787 · Updated 3 years ago
- Time Series Alerting Framework ☆3,404 · Updated 4 months ago
- Golang Natural Language Processing ☆832 · Updated last year
- A simple OCR API server, seriously easy to be deployed by Docker, on Heroku as well ☆704 · Updated 3 years ago
- Image recognition open source index and search engine ☆619 · Updated last year
- A platform for backing crowdsourcing websites, built in golang for elasticsearch ☆362 · Updated 4 years ago
- A supermarket receipt parser written in Python using tesseract OCR ☆815 · Updated 2 months ago
- Scalable reverse image search built on Kubernetes and Elasticsearch ☆1,248 · Updated 4 years ago
- Links to awesome OCR projects ☆2,826 · Updated 4 months ago