guitarmind / tesseract-web-service
An implementation of RESTful web service for tesseract-OCR using tornado
☆135 Updated last year
Alternatives and similar repositories for tesseract-web-service:
Users who are interested in tesseract-web-service are comparing it to the libraries listed below.
- A small framework taking over the manual training process described in the Tesseract3 Wiki: https://code.google.com/p/tesseract-ocr/wiki/… ☆130 Updated last year
- This is a tutorial on getting OCR running on a simple web server, using python, flask, tesseract-ocr, and leptonica ☆258 Updated 4 years ago
- Tools for manipulating and evaluating the hOCR format for representing multi-lingual OCR results by embedding them into HTML. ☆382 Updated 6 months ago
- Various documents related to Tesseract OCR ☆265 Updated 3 years ago
- A node.js library for extracting data from scanned forms. ☆117 Updated 2 years ago
- Collaborative Whiteboard ☆91 Updated 9 years ago
- Detect and fix skew in images containing text ☆263 Updated 5 years ago
- Tooling to extract data from scanned paper forms OCR-ed by Tesseract using the HOCR standard. ☆84 Updated 8 years ago
- Tesseract 4 OCR Compilation - Docker Container ☆54 Updated 2 years ago
- Repository collecting all the submodules for the new PyTorch-based OCR System. ☆141 Updated 3 years ago
- OCR evaluation brought to you by University of Alicante ☆67 Updated 2 years ago
- ABBYY Cloud OCR SDK ☆510 Updated last year
- A simple viewer and inspection tool for text boxes in PDF documents ☆94 Updated 2 years ago
- HTML5 collaborative whiteboard ☆65 Updated 3 years ago
- A web-based editor for Tesseract box files ☆28 Updated 10 years ago
- Additional opennlp mapping type for elasticsearch in order to perform named entity recognition ☆136 Updated 8 years ago
- Content Based Image Retrieval Plugin for Elasticsearch. It allows users to index images and search for similar images. ☆408 Updated 8 years ago
- A semi-automatic open-source tool for Layout Analysis and Region EXtraction on early printed books. ☆184 Updated 2 months ago
- Demos for the limdu.js package ☆18 Updated 2 years ago
- ImageCat is an Apache OODT RADIX application that uses Apache Solr, Apache Tika and Apache OODT to ingest 10s of millions of files (image… ☆95 Updated 6 years ago
- Node PDF Extract ☆388 Updated last year
- Free open-source OCR application for the Windows Store - A modern GUI front-end for the Microsoft OCR library. The application also inclu… ☆153 Updated 9 years ago
- Mapping photos of Old New York ☆288 Updated 2 months ago
- Tesseract documentation ☆75 Updated 3 years ago
- ☆49 Updated 2 years ago
- gathering point for open source OCR scripts and diffs ☆43 Updated 10 years ago
- FacetView is a pure javascript frontend for ElasticSearch. ☆291 Updated 9 years ago
- Apache Tika bridge for Node.js. Text and metadata extraction, language detection and more. ☆142 Updated last year
- HTML5 Customizable Reader & Admin Console - Librelio Digital Publishing Suite ☆29 Updated 9 years ago
- A simple program to extract the text from an image before performing OCR ☆221 Updated 4 years ago