tmbarchive / docker-ocropus
A small Docker built for the OCRopus OCR system.
☆19Updated 7 years ago
Alternatives and similar repositories for docker-ocropus:
Users that are interested in docker-ocropus are comparing it to the libraries listed below
- Part of eMOP: Franken+ tool for creating font training for Tesseract OCR engine from page images.☆24Updated 9 years ago
- Docker container to provide Apache Tika RESTful API☆40Updated 8 years ago
- Google Refine extension for adding columns (extending data) from DBpedia☆39Updated 11 years ago
- Experiments mining image collections using OpenCV☆64Updated 9 years ago
- A text analysis interface for the humanities☆27Updated 13 years ago
- Presentations, tutorials and data for the OCR workshop at LMU☆17Updated 7 years ago
- gathering point for open source OCR scripts and diffs☆43Updated 10 years ago
- Next generation OCR engine based on LSTMs.☆52Updated 6 years ago
- code to remove "noise" from hOCR output of Tesseract OCR.☆14Updated 8 years ago
- JSON schemas for OpenCorporates data☆19Updated 7 months ago
- Tribe extracts a network from an email mbox and writes it to a graphml file for visualization and analysis.☆78Updated last year
- An expandable and scalable OCR pipeline☆87Updated 7 years ago
- Convert a corpus of PDF to clean text files on a distributed architecture☆38Updated 10 months ago
- This version of Rhizomer is archived, the current version is linked from:☆14Updated 6 years ago
- Serapis is a sentence identifier and modeling pipeline / built for Wordnik☆24Updated 8 years ago
- A simple PDF transcription project for PyBossa☆19Updated 9 years ago
- A workflow system for Natural Language Processing.☆21Updated 5 years ago
- [DEPRECATED] Please use https://github.com/frictionlessdata/specs☆17Updated 7 years ago
- Named-Entity Recognition extension for Google Refine / OpenRefine☆72Updated 7 years ago
- Monitor datasets, gets alerts when something happens☆210Updated 6 years ago
- Django framework for crowdsourcing complex tasks using MTurk☆64Updated 13 years ago
- Browser add-on and web server to support collection and analysis of web browsing data.☆13Updated 8 years ago
- Topic modeling web application☆40Updated 9 years ago
- ☆13Updated 10 years ago
- A simple proxy web service in 19 lines of Python code.☆23Updated 10 years ago
- Execute OpenRefine JSON scripts without OpenRefine (or Java)☆29Updated 2 years ago
- Scraper built with Scrapy.☆14Updated 5 months ago
- ☆36Updated last year