artunit / ossocrLinks
gathering point for open source OCR scripts and diffs
☆43Updated 11 years ago
Alternatives and similar repositories for ossocr
Users that are interested in ossocr are comparing it to the libraries listed below
Sorting:
- Experiments mining image collections using OpenCV☆64Updated 10 years ago
- Training files produced for and by the Tesseract OCR engine for work on the Early Modern OCR Project (eMOP)☆37Updated 10 years ago
- A backend store for the Annotator☆180Updated 9 years ago
- A node.js library for extracting data from scanned forms.☆117Updated 2 years ago
- Ocular is a state-of-the-art historical OCR system.☆265Updated last year
- code to remove "noise" from hOCR output of Tesseract OCR.☆14Updated 9 years ago
- Named-Entity Recognition extension for Google Refine / OpenRefine☆73Updated 8 years ago
- A small Docker built for the OCRopus OCR system.☆19Updated 7 years ago
- Polytonic Greek OCR engine derived from Gamera and based on the work of Dalitz and Brandt☆32Updated 10 years ago
- ☆185Updated 6 years ago
- Presentations, tutorials and data for the OCR workshop at LMU☆16Updated 8 years ago
- Keeps a mirror of DBpedia live in sync☆26Updated 4 years ago
- A MongoDB implementation of the W3C Web Annotation Protocol☆18Updated 3 years ago
- Mapping photos of Old New York☆293Updated 11 months ago
- ☆29Updated 8 years ago
- Manuals, lexica, OCR test data for PoCoTo and the profiler☆15Updated 4 years ago
- Docker container to provide Apache Tika RESTful API☆41Updated 9 years ago
- XProc E-Book Processor☆15Updated 13 years ago
- An expandable and scalable OCR pipeline☆87Updated 7 years ago
- A Relaxed Schema Graph Database Management System☆52Updated 5 years ago
- SKOS analysis for Elasticsearch☆54Updated 9 years ago
- Web application for transcribing OCR ground truth from Archive.org☆17Updated 7 years ago
- KEA 5.0 (keyphrase extraction software), modified to be an XML-RPC service☆42Updated 14 years ago
- Structured Data from PDF image-based files☆89Updated 12 years ago
- xquerydoc - generate XQuery API documentation from your source code comments☆38Updated 8 years ago
- Named Entities Recognition Annotator Tool for Europeana Newspapers☆61Updated 7 years ago
- Tools for text tokenization and encoding☆84Updated 4 years ago
- Project to digitize avant-garde periodicals☆12Updated 3 years ago
- Tooling to extract data from scanned paper forms OCR-ed by Tesseract using the HOCR standard.☆84Updated 9 years ago
- Node modules for working with the IIIF Image API☆15Updated 9 years ago