concordusapps / python-hocr
HOCR manipulation and utility library; provides hocr2pdf binary.
☆15Updated 7 years ago
Alternatives and similar repositories for python-hocr:
Users that are interested in python-hocr are comparing it to the libraries listed below
- HOCR Specification Python Parser☆13Updated 9 years ago
- PDF Extraction Toolkit☆41Updated 4 years ago
- Python bindings to the Compact Language Detector☆33Updated 4 years ago
- Extract, parse and populate templates from strings☆27Updated 6 years ago
- Experiments mining image collections using OpenCV☆64Updated 9 years ago
- 'ocr-evaluation-tools' from http://ancientgreekocr.org/. Tools to test OCR accuracy.☆22Updated 7 years ago
- ☆16Updated 10 years ago
- Build tables of information by extracting facts from indexed text corpora via a simple and effective query language.☆56Updated 5 years ago
- Extract place names from a URL or text, and add context to those names -- for example distinguishing between a country, region or city.☆62Updated 8 years ago
- Named Entities Recognition Annotator Tool for Europeana Newspapers☆60Updated 7 years ago
- Version 1.0 of the CrowdTruth Framework for crowdsourcing ground truth data, for training and evaluation of cognitive computing systems. …☆61Updated 6 years ago
- Entity linker for the newspaper collection of the National Library of the Netherlands. Links named entity mentions to DBpedia description…☆11Updated 2 years ago
- Relatively simple text classification powered by spaCy☆41Updated 9 years ago
- An expandable and scalable OCR pipeline☆87Updated 7 years ago
- A tool for semantic relation extraction. The program finds pairs of semantically related words based on the text definitions coming from …☆26Updated 10 years ago
- WebAnnotator is a tool for annotating Web pages. WebAnnotator is implemented as a Firefox extension (https://addons.mozilla.org/en-US/fi…☆48Updated 3 years ago
- A compound splitter based on the semantic regularities in the vector space of word embeddings.☆16Updated 8 years ago
- Semanticizest: dump parser and client☆20Updated 8 years ago
- Hidden alignment conditional random field for classifying string pairs.☆36Updated 7 years ago
- A disk-based key/value store in Python with no dependencies.☆21Updated 9 years ago
- Python bindings for libwapiti☆67Updated 5 years ago
- Development version of ndlstm, multidimensional LSTMs for TensorFlow☆19Updated 7 years ago
- Presentations, tutorials and data for the OCR workshop at LMU☆17Updated 7 years ago
- OCRopus model for Gothic print (Fraktur)☆18Updated 5 years ago
- The CIS OCR PostCorrectionTool☆42Updated 2 years ago
- An index data structure for approximate string search.☆23Updated 5 years ago
- Sentiment analysis made easy; built on top off solid libraries.☆24Updated 8 years ago
- A pure python implementation of locality sensitive hashing for text documents☆85Updated 9 years ago
- Python utilities for detecting textual reuse☆21Updated 9 years ago
- Crop And Splice Segments (of scanned pages)☆14Updated 6 years ago