danvk / boxeditLinks
A web-based editor for Tesseract box files
☆27Updated 10 years ago
Alternatives and similar repositories for boxedit
Users that are interested in boxedit are comparing it to the libraries listed below
Sorting:
- Next generation OCR engine based on LSTMs.☆52Updated 7 years ago
- An expandable and scalable OCR pipeline☆87Updated 7 years ago
- Recognition Models for Kraken and CLSTM☆14Updated 5 years ago
- Tool that does layout analysis and/or text recognition using tesseract and outputs the result in Page XML format☆46Updated 2 months ago
- Presentations, tutorials and data for the OCR workshop at LMU☆17Updated 8 years ago
- Gamera 3 for Python 2 (deprecated)☆39Updated 2 years ago
- A small Docker built for the OCRopus OCR system.☆20Updated 7 years ago
- Convert a corpus of PDF to clean text files on a distributed architecture☆39Updated last year
- OCR evaluation brought to you by University of Alicante☆67Updated 2 years ago
- 'ocr-evaluation-tools' from http://ancientgreekocr.org/. Tools to test OCR accuracy.☆22Updated 7 years ago
- Tool for visualizing hOCR output from Tesseract (or other OCR engines that support hOCR).☆24Updated 10 years ago
- A small framework taking over the manual training process described in the Tesseract3 Wiki: https://code.google.com/p/tesseract-ocr/wiki/…☆132Updated 2 years ago
- See https://github.com/tworavens/tworavens for current repository for this project and http://2ra.vn for project pages.☆30Updated 6 years ago
- Ergonomic line-by-line transcription of scanned text.☆52Updated 4 years ago
- The CIS OCR PostCorrectionTool☆42Updated 2 years ago
- Part of eMOP: Franken+ tool for creating font training for Tesseract OCR engine from page images.☆24Updated 9 years ago
- A selection of test lines of several early printed books as well as the corresponding individual OCRopus models and mixed models.☆10Updated 7 years ago
- Wrapper to pocketsphinx phoneme labeling tools☆18Updated 8 years ago
- PurePos is an open source hybrid morphological tagger.☆16Updated 4 years ago
- gzipstream allows Python to process multi-part gzip files from a streaming source☆23Updated 8 years ago
- Extract postal addresses from the DOM☆66Updated 12 years ago
- Semanticizest: dump parser and client☆20Updated 9 years ago
- Create an ERD for a database given as JSON-table-schema☆11Updated 9 years ago
- This is a REST Server endpoint built using Flask and Python.☆24Updated 2 years ago
- Stanford CoreNLP NER addon for Apache Tika's NamerEntityParser☆13Updated 3 years ago
- Tools for TICCL☆14Updated 2 weeks ago
- Small library containing various image processing algorithms (+ Python 3 bindings) that has almost no dependencies -- Moved to Gnome's Gi…☆62Updated 7 years ago
- code to remove "noise" from hOCR output of Tesseract OCR.☆14Updated 8 years ago
- A workflow system for Natural Language Processing.☆21Updated 5 years ago
- Frog is an integration of memory-based natural language processing (NLP) modules developed for Dutch. All NLP modules are based on Timbl,…☆77Updated 2 weeks ago