danvk / boxedit
A web-based editor for Tesseract box files
☆27Updated 10 years ago
Alternatives and similar repositories for boxedit
Users that are interested in boxedit are comparing it to the libraries listed below
Sorting:
- Next generation OCR engine based on LSTMs.☆52Updated 7 years ago
- Tool for visualizing hOCR output from Tesseract (or other OCR engines that support hOCR).☆23Updated 10 years ago
- Docker container to provide Apache Tika RESTful API☆41Updated 9 years ago
- Tooling to extract data from scanned paper forms OCR-ed by Tesseract using the HOCR standard.☆84Updated 9 years ago
- A small Docker built for the OCRopus OCR system.☆20Updated 7 years ago
- An expandable and scalable OCR pipeline☆87Updated 7 years ago
- A small framework taking over the manual training process described in the Tesseract3 Wiki: https://code.google.com/p/tesseract-ocr/wiki/…☆131Updated 2 years ago
- Convert a corpus of PDF to clean text files on a distributed architecture☆38Updated last year
- OCR evaluation brought to you by University of Alicante☆67Updated 2 years ago
- 'ocr-evaluation-tools' from http://ancientgreekocr.org/. Tools to test OCR accuracy.☆22Updated 7 years ago
- Presentations, tutorials and data for the OCR workshop at LMU☆17Updated 7 years ago
- Fast Word Segmentation with Triangular Matrix☆81Updated 3 years ago
- Structured Data from PDF image-based files☆88Updated 12 years ago
- Explore networks and publish narratives.☆53Updated 4 years ago
- Google Refine extension for adding columns (extending data) from DBpedia☆39Updated 11 years ago
- A library for extracting tables from PDF files☆89Updated 11 years ago
- Facilitating the global conversation on academic literature☆266Updated 7 years ago
- Full data science workflows on the web☆21Updated 6 years ago
- Extract postal addresses from the DOM☆66Updated 12 years ago
- Part of eMOP: Franken+ tool for creating font training for Tesseract OCR engine from page images.☆24Updated 9 years ago
- Gamera 3 for Python 2 (deprecated)☆39Updated 2 years ago
- Contains the implementation of algorithms that estimate the geographic location of media content based on their content and metadata. It …☆15Updated 8 years ago
- See https://github.com/tworavens/tworavens for current repository for this project and http://2ra.vn for project pages.☆30Updated 6 years ago
- Recognition Models for Kraken and CLSTM☆14Updated 5 years ago
- A workflow system for Natural Language Processing.☆21Updated 5 years ago
- Python bindings for Neo4j☆26Updated 10 years ago
- official diybookscanner repository☆39Updated 11 years ago
- Quickly turn command-line applications into RESTful webservices with a web-application front-end. You provide a specification of your com…☆130Updated 2 months ago
- Natural Language Generator for Python☆27Updated 8 years ago
- An implementation of RESTful web service for tesseract-OCR using tornado☆136Updated last year