danvk / boxeditLinks
A web-based editor for Tesseract box files
☆27Updated 10 years ago
Alternatives and similar repositories for boxedit
Users that are interested in boxedit are comparing it to the libraries listed below
Sorting:
- A node.js library for extracting data from scanned forms.☆117Updated 2 years ago
 - Exploring extracting tables from a PDF to CSV using PDF.JS☆105Updated 9 years ago
 - An expandable and scalable OCR pipeline☆88Updated 7 years ago
 - An implementation of RESTful web service for tesseract-OCR using tornado☆136Updated 2 years ago
 - Docker container to provide Apache Tika RESTful API☆41Updated 9 years ago
 - Python binding to libpoppler with focus on text extraction☆97Updated 3 years ago
 - official diybookscanner repository☆39Updated 11 years ago
 - Tool for visualizing hOCR output from Tesseract (or other OCR engines that support hOCR).☆25Updated 10 years ago
 - Tooling to extract data from scanned paper forms OCR-ed by Tesseract using the HOCR standard.☆84Updated 9 years ago
 - Next generation OCR engine based on LSTMs.☆52Updated 7 years ago
 - A set of tools to allow PDF to XML conversion, utilising Apache Beam and other tools. The aim of this project is to bring multiple tools…☆294Updated 3 years ago
 - An api to parse a CV, in particular the elements of its publication list☆35Updated 7 years ago
 - Couchdb _design documents editor☆38Updated 8 years ago
 - Facilitating the global conversation on academic literature☆267Updated 8 years ago
 - Automatic text summarization☆243Updated 7 years ago
 - Extract tables from PDF pages.☆298Updated 5 years ago
 - Create an ERD for a database given as JSON-table-schema☆11Updated 9 years ago
 - Convert a corpus of PDF to clean text files on a distributed architecture☆38Updated last year
 - Semantic Annotation Tool for PDF documents☆19Updated 10 years ago
 - Illuminating the forest AND the trees in your data☆38Updated 9 years ago
 - Extract postal addresses from the DOM☆66Updated 13 years ago
 - Structured Data from PDF image-based files☆89Updated 12 years ago
 - OCR evaluation brought to you by University of Alicante☆66Updated 3 years ago
 - Train your own Natural Language Processor from a browser 🤖 (Prototype)☆174Updated 2 years ago
 - Quickly turn command-line applications into RESTful webservices with a web-application front-end. You provide a specification of your com…☆133Updated last week
 - Expose Spacy nlp text parsing to Nodejs (and other languages) via socketIO☆227Updated 2 years ago
 - displaCy-ent.js: An open-source named entity visualiser for the modern web☆198Updated 7 years ago
 - conceptnet 4 bridge☆71Updated 10 years ago
 - Working with hOCR in Javascript☆136Updated 2 years ago
 - Journal scraper definitions for the ContentMine framework☆67Updated 7 years ago