danvk / boxedit
A web-based editor for Tesseract box files
☆28 · Updated 10 years ago
Alternatives and similar repositories for boxedit:
Users who are interested in boxedit are comparing it to the libraries listed below:
- Next generation OCR engine based on LSTMs. ☆52 · Updated 7 years ago
- A small framework taking over the manual training process described in the Tesseract3 Wiki: https://code.google.com/p/tesseract-ocr/wiki/… ☆131 · Updated 2 years ago
- Convert a corpus of PDF to clean text files on a distributed architecture ☆38 · Updated last year
- A small Docker built for the OCRopus OCR system. ☆20 · Updated 7 years ago
- A platform for tools that do stuff with data ☆56 · Updated 6 years ago
- Extract postal addresses from the DOM ☆66 · Updated 12 years ago
- Tools for working with Optical Character Recognition output ☆16 · Updated 11 years ago
- See https://github.com/tworavens/tworavens for current repository for this project and http://2ra.vn for project pages. ☆30 · Updated 6 years ago
- Create an ERD for a database given as JSON-table-schema ☆11 · Updated 9 years ago
- REST endpoint for Tabula ☆25 · Updated 6 years ago
- gzipstream allows Python to process multi-part gzip files from a streaming source ☆23 · Updated 8 years ago
- A library for extracting tables from PDF files ☆90 · Updated 11 years ago
- Mechanical Turk on your own machine. ☆206 · Updated 5 months ago
- Wrapper to pocketsphinx phoneme labeling tools ☆18 · Updated 8 years ago
- A workflow system for Natural Language Processing. ☆21 · Updated 5 years ago
- Python bindings for Neo4j ☆26 · Updated 10 years ago
- Casual live data stream and visualization server. ☆90 · Updated 10 years ago
- Recognition Models for Kraken and CLSTM ☆14 · Updated 5 years ago
- crawler for YouTube ☆48 · Updated 11 years ago
- 'ocr-evaluation-tools' from http://ancientgreekocr.org/. Tools to test OCR accuracy. ☆22 · Updated 7 years ago
- Apache Solr: Because your Database is not a Search Engine ☆12 · Updated 6 years ago
- Persistent SSH tunnels ☆59 · Updated 6 years ago
- Fast Word Segmentation with Triangular Matrix ☆81 · Updated 3 years ago
- Tesseract Config files ☆29 · Updated 3 years ago
- GDG London hackathon. Prototype for Android app to display public data on your location in an info-graphic style. ☆24 · Updated 11 years ago
- Extract images from PDF documents. Works on multiple and single PDF files ☆14 · Updated 7 years ago
- Pipeline for distributed Natural Language Processing, made in Python ☆64 · Updated 8 years ago
- Multi-Entity Extraction Framework for Academic Documents (with default extraction tools) ☆31 · Updated last year
- Webapp to process Video and do Face Recognition with Facebox ☆32 · Updated 7 years ago
- GHRecommender - personalized recommendations for GitHub projects based on information about repositories starred by the user ☆26 · Updated 2 years ago