danvk / boxedit
A web-based editor for Tesseract box files
☆28Updated 9 years ago
Related projects ⓘ
Alternatives and complementary repositories for boxedit
- Next generation OCR engine based on LSTMs.☆52Updated 6 years ago
- A small Docker built for the OCRopus OCR system.☆19Updated 6 years ago
- Python bindings for Neo4j☆26Updated 10 years ago
- Convert a corpus of PDF to clean text files on a distributed architecture☆37Updated 8 months ago
- Full data science workflows on the web☆20Updated 5 years ago
- Tooling to extract data from scanned paper forms OCR-ed by Tesseract using the HOCR standard.☆84Updated 8 years ago
- A small framework taking over the manual training process described in the Tesseract3 Wiki: https://code.google.com/p/tesseract-ocr/wiki/…☆130Updated last year
- Eyebrowser Server☆27Updated 6 years ago
- Globally optimal geometric matching.☆10Updated 8 years ago
- Wrapper to pocketsphinx phoneme labeling tools☆18Updated 8 years ago
- Create an ERD for a database given as JSON-table-schema☆11Updated 8 years ago
- PDF Extraction Toolkit☆41Updated 3 years ago
- An implementation of RESTful web service for tesseract-OCR using tornado☆135Updated last year
- An expandable and scalable OCR pipeline☆86Updated 7 years ago
- Fast Word Segmentation with Triangular Matrix☆77Updated 3 years ago
- OpenStax centralized authentication and accounts service☆15Updated this week
- crawler for YouTube☆48Updated 10 years ago
- gzipstream allows Python to process multi-part gzip files from a streaming source☆23Updated 7 years ago
- official diybookscanner repository☆39Updated 10 years ago
- An api to parse a CV, in particular the elements of its publication list☆35Updated 6 years ago
- Plots various graphs for a series of plaintext files in a directory☆19Updated 8 years ago
- Navigating around a grid of cells like XPath for spreadsheets; supports Python 3.5+☆47Updated last year
- ☆36Updated 9 years ago
- WebAnnotator is a tool for annotating Web pages. WebAnnotator is implemented as a Firefox extension (https://addons.mozilla.org/en-US/fi…☆48Updated 2 years ago
- Inspired by Machine Learning course on coursera.org. A helper tool for generating ocr features for Machine Learning algos...☆78Updated 4 years ago
- A platform for tools that do stuff with data☆56Updated 5 years ago
- 📑 SQLite extension to add the Okapi BM25 ranking algorithm☆35Updated 9 years ago
- Extract postal addresses from the DOM☆66Updated 12 years ago
- bigram / trigram analysis of wikipedia; mainly mutual info☆22Updated 12 years ago
- schema to form + validation to json that conforms to schema☆61Updated 8 years ago