danvk / boxedit
A web-based editor for Tesseract box files
☆28Updated 10 years ago
Alternatives and similar repositories for boxedit:
Users that are interested in boxedit are comparing it to the libraries listed below
- The MIL-STD-498 DIDs converted to HTML.☆24Updated 12 years ago
- Tool that does layout analysis and/or text recognition using tesseract and outputs the result in Page XML format☆46Updated this week
- GDG London hackathon. Prototype for Android app to get display public data on your location in an info-graphic style.☆24Updated 11 years ago
- Convert a corpus of PDF to clean text files on a distributed architecture☆38Updated last year
- A small framework taking over the manual training process described in the Tesseract3 Wiki: https://code.google.com/p/tesseract-ocr/wiki/…☆131Updated last year
- Presentations, tutorials and data for the OCR workshop at LMU☆17Updated 7 years ago
- Docker container to provide Apache Tika RESTful API☆41Updated 9 years ago
- Next generation OCR engine based on LSTMs.☆52Updated 6 years ago
- User contributed (non Google) OCR models for Tesseract☆26Updated 5 months ago
- Visualization Storytelling Components☆32Updated 10 years ago
- Tools for working with Optical Character Recognition output☆16Updated 11 years ago
- Full data science workflows on the web☆20Updated 5 years ago
- REST endpoint for Tabula☆25Updated 6 years ago
- official diybookscanner repository☆39Updated 10 years ago
- Document Imaging Archive System. Home document imaging, with OCR. Scan documents (with SANE) or import ODF documents, assign tags. Use op…☆24Updated 9 years ago
- An expandable and scalable OCR pipeline☆87Updated 7 years ago
- Software to dewarp book picture images, and for building models of ruled surfaces☆11Updated 9 years ago
- See https://github.com/tworavens/tworavens for current repository for this project and http://2ra.vn for project pages.☆30Updated 6 years ago
- Generate westminster parliament charts as virtual-dom SVG.☆12Updated 3 years ago
- Python bindings for Neo4j☆26Updated 10 years ago
- Uploads files with background uploads and progress feedback on modern browsers☆10Updated last year
- An implementation of RESTful web service for tesseract-OCR using tornado☆136Updated last year
- A small Docker built for the OCRopus OCR system.☆20Updated 7 years ago
- Tool for visualizing hOCR output from Tesseract (or other OCR engines that support hOCR).☆23Updated 10 years ago
- Google Refine extension for adding columns (extending data) from DBpedia☆39Updated 11 years ago
- 'ocr-evaluation-tools' from http://ancientgreekocr.org/. Tools to test OCR accuracy.☆22Updated 7 years ago
- Create an ERD for a database given as JSON-table-schema☆11Updated 9 years ago
- Multi-Entity Extraction Framework for Academic Documents (with default extraction tools)☆31Updated last year
- Collection of Extension Functions for SQLite3☆13Updated 7 years ago
- Wrapper to pocketsphinx phoneme labeling tools☆18Updated 8 years ago