indic-ocr / ocrservice
OCR as a service
☆14 Updated 7 years ago
Related projects:
- OCR evaluation brought to you by University of Alicante ☆66 Updated 2 years ago
- Ergonomic line-by-line transcription of scanned text. ☆47 Updated 3 years ago
- User-contributed (non-Google) OCR models for Tesseract ☆19 Updated last year
- hOCR Specification Python Parser ☆13 Updated 8 years ago
- The hOCR Embedded OCR Workflow and Output Format ☆72 Updated last month
- PAGE XML format collection for document image page content and more ☆62 Updated 3 years ago
- Run tesseract with the tesserocr bindings with @OCR-D's interfaces ☆38 Updated last month
- Master repository which includes most other OCR-D repositories as submodules ☆71 Updated last month
- Web application for transcribing OCR ground truth from Archive.org ☆17 Updated 6 years ago
- Better models for Indic Scripts ☆45 Updated 6 years ago
- A rule-based iterative affix stripping stemmer for Tamil ☆43 Updated 6 years ago
- Collaborative Synchronized Corpus Annotation Tool ☆10 Updated 5 years ago
- Efficient hOCR tooling ☆38 Updated last week
- Aksharamukha Python Library ☆43 Updated 3 months ago
- Documentation and use cases for ALTO XML ☆39 Updated 6 years ago
- Transliteration module for Indian Languages ☆77 Updated 10 months ago
- Text conversion tool (from e.g. Word, HTML, txt) to corpus formats (TEI or FoLiA) ☆23 Updated 2 years ago
- Command line tool to convert page layout files to the latest PAGE XML format. It supports all previous versions of the PAGE format as wel… ☆23 Updated 3 years ago
- tesseractXplore, an easy-to-use Tesseract GUI with full control ☆20 Updated 2 years ago
- An expandable and scalable OCR pipeline ☆86 Updated 6 years ago
- Glyph Miner, a system for extracting glyphs from early typeset prints ☆33 Updated 7 years ago
- 'ocr-evaluation-tools' from http://ancientgreekocr.org/. Tools to test OCR accuracy. ☆22 Updated 6 years ago
- Scripts and results from our OCR roundup, available on Source ☆150 Updated 5 years ago
- A set of workflows for corpus building through OCR, post-correction and normalisation ☆48 Updated 2 years ago
- Tool for visualizing hOCR output from Tesseract (or other OCR engines that support hOCR). ☆22 Updated 9 years ago
- The CIS OCR PostCorrectionTool ☆39 Updated last year
- Validate and transform various OCR file formats (hOCR, ALTO, PAGE, FineReader) ☆176 Updated last month
- Extract structured data from PDF invoices ☆13 Updated 3 years ago