mittagessen / kraken
OCR engine for all the languages
☆748 · Updated 2 weeks ago
Related projects
Alternatives and complementary repositories for kraken
- Tools for manipulating and evaluating the hOCR format for representing multi-lingual OCR results by embedding them into HTML. ☆370 · Updated 3 months ago
- A semi-automatic open-source tool for Layout Analysis and Region EXtraction on early printed books. ☆180 · Updated last week
- Document Layout Analysis ☆350 · Updated this week
- A deep learning toolkit specialized for handwritten document analysis ☆207 · Updated 2 months ago
- Line based ATR Engine based on OCRopy ☆1,051 · Updated last week
- Collection of OCR-related python tools and wrappers from @OCR-D ☆119 · Updated this week
- Generic framework for historical document processing ☆373 · Updated 3 years ago
- Validate and transform various OCR file formats (hOCR, ALTO, PAGE, FineReader) ☆180 · Updated last month
- Library used to deskew a scanned document ☆418 · Updated last month
- Page to PAGE Layout Analysis Tool ☆191 · Updated 2 years ago
- Working with hOCR in Javascript ☆122 · Updated last year
- Master repository which includes most other OCR-D repositories as submodules ☆72 · Updated last month
- Provides OCR (Optical Character Recognition) services through web applications ☆239 · Updated 9 months ago
- Repository collecting all the submodules for the new PyTorch-based OCR System. ☆141 · Updated 3 years ago
- Train Tesseract LSTM with make ☆639 · Updated 5 months ago
- Python-based tools for document analysis and OCR ☆3,422 · Updated 3 years ago
- Ocular is a state-of-the-art historical OCR system. ☆255 · Updated 5 months ago
- Toolbox for OCR post-correction ☆123 · Updated 5 years ago
- Update of the ISRI Analytic Tools for OCR Evaluation with UTF-8 support ☆57 · Updated 3 years ago
- Apply different text recognition services to images of handwritten documents. ☆172 · Updated last year
- Document Layout Analysis resources repos for development with PdfPig. ☆583 · Updated last year
- Detect and read handwritten words on scanned pages. ☆106 · Updated last year
- Augmentation pipeline for rendering synthetic paper printing, faxing, scanning and copy machine processes ☆355 · Updated 2 months ago
- Handwritten text recognition using transformers. ☆154 · Updated 3 months ago
- ☆886 · Updated 2 months ago
- Detect textlines in document images ☆90 · Updated 5 months ago
- Web based JavaScript GUI library for proofreading/editing hOCR ☆92 · Updated 6 years ago
- The scripts for training Detectron2-based Layout Models on popular layout analysis datasets ☆203 · Updated last year
- OCR evaluation brought to you by University of Alicante ☆67 · Updated 2 years ago
- Python library to extract tabular data from images and scanned PDFs ☆264 · Updated 3 months ago