mittagessen / kraken
OCR engine for all the languages
☆822 Updated last week
Alternatives and similar repositories for kraken:
Users interested in kraken are comparing it to the libraries listed below.
- Document Layout Analysis ☆372 Updated this week
- Tools for manipulating and evaluating the hOCR format for representing multi-lingual OCR results by embedding them into HTML. ☆391 Updated 8 months ago
- Line based ATR Engine based on OCRopy ☆1,134 Updated 3 weeks ago
- ☆949 Updated 7 months ago
- A semi-automatic open-source tool for Layout Analysis and Region EXtraction on early printed books. ☆185 Updated 5 months ago
- Collection of OCR-related python tools and wrappers from @OCR-D ☆128 Updated last week
- A deep learning toolkit specialized for handwritten document analysis ☆235 Updated 8 months ago
- Validate and transform various OCR file formats (hOCR, ALTO, PAGE, FineReader) ☆188 Updated last week
- Generic framework for historical document processing ☆375 Updated 3 years ago
- Library used to deskew a scanned document ☆460 Updated last week
- Apply different text recognition services to images of handwritten documents. ☆177 Updated 2 years ago
- Master repository which includes most other OCR-D repositories as submodules ☆73 Updated 3 weeks ago
- Ocular is a state-of-the-art historical OCR system. ☆262 Updated 11 months ago
- Train Tesseract LSTM with make ☆673 Updated 3 weeks ago
- Pretrained mixed models to be used with Calamari. ☆62 Updated 7 months ago
- Detect textlines in document images ☆93 Updated 11 months ago
- Handwritten Text Recognition using TensorFlow ☆276 Updated 8 months ago
- Pre-Recognize Library - library with algorithms for improving OCR quality. ☆104 Updated 2 years ago
- An expandable and scalable OCR pipeline ☆87 Updated 7 years ago
- The scripts for training Detectron2-based Layout Models on popular layout analysis datasets ☆210 Updated last year
- Extract tables from scanned image PDFs using Optical Character Recognition. ☆273 Updated 4 years ago
- Update of the ISRI Analytic Tools for OCR Evaluation with UTF-8 support ☆57 Updated 4 years ago
- Turn images of tables into CSV data. Detect tables from images and run OCR on the cells. ☆520 Updated 4 years ago
- Document Scanner and Word Segmentation ☆123 Updated 4 years ago
- Document Layout Analysis resources repos for development with PdfPig. ☆612 Updated last year
- Page to PAGE Layout Analysis Tool ☆191 Updated 3 years ago
- OCR evaluation brought to you by University of Alicante ☆67 Updated 2 years ago
- docTR (Document Text Recognition) - a seamless, high-performing & accessible library for OCR-related tasks powered by Deep Learning. ☆4,622 Updated last week
- Document image dewarping library using a cubic sheet model ☆153 Updated this week
- Files and Scripts to run Tesseract 5 LSTM Training using fonts ☆80 Updated 3 years ago