mittagessen / kraken
OCR engine for all the languages
☆927 Updated 2 weeks ago
Alternatives and similar repositories for kraken
Users who are interested in kraken are comparing it to the libraries listed below.
- Tools for manipulating and evaluating the hOCR format for representing multi-lingual OCR results by embedding them into HTML. ☆405 Updated last year
- A deep learning toolkit specialized for handwritten document analysis ☆251 Updated 2 months ago
- Library used to deskew a scanned document ☆495 Updated this week
- Document Layout Analysis ☆392 Updated 2 weeks ago
- Line based ATR Engine based on OCRopy ☆1,178 Updated 7 months ago
- A semi-automatic open-source tool for Layout Analysis and Region EXtraction on early printed books. ☆194 Updated last month
- ☆997 Updated last year
- Train Tesseract LSTM with make ☆709 Updated 8 months ago
- Collection of OCR-related python tools and wrappers from @OCR-D ☆132 Updated 2 weeks ago
- Document Layout Analysis resources repos for development with PdfPig. ☆629 Updated 2 years ago
- Turn images of tables into CSV data. Detect tables from images and run OCR on the cells. ☆521 Updated 4 years ago
- Generic framework for historical document processing ☆382 Updated 4 years ago
- OCR software for recognition of handwritten text ☆823 Updated 3 years ago
- Update of the ISRI Analytic Tools for OCR Evaluation with UTF-8 support ☆59 Updated 4 years ago
- Handwritten Text Recognition using TensorFlow ☆287 Updated last year
- Master repository which includes most other OCR-D repositories as submodules ☆72 Updated 6 months ago
- Validate and transform various OCR file formats (hOCR, ALTO, PAGE, FineReader) ☆197 Updated 7 months ago
- Detect and read handwritten words on scanned pages. ☆134 Updated 2 years ago
- Extract tables from scanned image PDFs using Optical Character Recognition. ☆276 Updated 5 years ago
- Page to PAGE Layout Analysis Tool ☆191 Updated 3 years ago
- The scripts for training Detectron2-based Layout Models on popular layout analysis datasets ☆217 Updated 2 years ago
- DocLayNet: A Large Human-Annotated Dataset for Document-Layout Analysis ☆401 Updated 2 years ago
- Detect handwritten words (classic image processing based method). ☆274 Updated 2 years ago
- Apply different text recognition services to images of handwritten documents. ☆188 Updated 3 years ago
- Files and Scripts to run Tesseract 5 LSTM Training using fonts ☆79 Updated 3 years ago
- DocBank: A Benchmark Dataset for Document Layout Analysis ☆631 Updated last year
- ☆1,034 Updated 5 months ago
- Unofficial implementation of "TableNet: Deep Learning model for end-to-end Table detection and Tabular data extraction from Scanned Docum… ☆326 Updated 2 years ago
- Python library to extract tabular data from images and scanned PDFs ☆285 Updated last year
- Pretrained mixed models to be used with Calamari. ☆67 Updated last year