mittagessen / krakenLinks
OCR engine for all the languages
☆927Updated 2 weeks ago
Alternatives and similar repositories for kraken
Users that are interested in kraken are comparing it to the libraries listed below
Sorting:
- Tools for manipulating and evaluating the hOCR format for representing multi-lingual OCR results by embedding them into HTML.☆405Updated last year
- A deep learning toolkit specialized for handwritten document analysis☆251Updated 2 months ago
- Document Layout Analysis☆392Updated 2 weeks ago
- Library used to deskew a scanned document☆495Updated this week
- Train Tesseract LSTM with make☆709Updated 8 months ago
- Line based ATR Engine based on OCRopy☆1,178Updated 7 months ago
- A semi-automatic open-source tool for Layout Analysis and Region EXtraction on early printed books.☆194Updated last month
- ☆997Updated last year
- Generic framework for historical document processing☆382Updated 4 years ago
- Python library to extract tabular data from images and scanned PDFs☆285Updated last year
- Document Layout Analysis resources repos for development with PdfPig.☆629Updated 2 years ago
- Pretrained mixed models to be used with Calamari.☆67Updated last year
- Detect and read handwritten words on scanned pages.☆134Updated 2 years ago
- OCR software for recognition of handwritten text☆823Updated 3 years ago
- Handwritten Text Recognition using TensorFlow☆287Updated last year
- The scripts for training Detectron2-based Layout Models on popular layout analysis datasets☆217Updated 2 years ago
- Turn images of tables into CSV data. Detect tables from images and run OCR on the cells.☆521Updated 4 years ago
- Page to PAGE Layout Analysis Tool☆191Updated 3 years ago
- Validate and transform various OCR file formats (hOCR, ALTO, PAGE, FineReader)☆197Updated 7 months ago
- ☆1,034Updated 5 months ago
- Collection of OCR-related python tools and wrappers from @OCR-D☆132Updated 2 weeks ago
- Links to awesome OCR projects☆3,077Updated last year
- Detect handwritten words (classic image processing based method).☆274Updated 2 years ago
- Apply different text recognition services to images of handwritten documents.☆188Updated 3 years ago
- The deslanting algorithm sets text upright in images. Python, C++ and OpenCL implementations provided.☆151Updated 4 years ago
- Files and Scripts to run Tesseract 5 LSTM Training using fonts☆79Updated 3 years ago
- Extract tables from scanned image PDFs using Optical Character Recognition.☆276Updated 5 years ago
- DocBank: A Benchmark Dataset for Document Layout Analysis☆631Updated last year
- Powerful handwritten text recognition. A simple-to-use, unofficial implementation of the paper "TrOCR: Transformer-based Optical Characte…☆235Updated last year
- Unofficial implementation of "TableNet: Deep Learning model for end-to-end Table detection and Tabular data extraction from Scanned Docum…☆326Updated 2 years ago