mittagessen / kraken
OCR engine for all the languages
☆767 Updated this week
Alternatives and similar repositories for kraken:
Users who are interested in kraken compare it to the libraries listed below.
- Document Layout Analysis ☆359 Updated 3 weeks ago
- Tools for manipulating and evaluating the hOCR format for representing multi-lingual OCR results by embedding them into HTML. ☆375 Updated 5 months ago
- A deep learning toolkit specialized for handwritten document analysis ☆211 Updated 4 months ago
- A semi-automatic open-source tool for Layout Analysis and Region EXtraction on early printed books. ☆181 Updated last month
- Line based ATR Engine based on OCRopy ☆1,065 Updated 2 months ago
- Validate and transform various OCR file formats (hOCR, ALTO, PAGE, FineReader) ☆183 Updated 3 months ago
- Library used to deskew a scanned document ☆434 Updated last week
- Generic framework for historical document processing ☆373 Updated 3 years ago
- Pre-Recognize Library - library with algorithms for improving OCR quality. ☆104 Updated last year
- Collection of OCR-related python tools and wrappers from @OCR-D ☆121 Updated last week
- Provides OCR (Optical Character Recognition) services through web applications ☆245 Updated 11 months ago
- ☆912 Updated 4 months ago
- A post-processing tool for scanned sheets of paper. ☆1,055 Updated 6 months ago
- OCR software for recognition of handwritten text ☆779 Updated 2 years ago
- Train Tesseract LSTM with make ☆653 Updated 7 months ago
- Page to PAGE Layout Analysis Tool ☆191 Updated 3 years ago
- Extract tables from scanned image PDFs using Optical Character Recognition. ☆271 Updated 4 years ago
- Files and Scripts to run Tesseract 5 LSTM Training using fonts ☆79 Updated 2 years ago
- An expandable and scalable OCR pipeline ☆87 Updated 7 years ago
- The scripts for training Detectron2-based Layout Models on popular layout analysis datasets ☆204 Updated last year
- Pretrained mixed models to be used with Calamari. ☆60 Updated 3 months ago
- An interactive document scanner built in Python using OpenCV featuring automatic corner detection, image sharpening, and color thresholdi… ☆514 Updated 2 years ago
- Master repository which includes most other OCR-D repositories as submodules ☆72 Updated 3 months ago
- Turn images of tables into CSV data. Detect tables from images and run OCR on the cells. ☆511 Updated 3 years ago
- Python library to extract tabular data from images and scanned PDFs ☆270 Updated 5 months ago
- DocBank: A Benchmark Dataset for Document Layout Analysis ☆592 Updated 5 months ago
- Detect and read handwritten words on scanned pages. ☆113 Updated last year
- Ocular is a state-of-the-art historical OCR system. ☆258 Updated 7 months ago
- Repository collecting all the submodules for the new PyTorch-based OCR System. ☆141 Updated 3 years ago
- A packaged and flexible version of the CRAFT text detector and Keras CRNN recognition model. ☆1,415 Updated 5 months ago