Pre-Recognize Library - library with algorithms for improving OCR quality.
☆112May 2, 2023Updated 3 years ago
Alternatives and similar repositories for PRLib
Users that are interested in PRLib are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- User contributed (non Google) OCR models for Tesseract☆31Apr 18, 2025Updated last year
- ☆10Jan 22, 2023Updated 3 years ago
- A post-processing tool for scanned sheets of paper.☆1,177Jul 11, 2024Updated last year
- Validate and transform various OCR file formats (hOCR, ALTO, PAGE, FineReader)☆202May 21, 2025Updated 11 months ago
- OCRopus model for Gothic print (Fraktur)☆19Feb 16, 2020Updated 6 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Documentation and use cases for ALTO XML☆42Sep 10, 2018Updated 7 years ago
- ☆25Apr 22, 2018Updated 8 years ago
- A set of tools for rotating, cropping, and binding the images from a scanned book into a PDF.☆20Aug 15, 2018Updated 7 years ago
- Unofficial PyTorch implementation of GLAMpoints: Greedily Learned Accurate Match points☆28Jun 22, 2022Updated 3 years ago
- Building OCR using YOLO and Tesseract☆95Sep 6, 2021Updated 4 years ago
- Modules used for separating articles in (historical) newspapers and similar documents. This repository is part of the European Union's Ho…☆22Sep 2, 2022Updated 3 years ago
- Command line tool to convert page layout files to the latest PAGE XML format. It supports all previous versions of the PAGE format as wel…☆24Jan 30, 2021Updated 5 years ago
- Extract the MODS/ALTO metadata of a bunch of METS/ALTO files into pandas DataFrames for data analysis☆13Aug 21, 2025Updated 8 months ago
- Layout analysis to find layout elements in documents (similar to P2PaLA)☆20Mar 24, 2026Updated last month
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Augment line images for improving OCR datasets☆10Oct 4, 2023Updated 2 years ago
- ☆17Sep 25, 2021Updated 4 years ago
- Train Tesseract LSTM with make☆722Apr 18, 2025Updated last year
- This repository contains NLU related material for the I833 Deep Learning course at University of Applied Sciences Dresden☆13Dec 16, 2024Updated last year
- Leptonica is an open source library containing software that is broadly useful for image processing and image analysis applications. The …☆2,048Apr 18, 2026Updated 2 weeks ago
- Library used to deskew a scanned document☆515Updated this week
- Text page dewarping using a "cubic sheet" model☆1,512Mar 2, 2023Updated 3 years ago
- ☆20Aug 18, 2019Updated 6 years ago
- Training data from "Hauptphase I" of project "Digitalisierung historischer deutscher Zeitungen"☆12Dec 17, 2021Updated 4 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- A example of mixing ImGui and SDL2☆10Apr 3, 2019Updated 7 years ago
- Paymob Integration in Android using the new method, which is the Mobile SDKs to make it easier for developers to use Paymob functionalit…☆26Apr 21, 2026Updated 2 weeks ago
- A documentation for FAIR GPT, a virtual RDM consultant☆16Oct 10, 2024Updated last year
- This is an android tablet application for taking notes in class. It will use handwriting recognition in real time to digitize your notes…☆15Sep 1, 2014Updated 11 years ago
- Python bindings for libwapiti☆67Dec 9, 2019Updated 6 years ago
- Train Tesseract LSTM with tesstrain.sh on Windows☆26Dec 24, 2023Updated 2 years ago
- Read-only unofficial mirror of Pynini☆17May 7, 2019Updated 6 years ago
- A simple universal data description format for datasets, tailored for interfacing with humans.☆25Feb 16, 2021Updated 5 years ago
- A CLI tool that generates IIIF Presentation 2.1 Manifests from METS/MODS☆24Apr 17, 2025Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Pre-Recognition Library - library with algorithms for improving OCR quality.☆37Mar 20, 2021Updated 5 years ago
- ☆16Mar 28, 2025Updated last year
- OCR-D-compliant page segmentation☆67Nov 19, 2025Updated 5 months ago
- 基于DUILib和Tesseract的OCR识别工具☆18Jan 2, 2016Updated 10 years ago
- A curated list of awesome projects to simplify and improve paper and document scanning.☆508Apr 23, 2026Updated last week
- Development version of ndlstm, multidimensional LSTMs for TensorFlow☆19Feb 20, 2018Updated 8 years ago
- A selectional auto-encoder approach for document image binarization☆104Dec 8, 2022Updated 3 years ago