Pre-Recognize Library - library with algorithms for improving OCR quality.
☆112 · May 2, 2023 · Updated 2 years ago
Alternatives and similar repositories for PRLib
Users that are interested in PRLib are comparing it to the libraries listed below.
- User contributed (non Google) OCR models for Tesseract ☆31 · Apr 18, 2025 · Updated 11 months ago
- ☆10 · Jan 22, 2023 · Updated 3 years ago
- A post-processing tool for scanned sheets of paper. ☆1,168 · Jul 11, 2024 · Updated last year
- Validate and transform various OCR file formats (hOCR, ALTO, PAGE, FineReader) ☆202 · May 21, 2025 · Updated 10 months ago
- OCRopus model for Gothic print (Fraktur) ☆19 · Feb 16, 2020 · Updated 6 years ago
- Double-checked Gold Standard Data for Training and Testing OCR Engines ☆21 · Dec 31, 2022 · Updated 3 years ago
- ☆25 · Apr 22, 2018 · Updated 7 years ago
- A set of tools for rotating, cropping, and binding the images from a scanned book into a PDF. ☆19 · Aug 15, 2018 · Updated 7 years ago
- Unofficial PyTorch implementation of GLAMpoints: Greedily Learned Accurate Match points ☆28 · Jun 22, 2022 · Updated 3 years ago
- A small Docker built for the OCRopus OCR system. ☆19 · Dec 16, 2017 · Updated 8 years ago
- Building OCR using YOLO and Tesseract ☆96 · Sep 6, 2021 · Updated 4 years ago
- Modules used for separating articles in (historical) newspapers and similar documents. This repository is part of the European Union's Ho… ☆22 · Sep 2, 2022 · Updated 3 years ago
- Command line tool to convert page layout files to the latest PAGE XML format. It supports all previous versions of the PAGE format as wel… ☆24 · Jan 30, 2021 · Updated 5 years ago
- Extract the MODS/ALTO metadata of a bunch of METS/ALTO files into pandas DataFrames for data analysis ☆13 · Aug 21, 2025 · Updated 7 months ago
- Augment line images for improving OCR datasets ☆10 · Oct 4, 2023 · Updated 2 years ago
- ☆17 · Sep 25, 2021 · Updated 4 years ago
- Automatic skew correction using corner detectors and homography with opencv tools ☆40 · Sep 24, 2019 · Updated 6 years ago
- Train Tesseract LSTM with make ☆718 · Apr 18, 2025 · Updated 11 months ago
- Repository collecting all the submodules for the new PyTorch-based OCR System. ☆142 · Feb 22, 2021 · Updated 5 years ago
- This repository contains NLU related material for the I833 Deep Learning course at University of Applied Sciences Dresden ☆13 · Dec 16, 2024 · Updated last year
- Leptonica is an open source library containing software that is broadly useful for image processing and image analysis applications. The … ☆2,040 · Mar 24, 2026 · Updated 3 weeks ago
- Library used to deskew a scanned document ☆507 · Apr 2, 2026 · Updated last week
- Repository for code from "On Adversarial Removal of Hypothesis-only Bias in Natural Language Inference" (StarSem 2019) and "Don’t Take th… ☆15 · Apr 6, 2020 · Updated 6 years ago
- Text page dewarping using a "cubic sheet" model ☆1,508 · Mar 2, 2023 · Updated 3 years ago
- ☆149 · Jul 2, 2020 · Updated 5 years ago
- An application of high resolution GANs to dewarp images of perturbed documents ☆151 · Oct 18, 2021 · Updated 4 years ago
- Training data from "Hauptphase I" of project "Digitalisierung historischer deutscher Zeitungen" ☆12 · Dec 17, 2021 · Updated 4 years ago
- Unofficial mirror of pdftk - imported using git-ubuntu ☆10 · Aug 20, 2018 · Updated 7 years ago
- My talk on using LSTM & HTM for anomaly detection @ Google Developers Group. ☆10 · Jun 11, 2017 · Updated 8 years ago
- This is an android tablet application for taking notes in class. It will use handwriting recognition in real time to digitize your notes… ☆15 · Sep 1, 2014 · Updated 11 years ago
- ScanTailor Advanced is the version that merges the features of the ScanTailor Featured and ScanTailor Enhanced versions, brings new ones … ☆1,442 · Sep 13, 2023 · Updated 2 years ago
- Python bindings for libwapiti ☆67 · Dec 9, 2019 · Updated 6 years ago
- Train Tesseract LSTM with tesstrain.sh on Windows ☆26 · Dec 24, 2023 · Updated 2 years ago
- A simple universal data description format for datasets, tailored for interfacing with humans. ☆25 · Feb 16, 2021 · Updated 5 years ago
- A CLI tool that generates IIIF Presentation 2.1 Manifests from METS/MODS ☆24 · Apr 17, 2025 · Updated 11 months ago
- Pre-Recognition Library - library with algorithms for improving OCR quality. ☆37 · Mar 20, 2021 · Updated 5 years ago
- ☆16 · Mar 28, 2025 · Updated last year
- Automated Bangla License Plate Detection and Recognition - Implementation. For thesis report visit- https://github.com/dipu-bd/thesis-rep… ☆18 · Apr 1, 2018 · Updated 8 years ago
- OCR-D-compliant page segmentation ☆67 · Nov 19, 2025 · Updated 4 months ago