User contributed (non Google) OCR models for Tesseract
☆31Apr 18, 2025Updated last year
Alternatives and similar repositories for tessdata_contrib
Users that are interested in tessdata_contrib are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Repository for tesseract testing☆35Jun 9, 2024Updated last year
- Data used for LSTM model training☆126Mar 9, 2024Updated 2 years ago
- Layout analysis to find layout elements in documents (similar to P2PaLA)☆21May 20, 2026Updated last week
- Java command line tool to convert PAGE XML files with layout and text content to PDF☆10Apr 27, 2020Updated 6 years ago
- Modules used for separating articles in (historical) newspapers and similar documents. This repository is part of the European Union's Ho…☆22Sep 2, 2022Updated 3 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Command line tool to convert page layout files to the latest PAGE XML format. It supports all previous versions of the PAGE format as wel…☆24Jan 30, 2021Updated 5 years ago
- Augment line images for improving OCR datasets☆10Oct 4, 2023Updated 2 years ago
- ☆14May 27, 2024Updated last year
- OCR-D post-correction with encoder-attention-decoder LSTMs☆13May 1, 2025Updated last year
- Validate and transform various OCR file formats (hOCR, ALTO, PAGE, FineReader)☆202May 21, 2025Updated last year
- Tool to OCR PDFs using Google Cloud Vision☆42Dec 7, 2022Updated 3 years ago
- A polyfill for XSLTProcessor☆39May 14, 2026Updated last week
- ☆24Apr 8, 2026Updated last month
- A documentation for FAIR GPT, a virtual RDM consultant☆16Oct 10, 2024Updated last year
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Read-only unofficial mirror of Pynini☆17May 7, 2019Updated 7 years ago
- Pre-Recognize Library - library with algorithms for improving OCR quality.☆112May 2, 2023Updated 3 years ago
- Double-checked Gold Standard Data for Training and Testing OCR Engines☆21Dec 31, 2022Updated 3 years ago
- DFKI Layout Detection for OCR-D☆47May 1, 2025Updated last year
- DITA Open Toolkit project website · dita-ot.org☆16Updated this week
- Kiwix Catalog BitTorrent Seeder Companion☆17Dec 8, 2025Updated 5 months ago
- Transkriptionen von Fibeln (19. Jahrhundert)☆11Oct 31, 2025Updated 6 months ago
- This plugin provides a useful feature for multi-language☆14Jul 15, 2022Updated 3 years ago
- In-browser textual tone analyzer using window.ai API☆12Jul 22, 2024Updated last year
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Jenkins groovy plugin☆34May 1, 2026Updated 3 weeks ago
- An OCR evaluation tool☆69Aug 22, 2025Updated 9 months ago
- guides and test data for OCR4all☆32Oct 4, 2022Updated 3 years ago
- Kitodo.Publication☆14Updated this week
- Tools for working with the Wayback Machine in Rust☆15Dec 17, 2025Updated 5 months ago
- A podcast transcription service built on Azure that transcribes any new episode of your podcast and displays synchronized transcripts alo…☆10Dec 10, 2022Updated 3 years ago
- Core libraries by the PRImA Research Lab☆16Jul 30, 2024Updated last year
- An example project demonstrating how to perform OCR with multi-modal LLMs☆10Mar 14, 2024Updated 2 years ago
- Antenna House PDF5-ML DITA-OT Plug-in☆24Apr 23, 2026Updated last month
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Official repository for the source used to build the firefox snap (published by Mozilla)☆22Updated this week
- Selected code and data for The Online Books Page and related applications☆11May 4, 2026Updated 3 weeks ago
- Named Entity Recognition tool for Europeana Newspapers☆14Apr 5, 2018Updated 8 years ago
- ☆18Sep 25, 2021Updated 4 years ago
- Swete's LXX Text from 1KY Greek with Corrections Against Manuscripts☆10Oct 11, 2020Updated 5 years ago
- ☆11Jan 2, 2022Updated 4 years ago
- Manuals, lexica, OCR test data for PoCoTo and the profiler☆15Jul 2, 2021Updated 4 years ago