User-contributed (non-Google) OCR models for Tesseract
☆31 · Apr 18, 2025 · Updated 11 months ago
Alternatives and similar repositories for tessdata_contrib
Users who are interested in tessdata_contrib are comparing it to the libraries listed below.
- Training data from "Hauptphase I" of project "Digitalisierung historischer deutscher Zeitungen" ☆12 · Dec 17, 2021 · Updated 4 years ago
- Layout analysis to find layout elements in documents (similar to P2PaLA) ☆20 · Updated this week
- Master repository which includes most other OCR-D repositories as submodules ☆72 · Jul 4, 2025 · Updated 8 months ago
- Tesseract Config files ☆32 · Sep 12, 2021 · Updated 4 years ago
- Java command line tool to convert PAGE XML files with layout and text content to PDF ☆10 · Apr 27, 2020 · Updated 5 years ago
- Modules used for separating articles in (historical) newspapers and similar documents. This repository is part of the European Union's Ho… ☆22 · Sep 2, 2022 · Updated 3 years ago
- Augment line images for improving OCR datasets ☆10 · Oct 4, 2023 · Updated 2 years ago
- ☆13 · May 27, 2024 · Updated last year
- Validate and transform various OCR file formats (hOCR, ALTO, PAGE, FineReader) ☆201 · May 21, 2025 · Updated 10 months ago
- Tool to OCR PDFs using Google Cloud Vision ☆42 · Dec 7, 2022 · Updated 3 years ago
- ☆10 · Jan 22, 2023 · Updated 3 years ago
- This repository contains NLU related material for the I833 Deep Learning course at University of Applied Sciences Dresden ☆13 · Dec 16, 2024 · Updated last year
- A polyfill for XSLTProcessor ☆36 · Mar 13, 2026 · Updated 2 weeks ago
- ☆24 · Jan 14, 2026 · Updated 2 months ago
- Read-only unofficial mirror of Pynini ☆17 · May 7, 2019 · Updated 6 years ago
- Pre-Recognize Library - library with algorithms for improving OCR quality. ☆112 · May 2, 2023 · Updated 2 years ago
- Double-checked Gold Standard Data for Training and Testing OCR Engines ☆21 · Dec 31, 2022 · Updated 3 years ago
- Kiwix Catalog BitTorrent Seeder Companion ☆15 · Dec 8, 2025 · Updated 3 months ago
- A Hypothes.is integration plugin for OJS ☆12 · Mar 17, 2025 · Updated last year
- In-browser textual tone analyzer using window.ai API ☆12 · Jul 22, 2024 · Updated last year
- Pangolin VPN client for Apple devices ☆26 · Mar 20, 2026 · Updated last week
- An OCR evaluation tool ☆69 · Aug 22, 2025 · Updated 7 months ago
- Check your modified Ground Truth files with visual support! ☆10 · Jan 31, 2024 · Updated 2 years ago
- Guides and test data for OCR4all ☆32 · Oct 4, 2022 · Updated 3 years ago
- NLP-helper for OCR-ed pages in PAGE XML format ☆10 · Dec 6, 2024 · Updated last year
- Calculate IPv6 and NAT64 reachability scores for websites ☆11 · Feb 21, 2017 · Updated 9 years ago
- Tools for working with the Wayback Machine in Rust ☆15 · Dec 17, 2025 · Updated 3 months ago
- Core libraries by the PRImA Research Lab ☆16 · Jul 30, 2024 · Updated last year
- An example project demonstrating how to perform OCR with multi-modal LLMs ☆10 · Mar 14, 2024 · Updated 2 years ago
- Selected code and data for The Online Books Page and related applications