Various documents related to Tesseract OCR
☆267Sep 12, 2021Updated 4 years ago
Alternatives and similar repositories for docs
Users that are interested in docs are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Tesseract documentation☆75Sep 12, 2021Updated 4 years ago
- Source training data for Tesseract for lots of languages☆866Apr 1, 2025Updated last year
- Trained models with fast variant of the "best" LSTM models + legacy models☆7,503Mar 9, 2024Updated 2 years ago
- Part of eMOP: Franken+ tool for creating font training for Tesseract OCR engine from page images.☆24Sep 24, 2015Updated 10 years ago
- Python-based tools for document analysis and OCR☆3,471May 22, 2021Updated 4 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- A small Docker built for the OCRopus OCR system.☆19Dec 16, 2017Updated 8 years ago
- ☆16Mar 24, 2021Updated 5 years ago
- A small C++ implementation of LSTM networks, focused on OCR.☆831Oct 24, 2019Updated 6 years ago
- Train Tesseract LSTM with make☆722Apr 18, 2025Updated last year
- Double-checked Gold Standard Data for Training and Testing OCR Engines☆18Jun 15, 2017Updated 8 years ago
- A tools can generate samples for OCR trainning. 用于OCR的字符样本生成工具☆65Oct 22, 2017Updated 8 years ago
- Free open-source OCR application for the Windows Desktop - A modern GUI front-end for the Tesseract OCR engine. The application also incl…☆267Apr 11, 2015Updated 11 years ago
- A Python wrapper for the tesseract-ocr API☆2,161Mar 16, 2026Updated last month
- PyData Boston 2013 talks: "Intro to scikit-learn" & "Realtime Predictive Analytics: Using scikit-learn and RabbitMQ"☆11Jan 5, 2014Updated 12 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Fast integer versions of trained LSTM models☆595Aug 1, 2024Updated last year
- transform a datapoint from a website into a CSV time-series dataset using the wayback machine☆12May 24, 2023Updated 2 years ago
- Library with user interface elements and client-server communication classes based on Google Web Toolkit (GWT) that can be used for crowd…☆14Oct 3, 2017Updated 8 years ago
- Files and Scripts to run Tesseract 5 LSTM Training using fonts☆79Feb 6, 2022Updated 4 years ago
- Links to awesome OCR projects☆3,102Jul 6, 2024Updated last year
- This project shows how to extract the title from the image☆10Oct 23, 2018Updated 7 years ago
- Repository collecting all the submodules for the new PyTorch-based OCR System.☆141Feb 22, 2021Updated 5 years ago
- Best (most accurate) trained LSTM models.☆1,542Mar 9, 2024Updated 2 years ago
- A CLI tool for pulling in election results from sites using Clarity☆18Dec 6, 2022Updated 3 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- A semi-automatic open-source tool for Layout Analysis and Region EXtraction on early printed books.☆196Updated this week
- a general list of resources and articles for people interested in getting into data journalism☆16Apr 12, 2023Updated 3 years ago
- Converters for various file formats used for representing OCR☆12Apr 30, 2025Updated last year
- Code from NICAR 2020☆17Jan 28, 2021Updated 5 years ago
- The extra Zookeeper support for Illyria client pool.☆11Jul 3, 2024Updated last year
- Training files produced for and by the Tesseract OCR engine for work on the Early Modern OCR Project (eMOP)☆37Sep 24, 2015Updated 10 years ago
- FOIL resources for New York City and New York State☆18Dec 16, 2015Updated 10 years ago
- Allows Apps to authenticate point to point. For higly trusted Apps and Sandbox contexts.☆22Nov 30, 2021Updated 4 years ago
- Basic Optical Character Recognition Tutorial. Damiles Blog.☆120Jan 8, 2021Updated 5 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- support English and Chinese character☆15Jun 28, 2016Updated 9 years ago
- This is a reading list for deep learning for OCR☆343Nov 4, 2017Updated 8 years ago
- Very basic Tesseract-OCR example with CPPAN. Cppan support is discontinued. Please use sw (cppan v2) instead. Updated example is here: ht…☆31Jul 9, 2018Updated 7 years ago
- This is the home for the Azure Resource Manager (ARM) templates, supporting extensions, lab creation and lab manuals that allow a user to…☆10Sep 14, 2017Updated 8 years ago
- A tutorial on the PyTorch-based ocropus components.☆73Apr 18, 2020Updated 6 years ago
- Tesseract Config files☆35Sep 12, 2021Updated 4 years ago
- ☆14Feb 13, 2021Updated 5 years ago