tesseract-ocr / tessdocLinks
Tesseract documentation
☆2,182Updated 3 weeks ago
Alternatives and similar repositories for tessdoc
Users that are interested in tessdoc are comparing it to the libraries listed below
Sorting:
- Best (most accurate) trained LSTM models.☆1,426Updated last year
- Fast integer versions of trained LSTM models☆567Updated last year
- Train Tesseract LSTM with make☆698Updated 5 months ago
- Tesseract Open Source OCR Engine (main repository)☆3,803Updated 3 months ago
- OCR engine for all the languages☆894Updated this week
- Line based ATR Engine based on OCRopy☆1,165Updated 4 months ago
- Links to awesome OCR projects☆3,049Updated last year
- Download Poppler binaries packaged for Windows with dependencies☆945Updated last month
- Python-tesseract is an optical character recognition (OCR) tool for python☆169Updated 7 years ago
- Source training data for Tesseract for lots of languages☆858Updated 6 months ago
- A packaged and flexible version of the CRAFT text detector and Keras CRNN recognition model.☆1,466Updated 2 weeks ago
- img2table is a table identification and extraction Python Library for PDF and images, based on OpenCV image processing☆792Updated last month
- Turn images of tables into CSV data. Detect tables from images and run OCR on the cells.☆521Updated 4 years ago
- Web interface for recognizing text, proofreading OCR, and creating fully-digitized documents.☆216Updated last week
- Demos, examples and utilities using PyMuPDF☆684Updated last year
- Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and …☆28,101Updated last year
- ☆980Updated last year
- Tools for manipulating and evaluating the hOCR format for representing multi-lingual OCR results by embedding them into HTML.☆397Updated last year
- ☆146Updated 5 years ago
- qpdf: A content-preserving PDF document transformer☆4,328Updated last week
- Library used to deskew a scanned document☆488Updated last week
- docTR (Document Text Recognition) - a seamless, high-performing & accessible library for OCR-related tasks powered by Deep Learning.☆5,488Updated last month
- Various documents related to Tesseract OCR☆265Updated 4 years ago
- Extract tables from scanned image PDFs using Optical Character Recognition.☆276Updated 5 years ago
- Convert a PDF via OCR to a TXT file in UTF-8 encoding☆152Updated 2 years ago
- Python bindings to PDFium, reasonably cross-platform.☆652Updated this week
- Files and Scripts to run Tesseract 5 LSTM Training using fonts☆79Updated 3 years ago
- Table Transformer (TATR) is a deep learning model for extracting tables from unstructured documents (PDFs and images). This is also the o…☆2,749Updated last year
- ☆1,023Updated 3 months ago
- An interactive document scanner built in Python using OpenCV featuring automatic corner detection, image sharpening, and color thresholdi…☆578Updated 2 years ago