tesseract-ocr / tessdoc
Tesseract documentation
☆2,025Updated 3 months ago
Alternatives and similar repositories for tessdoc:
Users that are interested in tessdoc are comparing it to the libraries listed below
- Best (most accurate) trained LSTM models.☆1,334Updated last year
- Tesseract Open Source OCR Engine (main repository)☆3,501Updated last week
- Fast integer versions of trained LSTM models☆534Updated 9 months ago
- Train Tesseract LSTM with make☆673Updated 2 weeks ago
- Source training data for Tesseract for lots of languages☆857Updated last month
- Download Poppler binaries packaged for Windows with dependencies☆772Updated 5 months ago
- OCR engine for all the languages☆822Updated last week
- Line based ATR Engine based on OCRopy☆1,134Updated 3 weeks ago
- Links to awesome OCR projects☆2,969Updated 10 months ago
- A packaged and flexible version of the CRAFT text detector and Keras CRNN recognition model.☆1,450Updated 9 months ago
- ☆949Updated 7 months ago
- A Gtk/Qt front-end to tesseract-ocr.☆1,751Updated 2 weeks ago
- A free tool to OCR a PDF and add a text "layer" in the original file, making a searchable PDF. Use only open source tools. Please tip!☆287Updated last year
- Open source Python library for converting PDF to DOCX.☆2,907Updated 2 weeks ago
- A synthetic data generator for text recognition☆3,467Updated 9 months ago
- Demos, examples and utilities using PyMuPDF☆654Updated 10 months ago
- Box editor and trainer for Tesseract OCR☆239Updated 10 months ago
- docTR (Document Text Recognition) - a seamless, high-performing & accessible library for OCR-related tasks powered by Deep Learning.☆4,622Updated last week
- Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and …☆26,534Updated 7 months ago
- Data used for LSTM model training☆117Updated last year
- Web interface for recognizing text, proofreading OCR, and creating fully-digitized documents.☆176Updated last week
- Table Transformer (TATR) is a deep learning model for extracting tables from unstructured documents (PDFs and images). This is also the o…☆2,595Updated 10 months ago
- Train Tesseract LSTM with GUI on Windows☆39Updated last year
- Library used to deskew a scanned document☆460Updated last week
- mupdf mirror☆2,047Updated this week
- Turn images of tables into CSV data. Detect tables from images and run OCR on the cells.☆520Updated 4 years ago
- ☆142Updated 4 years ago
- A Python tool to help extracting information from structured PDFs.☆403Updated last month
- CnOCR: Awesome Chinese/English OCR Python toolkits based on PyTorch. It comes with 20+ well-trained models for different application scen…☆3,530Updated 5 months ago
- 📄 Awesome OCR multiple programing languages toolkits based on ONNXRuntime, OpenVINO, PaddlePaddle and PyTorch.☆4,012Updated last week