UB-Mannheim / tesseract
Tesseract Open Source OCR Engine (main repository)
☆3,038 Updated last week
Related projects:
- Best (most accurate) trained LSTM models. ☆1,212 Updated 6 months ago
- Tesseract documentation ☆1,759 Updated this week
- Trained models with fast variant of the "best" LSTM models + legacy models ☆6,294 Updated 6 months ago
- A Python wrapper for Google Tesseract ☆5,769 Updated last month
- A python module that wraps the pdftoppm utility to convert PDF to PIL Image object ☆1,591 Updated last month
- Fast integer versions of trained LSTM models ☆472 Updated last month
- Train Tesseract LSTM with make ☆626 Updated 3 months ago
- A Gtk/Qt front-end to tesseract-ocr. ☆1,604 Updated 3 weeks ago
- Download Poppler binaries packaged for Windows with dependencies ☆524 Updated last month
- Leptonica is an open source library containing software that is broadly useful for image processing and image analysis applications. The … ☆1,759 Updated 3 weeks ago
- A Python wrapper for the tesseract-ocr API ☆1,990 Updated 3 weeks ago
- A pure-python PDF library capable of splitting, merging, cropping, and transforming the pages of PDF files ☆8,068 Updated this week
- Community maintained fork of pdfminer - we fathom PDF ☆5,820 Updated last month
- Source training data for Tesseract for lots of languages ☆833 Updated 6 months ago
- A packaged and flexible version of the CRAFT text detector and Keras CRNN recognition model. ☆1,386 Updated last month
- A cross-platform GUI automation Python module for human beings. Used to programmatically control the mouse & keyboard. ☆10,172 Updated last month
- Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and … ☆23,729 Updated last month
- Simple wrapper of tabula-java: extract table from PDF into pandas DataFrame ☆2,148 Updated 2 weeks ago
- Automated CI toolchain to produce precompiled opencv-python, opencv-python-headless, opencv-contrib-python and opencv-contrib-python-head… ☆4,438 Updated last month
- Python bindings for FFmpeg - with complex filtering support ☆9,881 Updated last month
- Line based ATR Engine based on OCRopy ☆1,037 Updated last month
- Python for Windows (pywin32) Extensions ☆4,987 Updated this week
- Python PDF Parser (Not actively maintained). Check out pdfminer.six. ☆5,247 Updated last year
- A Python library for reading and writing PDF, powered by QPDF ☆2,135 Updated this week
- Links to awesome OCR projects ☆2,752 Updated 2 months ago
- CnOCR: Awesome Chinese/English OCR Python toolkits based on PyTorch. It comes with 20+ well-trained models for different application scen… ☆3,170 Updated 2 months ago
- OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched ☆13,650 Updated this week
- A Python Perceptual Image Hashing Module ☆3,124 Updated 3 months ago
- PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents. ☆5,097 Updated this week
- WebDriver for Firefox ☆7,120 Updated last month