tesseract-shadow / tesseract-ocr-compilation
Tesseract 4 OCR Compilation - Docker Container
☆53Updated 2 years ago
Related projects: ⓘ
- Tesseract 4 OCR Runtime Environment - Docker Container☆97Updated 5 years ago
- OCR evaluation brought to you by University of Alicante☆66Updated 2 years ago
- Files and Scripts to run Tesseract 5 LSTM Training using fonts☆76Updated 2 years ago
- A semi-automatic open-source tool for Layout Analysis and Region EXtraction on early printed books.☆179Updated last month
- Train Tesseract LSTM with make☆626Updated 3 months ago
- Tools for manipulating and evaluating the hOCR format for representing multi-lingual OCR results by embedding them into HTML.☆363Updated last month
- Various documents related to Tesseract OCR☆260Updated 3 years ago
- Pretrained mixed models to be used with Calamari.☆55Updated 3 years ago
- Detect and fix skew in images containing text☆260Updated 5 years ago
- Run tesseract with the tesserocr bindings with @OCR-D's interfaces☆38Updated last month
- Dockerized example to train Tesseract v. 4☆64Updated last year
- ☆16Updated 3 years ago
- An expandable and scalable OCR pipeline☆86Updated 6 years ago
- Page to PAGE Layout Analysis Tool☆190Updated 2 years ago
- Validate and transform various OCR file formats (hOCR, ALTO, PAGE, FineReader)☆176Updated last month
- An implementation of RESTful web service for tesseract-OCR using tornado☆134Updated last year
- Data used for LSTM model training☆115Updated 6 months ago
- Recognition Models for Kraken and CLSTM☆13Updated 5 years ago
- The module extracts text from image using the tesseract-OCR engine. Generally, text present in the images are blur or are of uneven sizes…☆148Updated 5 years ago
- Document Scanner and Word Segmentation☆117Updated 4 years ago
- The hOCR Embedded OCR Workflow and Output Format☆72Updated last month
- Double-checked Gold Standard Data for Training and Testing OCR Engines☆13Updated 7 years ago
- PAGE XML format collection for document image page content and more☆62Updated 3 years ago
- Repository collecting all the submodules for the new PyTorch-based OCR System.☆142Updated 3 years ago
- Update of the ISRI Analytic Tools for OCR Evaluation with UTF-8 support☆55Updated 3 years ago
- Pre-Recognize Library - library with algorithms for improving OCR quality.☆101Updated last year
- ☆23Updated this week
- User contributed (non Google) OCR models for Tesseract☆19Updated last year
- Tesseract documentation☆75Updated 3 years ago
- The deslanting algorithm sets text upright in images. Python, C++ and OpenCL implementations provided.☆144Updated 2 years ago