buliasz / tesstrain-windows-gui
Train Tesseract LSTM with GUI on Windows
☆33 · Updated 6 months ago
Related projects:
- Train Tesseract LSTM with make ☆626 · Updated 3 months ago
- Files and Scripts to run Tesseract 5 LSTM Training using fonts ☆76 · Updated 2 years ago
- Tutorial on how to deskew (straighten) text images ☆50 · Updated 2 years ago
- Document image dewarping library using a cubic sheet model ☆98 · Updated this week
- Train Tesseract LSTM with tesstrain.sh on Windows ☆24 · Updated 8 months ago
- Detect and read handwritten words on scanned pages. ☆99 · Updated last year
- Pretrained mixed models to be used with Calamari. ☆55 · Updated 3 years ago
- Dockerized example to train Tesseract v. 4 ☆64 · Updated last year
- Source code for the paper "Post-OCR Document Correction with Large Ensembles of Character Sequence-to-Sequence Models" ☆34 · Updated 9 months ago
- Master repository which includes most other OCR-D repositories as submodules ☆71 · Updated last month
- OCR evaluation brought to you by University of Alicante ☆66 · Updated 2 years ago
- ICIP 2022: Adaptive Radial Projection on Fourier Magnitude Spectrum for Document Image Skew Estimation ☆120 · Updated 2 months ago
- Detect textlines in document images ☆88 · Updated 3 months ago
- Repository mentioned in https://youtu.be/KE4xEzFGSU8 ☆28 · Updated last year
- ☆8 · Updated 4 years ago
- Convert a PDF via OCR to a TXT file in UTF-8 encoding ☆137 · Updated 11 months ago
- Collection of OCR-related python tools and wrappers from @OCR-D ☆118 · Updated this week
- Detect handwritten words (neural network based). ☆64 · Updated 2 years ago
- Restful API Wrapper for EasyOCR ☆34 · Updated 3 years ago
- A repository for training Tesseract OCR easily using scripts. ☆9 · Updated 4 years ago
- This repository contains a notebook to demonstrate the power of the Document Text Recognition (DocTR) library ☆12 · Updated 3 years ago
- Tools for extracting figures, tables, text, ... from a PDF document. ☆32 · Updated 3 years ago
- Python library to extract tabular data from images and scanned PDFs ☆255 · Updated last month
- OCR-D python tools ☆33 · Updated last month
- A semi-automatic open-source tool for Layout Analysis and Region EXtraction on early printed books. ☆179 · Updated last month
- Tool for enhancing noisy scanned text images ☆47 · Updated 4 years ago
- OCR engine for all the languages ☆717 · Updated this week
- Powerful handwritten text recognition. A simple-to-use, unofficial implementation of the paper "TrOCR: Transformer-based Optical Characte… ☆176 · Updated last year
- A function that takes as input a cropped text line image, and outputs the dewarped image. ☆15 · Updated 2 weeks ago
- Document Layout Analysis ☆335 · Updated this week