leha-bot / PRLib
Pre-Recognize Library - library with algorithms for improving OCR quality.
☆104 Updated last year
Alternatives and similar repositories for PRLib:
Users who are interested in PRLib are comparing it to the libraries listed below.
- A semi-automatic open-source tool for Layout Analysis and Region EXtraction on early printed books. ☆181 Updated last month
- The deslanting algorithm sets text upright in images. Python, C++ and OpenCL implementations provided. ☆149 Updated 3 years ago
- Files and Scripts to run Tesseract 5 LSTM Training using fonts ☆79 Updated 2 years ago
- OCR evaluation brought to you by University of Alicante ☆67 Updated 2 years ago
- Collection of OCR-related python tools and wrappers from @OCR-D ☆121 Updated last week
- An application of high resolution GANs to dewarp images of perturbed documents ☆131 Updated 3 years ago
- Pretrained mixed models to be used with Calamari. ☆60 Updated 3 months ago
- Validate and transform various OCR file formats (hOCR, ALTO, PAGE, FineReader) ☆183 Updated 3 months ago
- OCR-D-compliant page segmentation ☆67 Updated 4 months ago
- Repository collecting all the submodules for the new PyTorch-based OCR System. ☆141 Updated 3 years ago
- Page to PAGE Layout Analysis Tool ☆191 Updated 3 years ago
- A selectional auto-encoder approach for document image binarization ☆102 Updated 2 years ago
- Document Scanner and Word Segmentation ☆121 Updated 4 years ago
- Document image dewarping library using a cubic sheet model ☆129 Updated this week
- Document Image Binarization ☆75 Updated 3 months ago
- Document Boundary & Canny Edge Detection using OpenCV ☆62 Updated 6 years ago
- Tool that does layout analysis and/or text recognition using tesseract and outputs the result in Page XML format ☆46 Updated 9 months ago
- Detect textlines in document images ☆91 Updated 7 months ago
- Perspective recovery of text using transformed ellipses ☆148 Updated 3 years ago
- OCR-D python tools ☆33 Updated 5 months ago
- Run tesseract with the tesserocr bindings with @OCR-D's interfaces ☆39 Updated 4 months ago
- Generic framework for historical document processing ☆373 Updated 3 years ago
- Pre-Recognition Library - library with algorithms for improving OCR quality. ☆34 Updated 3 years ago
- Text detection with mainly MSER and SWT ☆198 Updated 2 months ago
- Master repository which includes most other OCR-D repositories as submodules ☆72 Updated 3 months ago
- A collection of tools for cleaning up book scans. ☆137 Updated 2 years ago
- PAGE XML format collection for document image page content and more ☆67 Updated 3 years ago
- ☆16 Updated 3 years ago
- The hOCR Embedded OCR Workflow and Output Format ☆73 Updated 5 months ago
- A simple document layout analysis using Python-OpenCV ☆124 Updated 4 years ago