tmbarchive / ocropus
The OCRopus OCR System
☆11 Updated 10 years ago
Alternatives and similar repositories for ocropus:
Users who are interested in ocropus are comparing it to the libraries listed below.
- Converters for various file formats used for representing OCR ☆12 Updated 10 months ago
- Repository collecting all the submodules for the new PyTorch-based OCR System. ☆141 Updated 4 years ago
- ALTO XML schema - latest and all former versions ☆52 Updated 7 months ago
- The CIS OCR PostCorrectionTool ☆41 Updated 2 years ago
- OCR evaluation brought to you by University of Alicante ☆67 Updated 2 years ago
- ScanTailor Universal - a fork based on Enhanced+Featured+Master versions of ST ☆201 Updated 6 months ago
- guides and test data for OCR4all ☆30 Updated 2 years ago
- files and code related to the Early Modern OCR Project (eMOP) at the IDHMC ☆16 Updated 10 years ago
- Python-based research framework for developing, organizing, and deploying Deep Learning models powered by Tensorflow. ☆12 Updated 2 years ago
- Image processing and image analysis software. (Mirror of source) ☆20 Updated 13 years ago
- CVL/READ Modules including Basic Layout Analysis and Writer Identification/Retrieval ☆9 Updated 6 years ago
- An expandable and scalable OCR pipeline ☆87 Updated 7 years ago
- ☆27 Updated last year
- A semi-automatic open-source tool for Layout Analysis and Region EXtraction on early printed books. ☆185 Updated 3 months ago
- Java based viewer for PAGE XML files (layout + text content). Also supports ALTO XML, FineReader XML, and HOCR. ☆35 Updated last year
- Working with hOCR in Javascript ☆126 Updated 2 years ago
- View HOCR files with Mirador ☆27 Updated 7 years ago
- Create PDFs and plain text from hOCR documents ☆34 Updated 14 years ago
- Conversions between various OCR formats ☆74 Updated last year
- Part of eMOP: Franken+ tool for creating font training for Tesseract OCR engine from page images. ☆24 Updated 9 years ago
- Convert between Tesseract hOCR and ALTO XML using XSL stylesheets ☆55 Updated 7 months ago
- Pretrained mixed models to be used with Calamari. ☆61 Updated 5 months ago
- ☆9 Updated this week
- Specification of the @OCR-D technical architecture, interface definitions and data exchange format(s) ☆17 Updated this week
- A desktop wrapper for Mirador and its environment, allowing use of local images. ☆14 Updated 6 years ago
- Goobi viewer - Presentation software for digital libraries, museums, archives and galleries. Open Source. ☆21 Updated this week
- Update of the ISRI Analytic Tools for OCR Evaluation with UTF-8 support ☆57 Updated 3 years ago
- Validate and transform various OCR file formats (hOCR, ALTO, PAGE, FineReader) ☆187 Updated last month
- Kitodo.Presentation is a feature-rich framework for building a METS- or IIIF-based digital library. It is part of the Kitodo Digital Libr… ☆39 Updated this week
- Crop And Splice Segments (of scanned pages) ☆14 Updated 6 years ago