gerwin3 / autocroppy
Autocroppy is an improvement on traditional autocrop tools. Designed to remove black borders of scanned documents or old slides it stands out on twisted pictures or pictures that have rounded corners.
☆15Updated 11 years ago
Alternatives and similar repositories for autocroppy:
Users that are interested in autocroppy are comparing it to the libraries listed below
- ☆7Updated 5 years ago
- Java based viewer for PAGE XML files (layout + text content). Also supports ALTO XML, FineReader XML, and HOCR.☆35Updated last year
- Validate and transform various OCR file formats (hOCR, ALTO, PAGE, FineReader)☆186Updated 2 weeks ago
- Convert between Tesseract hOCR and ALTO XML using XSL stylesheets☆55Updated 7 months ago
- A repository for online OCRD training infrastructure.☆13Updated 4 years ago
- guides and test data for OCR4all☆30Updated 2 years ago
- Tool for editing MEI header data☆20Updated 3 years ago
- PAGE XML format collection for document image page content and more☆67Updated 3 years ago
- Master repository which includes most other OCR-D repositories as submodules☆72Updated last week
- Image Annotation Tool and Image Search☆14Updated 2 weeks ago
- ☆31Updated last month
- eComparatio: text diff and support for digital edition☆23Updated 4 years ago
- Conversions between various OCR formats☆74Updated last year
- ☆57Updated last week
- Documentation and use cases for ALTO XML☆41Updated 6 years ago
- Pretrained mixed models to be used with Calamari.☆60Updated 4 months ago
- Converters for various file formats used for representing OCR☆12Updated 10 months ago
- (MIRROR of https://gitlab.com/vgg/vise) VGG Image Search Engine (VISE) is a standalone software for visual search of large image collecti…☆46Updated 5 months ago
- A Pythonic API and some command line tools to access the Transkribus server via its REST API☆27Updated 2 years ago
- A semi-automatic open-source tool for Layout Analysis and Region EXtraction on early printed books.☆184Updated 2 months ago
- Command line tool to convert page layout files to the latest PAGE XML format. It supports all previous versions of the PAGE format as wel…☆23Updated 4 years ago
- Training files produced for and by the Tesseract OCR engine for work on the Early Modern OCR Project (eMOP)☆36Updated 9 years ago
- A selection of test lines of several early printed books as well as the corresponding individual OCRopus models and mixed models.☆10Updated 7 years ago
- ☆24Updated last week
- Ground Truth Resources for the HTR of patrimonial documents☆40Updated this week
- The CIS OCR PostCorrectionTool☆41Updated 2 years ago
- Update of the ISRI Analytic Tools for OCR Evaluation with UTF-8 support☆57Updated 3 years ago
- XSLT Quality☆25Updated 3 months ago
- ☆20Updated 5 years ago
- A plugin that provides support for working with Digital Facsimiles in Text Encoding Initiative (TEI) vocabulary. The plugin contribute…☆25Updated 3 years ago