Crop And Splice Segments (of scanned pages)
☆14Mar 11, 2019Updated 7 years ago
Alternatives and similar repositories for crass
Users that are interested in crass are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Process, enhance and evaluate multiple OCR output.☆24Dec 2, 2025Updated 3 months ago
- Check your modified Ground Truth files with visual support!☆10Jan 31, 2024Updated 2 years ago
- OCRopus model for Gothic print (Fraktur)☆19Feb 16, 2020Updated 6 years ago
- Some bits of javascript to transcribe scanned pages using PageXML☆17Mar 18, 2024Updated 2 years ago
- Convert Transkribus PAGE-XML to standard PAGE-XML☆12Dec 10, 2025Updated 3 months ago
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- tesseractXplore a tesseract ease of use gui with full control☆28Nov 10, 2021Updated 4 years ago
- IIIF Examples and useful code☆20Sep 10, 2025Updated 6 months ago
- Small collection of PAGE XML related scripts used at the ZPD Würzburg☆12Aug 2, 2024Updated last year
- View HOCR files with Mirador☆29Sep 27, 2017Updated 8 years ago
- Ergonomic line-by-line transcription of scanned text.☆54Feb 2, 2026Updated last month
- JS for overlaying OCR on image using HOCR formatted HTML☆26Jul 30, 2016Updated 9 years ago
- ☆14Jul 11, 2022Updated 3 years ago
- Linked Data Rendering for humans☆17Oct 4, 2022Updated 3 years ago
- NewsEye / READ OCR training dataset from Austrian Newspapers (1864–1911)☆18Oct 31, 2025Updated 4 months ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- 'lat' repository, forked from https://github.com/ryanfb/ancientgreekocr-grc. The final training process for lat.traineddata☆13Jan 13, 2016Updated 10 years ago
- This repository provides German documentation relating to the text recognition and transcription platform eScriptorium. The documentation…☆14Dec 6, 2025Updated 3 months ago
- Python tools for performing various operations on ALTO XML files☆49Feb 27, 2025Updated last year
- Japanese trained data of clstm☆15Jun 6, 2016Updated 9 years ago
- Python tools for Tesseract OCR training☆27May 2, 2022Updated 3 years ago
- Mannheim library utilities☆27Dec 29, 2025Updated 2 months ago
- The base class from which to create a CWRC-Writer XML editor.☆14Apr 18, 2023Updated 2 years ago
- LINKED DATA QUALITY REPORTS☆41May 20, 2022Updated 3 years ago
- Transkriptionen von Fibeln (19. Jahrhundert)☆11Oct 31, 2025Updated 4 months ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Command-line client for the DataCite Metadata Store (MDS)☆18Mar 9, 2021Updated 5 years ago
- An extensible viewer for OCR-D mets.xml files☆23May 30, 2024Updated last year
- Convert between Tesseract hOCR and ALTO XML using XSL stylesheets☆59Sep 25, 2025Updated 6 months ago
- Page-wise text recognition with lower-supervision line data models☆52Mar 11, 2026Updated 2 weeks ago
- QA-tool for scans with corresponding ALTO-files☆26Dec 2, 2022Updated 3 years ago
- Text-Induced Corpus Clean-up☆20Jun 20, 2023Updated 2 years ago
- A specification for a jsonld nodejs stream☆15Oct 13, 2018Updated 7 years ago
- NLP-helper for OCR-ed pages in PAGE XML format☆10Dec 6, 2024Updated last year
- A MongoDB implementation of the W3C Web Annotation Protocol☆18Jun 3, 2022Updated 3 years ago
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- ☆10Jan 22, 2023Updated 3 years ago
- Specification of the @OCR-D technical architecture, interface definitions and data exchange format(s)☆17Sep 18, 2025Updated 6 months ago
- ☆10Mar 16, 2023Updated 3 years ago
- Transforming SemRep Predications into an Open Biomedical Linked Data Resource☆11Jan 26, 2018Updated 8 years ago
- End-2-end multi-label classification in python☆32Nov 21, 2022Updated 3 years ago
- A pythonic Linked Data Notifications (LDN) receiver☆14Jan 31, 2019Updated 7 years ago
- Master repository which includes most other OCR-D repositories as submodules☆72Jul 4, 2025Updated 8 months ago