Scripts to create git repositories for ALTO XML texts, like those from the British Library's scanned documents.
☆31Nov 3, 2017Updated 8 years ago
Alternatives and similar repositories for git-lit
Users that are interested in git-lit are comparing it to the libraries listed below
Sorting:
- Manuals, lexica, OCR test data for PoCoTo and the profiler☆15Jul 2, 2021Updated 4 years ago
- Stand-alone implementation of UCD's IIIF image re-formatting tool + plugin to integrate with Mirador IIIF-compliant image viewer☆18Jul 31, 2017Updated 8 years ago
- OCRopus model for Gothic print (Fraktur)☆19Feb 16, 2020Updated 6 years ago
- Development version of ndlstm, multidimensional LSTMs for TensorFlow☆19Feb 20, 2018Updated 8 years ago
- Text-Induced Corpus Clean-up☆20Jun 20, 2023Updated 2 years ago
- An expandable and scalable OCR pipeline☆89Nov 14, 2017Updated 8 years ago
- QA-tool for scans with corresponding ALTO-files☆26Dec 2, 2022Updated 3 years ago
- Convert ALTO XML to plain text + minimal metadata☆17Oct 17, 2024Updated last year
- ☆11Feb 13, 2026Updated 2 weeks ago
- Tools for TICCL☆14Dec 12, 2025Updated 2 months ago
- Check your modified Ground Truth files with visual support!☆10Jan 31, 2024Updated 2 years ago
- Augment line images for improving OCR datasets☆10Oct 4, 2023Updated 2 years ago
- The Open Attribute project's chrome extension☆18Jan 22, 2015Updated 11 years ago
- Docker container for ocropus3 OCR system☆12Aug 19, 2018Updated 7 years ago
- ☆10Mar 16, 2023Updated 2 years ago
- Named Entity Recognition tool for Europeana Newspapers☆14Apr 5, 2018Updated 7 years ago
- code to remove "noise" from hOCR output of Tesseract OCR.☆14Oct 24, 2016Updated 9 years ago
- Crop And Splice Segments (of scanned pages)☆14Mar 11, 2019Updated 6 years ago
- OCR-D post-correction with encoder-attention-decoder LSTMs☆13May 1, 2025Updated 10 months ago
- Documentation for the GITenberg books project☆29Oct 31, 2018Updated 7 years ago
- Rails application supporting the creation of OCR and the IIIF Content Search API☆34Dec 14, 2022Updated 3 years ago
- InfiniteUlysses.com repo as it was when I finished the related Ph.D. project. See instead github.com/amandavisconti/infinite-ulysses-publ…☆26Mar 15, 2022Updated 3 years ago
- Polytonic Greek OCR tool suite based on Ocropus 0.7☆13Jul 5, 2023Updated 2 years ago
- ☆13May 16, 2019Updated 6 years ago
- Data and code for analyzing language associated with fictional characters.☆15Jan 6, 2018Updated 8 years ago
- BibJSON spec and website☆20Mar 19, 2015Updated 10 years ago
- React components for rendering Data Package Views.☆16Dec 10, 2022Updated 3 years ago
- Legal Code for the State of Utah☆44Apr 8, 2014Updated 11 years ago
- An Editor for creating simple or complex OCR workflows☆17Jun 13, 2024Updated last year
- ☆20Aug 18, 2019Updated 6 years ago
- A Jekyll-based static site generator for archival description in JSON.☆34Sep 18, 2025Updated 5 months ago
- In-gallery iPad experience☆18Feb 24, 2016Updated 10 years ago
- ☆16Feb 23, 2015Updated 11 years ago
- A compile-to-JSON data pipeline scripting language [ DEPRECATED - More info on active projects and modules at https://dat-ecosystem.org/ …☆43Jan 6, 2022Updated 4 years ago
- Efficient hOCR tooling☆55Aug 18, 2025Updated 6 months ago
- Web application for transcribing OCR ground truth from Archive.org☆17Feb 22, 2018Updated 8 years ago
- Histonets is an application to convert images of scanned maps into digital networks☆20Oct 16, 2017Updated 8 years ago
- Presentations, tutorials and data for the OCR workshop at LMU☆16Jun 2, 2017Updated 8 years ago
- Double-checked Gold Standard Data for Training and Testing OCR Engines☆21Dec 31, 2022Updated 3 years ago