rescribe / bookpipelineView external linksLinks
Tools to process books in a cloud based pipeline system
☆65Dec 4, 2025Updated 2 months ago
Alternatives and similar repositories for bookpipeline
Users that are interested in bookpipeline are comparing it to the libraries listed below
Sorting:
- Self hosting code for Recogito-Studio☆20Oct 16, 2025Updated 4 months ago
- Make a searchable pdf via Google Cloud Vision OCR☆14Jan 17, 2020Updated 6 years ago
- A static site generator for TEI Publisher☆13Mar 8, 2022Updated 3 years ago
- Validate and transform various OCR file formats (hOCR, ALTO, PAGE, FineReader)☆199May 21, 2025Updated 8 months ago
- Umbrella repository that describes the collections contained in any given release of ELTeC☆13Jan 26, 2022Updated 4 years ago
- Tools for working with CSV files☆17Sep 19, 2012Updated 13 years ago
- A VUE IIIF viewer☆14Dec 14, 2025Updated 2 months ago
- Wrapper for the kraken OCR engine☆13Jul 12, 2025Updated 7 months ago
- Command Line Interface for running 🤗 Transformers Image Classification locally☆19May 8, 2025Updated 9 months ago
- Greek texts (eventually) with linguistic annotation (for Greek Learner Texts Project)☆15Jun 16, 2023Updated 2 years ago
- Convert PAGE (v. 2019) to ALTO (v. 2.0 - 4.2)☆14Jan 20, 2026Updated 3 weeks ago
- Given the URL to a public JSON document in an International Image Interoperability Framework (IIIF) image server, this script will downlo…☆16Sep 6, 2022Updated 3 years ago
- Documentation and use cases for ALTO XML☆42Sep 10, 2018Updated 7 years ago
- File detector, metadata collector and well-formedness checker tool☆18Feb 3, 2026Updated last week
- Reading mdict files, support MDX/MDD file formats.☆18Feb 3, 2026Updated last week
- ALTO XML schema - latest and all former versions☆55Jan 20, 2026Updated 3 weeks ago
- Note: the repo has been moved to https://gitlab.com/readcoop/Transkribus/TranskribusSwtGui☆18Oct 27, 2020Updated 5 years ago
- Goobi viewer - Presentation software for digital libraries, museums, archives and galleries. Open Source.☆25Feb 6, 2026Updated last week
- Convert between Tesseract hOCR and ALTO XML using XSL stylesheets☆59Sep 25, 2025Updated 4 months ago
- Tool to create Submission Information Packages (SIP)☆24Jan 29, 2026Updated 2 weeks ago
- IIIF experiments with Gallica content☆32Nov 16, 2025Updated 3 months ago
- zramdrive bind util to move any directory to zram☆28Apr 20, 2019Updated 6 years ago
- A standalone React/Redux web application for for presenting unique printed books and manuscripts in digital facsimile.☆31Mar 10, 2023Updated 2 years ago
- Master repository which includes most other OCR-D repositories as submodules☆72Jul 4, 2025Updated 7 months ago
- Disable Target API Block☆26Oct 18, 2025Updated 3 months ago
- ☆10Feb 9, 2026Updated last week
- 【Android 11-13】为移动热点设置静态 IP☆10Mar 5, 2024Updated last year
- spaCy-compatible sm/md/lg/trf core models for Latin, i.e pipeline with POS tagger, morphologizer, lemmatizer, dependency parser, and NER☆12Aug 26, 2025Updated 5 months ago
- ☆12Aug 24, 2022Updated 3 years ago
- European Parliament website Python scraper☆12Oct 19, 2016Updated 9 years ago
- Transkriptionen von Fibeln (19. Jahrhundert)☆11Oct 31, 2025Updated 3 months ago
- new reading environment for version 5.0 of the Perseus Digital Library☆93Updated this week
- Command-line tile downloader/assembler for IIIF endpoints/manifests☆35Jul 14, 2021Updated 4 years ago
- guides and test data for OCR4all☆32Oct 4, 2022Updated 3 years ago
- Conversions between various OCR formats☆82May 13, 2023Updated 2 years ago
- QT4 specifications☆38Feb 9, 2026Updated last week
- Document Image Binarization☆79Oct 17, 2024Updated last year
- Automatic alignment of books between HathiTrust, Internet Archive, Google Books, etc.☆36Feb 9, 2026Updated last week
- OCR-D python tools☆33Aug 16, 2024Updated last year