Tools to process books in a cloud based pipeline system
☆65May 28, 2026Updated 2 weeks ago
Alternatives and similar repositories for bookpipeline
Users that are interested in bookpipeline are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A VUE IIIF viewer☆15Jun 5, 2026Updated last week
- Selected code and data for The Online Books Page and related applications☆11Jun 1, 2026Updated last week
- ☆24Updated this week
- Command Line Interface for running 🤗 Transformers Image Classification locally☆19Jun 3, 2026Updated last week
- Tools for working with CSV files☆17Sep 19, 2012Updated 13 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- File detector, metadata collector and well-formedness checker tool☆18Jun 3, 2026Updated last week
- Wrapper for the kraken OCR engine☆12Jul 12, 2025Updated 11 months ago
- Documentation and use cases for ALTO XML☆42Sep 10, 2018Updated 7 years ago
- Convert PAGE (v. 2019) to ALTO (v. 2.0 - 4.2)☆17Updated this week
- Pre-Ingest Tool for creating submission information packages☆24Sep 13, 2024Updated last year
- ALTO XML schema - latest and all former versions☆55May 29, 2026Updated 2 weeks ago
- Make a searchable pdf via Google Cloud Vision OCR☆14Jan 17, 2020Updated 6 years ago
- In-broswer OCR editing program that transforms OCR results into structured, citable TEI. No XML experience required!☆30Sep 9, 2021Updated 4 years ago
- A static site generator for TEI Publisher☆13Mar 8, 2022Updated 4 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Reading mdict files, support MDX/MDD file formats.☆18Feb 3, 2026Updated 4 months ago
- Convert between Tesseract hOCR and ALTO XML using XSL stylesheets☆60Mar 20, 2026Updated 2 months ago
- Note: the repo has been moved to https://gitlab.com/readcoop/Transkribus/TranskribusSwtGui☆18Oct 27, 2020Updated 5 years ago
- Goobi viewer - Presentation software for digital libraries, museums, archives and galleries. Open Source.☆27Updated this week
- Process, enhance and evaluate multiple OCR output.☆24Dec 2, 2025Updated 6 months ago
- Docker setup for OCR4all bundled with Larex☆22Jan 29, 2024Updated 2 years ago
- IIIF experiments with Gallica content☆32Nov 16, 2025Updated 6 months ago
- Umbrella repository that describes the collections contained in any given release of ELTeC☆13Jan 26, 2022Updated 4 years ago
- Command line tool to convert page layout files to the latest PAGE XML format. It supports all previous versions of the PAGE format as wel…☆24Jan 30, 2021Updated 5 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- The Weather Map is a visual model inspired by the synoptic weather charts for the map of controversies.☆11Feb 7, 2024Updated 2 years ago
- Transkriptionen von Fibeln (19. Jahrhundert)☆11Oct 31, 2025Updated 7 months ago
- Share a view of a IIIF document with a short link☆14May 24, 2026Updated 2 weeks ago
- IIIF Timeliner☆11Feb 3, 2026Updated 4 months ago
- ☆18Oct 9, 2018Updated 7 years ago
- a Mirador 3 plugin that adds annotation creation tools to the user interface☆43Feb 12, 2026Updated 3 months ago
- Generate a IIIF manifest for a Wikipedia entry☆10Jun 7, 2018Updated 8 years ago
- ☆10Aug 5, 2019Updated 6 years ago
- An extension for eXist-db that allows the reading and writing of MARC into and out from the database☆11Mar 6, 2016Updated 10 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- A Flask web app that integrates Tesseract OCR to extract text from image files.☆10May 14, 2023Updated 3 years ago
- Java command line tool to convert PAGE XML files with layout and text content to PDF☆10Apr 27, 2020Updated 6 years ago
- ☆10Updated this week
- A basic editor for samvera objects.☆10Feb 4, 2026Updated 4 months ago
- An open-source, browser-based front-end application for the collection of complex structured data from textual resources in history and t…☆17Jun 5, 2026Updated last week
- Master repository which includes most other OCR-D repositories as submodules☆73Jul 4, 2025Updated 11 months ago
- CVSNT-to-Git conversion utility☆13Feb 23, 2020Updated 6 years ago