Tools to process books in a cloud based pipeline system
☆64May 28, 2026Updated last month
Alternatives and similar repositories for bookpipeline
Users that are interested in bookpipeline are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Self hosting code for Recogito-Studio☆23Apr 13, 2026Updated 2 months ago
- Selected code and data for The Online Books Page and related applications☆11Jun 1, 2026Updated last month
- ☆24Jun 9, 2026Updated 3 weeks ago
- Validate and transform various OCR file formats (hOCR, ALTO, PAGE, FineReader)☆203May 21, 2025Updated last year
- Tools for working with CSV files☆17Sep 19, 2012Updated 13 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- File detector, metadata collector and well-formedness checker tool☆18Jun 3, 2026Updated last month
- Convert PAGE (v. 2019) to ALTO (v. 2.0 - 4.2)☆17Jun 5, 2026Updated 3 weeks ago
- Given the URL to a public JSON document in an International Image Interoperability Framework (IIIF) image server, this script will downlo…☆17Sep 6, 2022Updated 3 years ago
- Pre-Ingest Tool for creating submission information packages☆24Sep 13, 2024Updated last year
- ALTO XML schema - latest and all former versions☆55May 29, 2026Updated last month
- Make a searchable pdf via Google Cloud Vision OCR☆14Jan 17, 2020Updated 6 years ago
- In-broswer OCR editing program that transforms OCR results into structured, citable TEI. No XML experience required!☆31Sep 9, 2021Updated 4 years ago
- A static site generator for TEI Publisher☆13Mar 8, 2022Updated 4 years ago
- Reading mdict files, support MDX/MDD file formats.☆18Feb 3, 2026Updated 5 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Convert between Tesseract hOCR and ALTO XML using XSL stylesheets☆60Mar 20, 2026Updated 3 months ago
- Goobi workflow - Workflow management software for digitisation projects used in more than 80 cultural heritage institutions in at least 1…☆64Updated this week
- Decodes Compact Disc data from microscope images of a CD's surface☆12Jan 14, 2023Updated 3 years ago
- Goobi viewer - Presentation software for digital libraries, museums, archives and galleries. Open Source.☆27Jun 25, 2026Updated last week
- Process, enhance and evaluate multiple OCR output.☆24Dec 2, 2025Updated 7 months ago
- Digitization information system build on top of Fedora repository☆16Jan 15, 2019Updated 7 years ago
- Docker setup for OCR4all bundled with Larex☆22Jan 29, 2024Updated 2 years ago
- IIIF experiments with Gallica content☆32Nov 16, 2025Updated 7 months ago
- Umbrella repository that describes the collections contained in any given release of ELTeC☆13Jan 26, 2022Updated 4 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- A semantic image annotation tool for researchers, digital humanists and cultural heritage professionals.☆63Jun 26, 2026Updated last week
- Transkriptionen von Fibeln (19. Jahrhundert)☆11Oct 31, 2025Updated 8 months ago
- Share a view of a IIIF document with a short link☆14May 24, 2026Updated last month
- IIIF Timeliner☆11Feb 3, 2026Updated 5 months ago
- ☆18Oct 9, 2018Updated 7 years ago
- a Mirador 3 plugin that adds annotation creation tools to the user interface☆44Feb 12, 2026Updated 4 months ago
- A IIIF static tile and manifest generator built using Python to generate IIIF tiled images and manifests. This application was put toget…☆10Jun 26, 2026Updated last week
- An Unofficial, Fanmade Build Creator/Planner for Cyberpunk 2077☆13Mar 15, 2024Updated 2 years ago
- Check your modified Ground Truth files with visual support!☆10Jan 31, 2024Updated 2 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- [DEPRECATED] A blacklight application using SPARQL to replace Solr☆12Mar 27, 2020Updated 6 years ago
- An extension for eXist-db that allows the reading and writing of MARC into and out from the database☆11Mar 6, 2016Updated 10 years ago
- Builds a Simple Archive Format package from files and a spreadsheet☆48Apr 27, 2023Updated 3 years ago
- A Flask web app that integrates Tesseract OCR to extract text from image files.☆10May 14, 2023Updated 3 years ago
- Java command line tool to convert PAGE XML files with layout and text content to PDF☆10Apr 27, 2020Updated 6 years ago
- ☆10Updated this week
- A basic editor for samvera objects.☆10Feb 4, 2026Updated 5 months ago