Fast PDF generation and compression. Deals with millions of pages daily.
☆141Mar 2, 2026Updated 3 months ago
Alternatives and similar repositories for archive-pdf-tools
Users that are interested in archive-pdf-tools are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Efficient hOCR tooling☆57Aug 18, 2025Updated 10 months ago
- ScanTailor Advanced is the version that merges the features of the ScanTailor Featured and ScanTailor Enhanced versions, brings new ones …☆304May 24, 2026Updated 3 weeks ago
- ☆24Dec 3, 2025Updated 6 months ago
- Docker for ScanTailor and ScanTailor Advanced☆14Mar 17, 2024Updated 2 years ago
- A Hypothes.is integration plugin for OJS☆12Mar 17, 2025Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Image Annotation Tool and Image Search☆17Apr 24, 2026Updated last month
- A Python library to add reconstructed pronunciations of Middle Chinese on Chinese texts☆11Mar 13, 2023Updated 3 years ago
- Misc iCE40 specific cores☆14Feb 13, 2023Updated 3 years ago
- Implementation of the Euclidean-Rhythms idea in the form of plugin☆14Apr 10, 2024Updated 2 years ago
- ☆16Jul 24, 2015Updated 10 years ago
- Convert ALTO XML to plain text + minimal metadata☆17Oct 17, 2024Updated last year
- Homebrew formula and App bundler for Scantailor (Advanced)☆181Jan 26, 2026Updated 4 months ago
- Mannheim library utilities☆27Dec 29, 2025Updated 5 months ago
- JBIG2 Encoder☆70Updated this week
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- CollectionBuilder-CSV is a "stand alone" template for creating digital collection and exhibit websites using Jekyll and a metadata CSV.☆41Updated this week
- Adds a reconciliation API endpoint to Datasette, based on the Reconciliation Service API specification.☆24Feb 2, 2024Updated 2 years ago
- Named Entity Recognition☆19Feb 13, 2026Updated 4 months ago
- A tool that democratizes and standardizes access to Web APIs.☆14Mar 2, 2023Updated 3 years ago
- API client for Aleph, supports bulk entity and document upload.☆30Mar 5, 2026Updated 3 months ago
- Raspberry Pi image for controlling a DIYBookScanner via spreads☆37Jun 12, 2015Updated 11 years ago
- Conversions between various OCR formats☆84Feb 13, 2026Updated 4 months ago
- Update of the ISRI Analytic Tools for OCR Evaluation with UTF-8 support☆60Apr 16, 2021Updated 5 years ago
- Orbtrace & supporting hardware circuit diagrams etc.☆26Oct 11, 2023Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Web application for transcribing OCR ground truth from Archive.org☆18Feb 22, 2018Updated 8 years ago
- ScanTailor Universal - a fork based on Enhanced+Featured+Master versions of ST☆258Apr 7, 2026Updated 2 months ago
- Hubcap is an autonomous AI agent in 25 lines of code: a small Autobot that you can't trust. *This is the Python fork/port* from https://g…☆22Nov 10, 2025Updated 7 months ago
- Automatic de-keystoning for single camera DIY book scanners.☆51Aug 15, 2020Updated 5 years ago
- tesseractXplore a tesseract ease of use gui with full control☆28Nov 10, 2021Updated 4 years ago
- Scan Tailor Experimental is an interactive post-processing tool for scanned pages.☆132May 4, 2026Updated last month
- A collection of Python scripts for interacting with ArchivesSpace using ArchivesSnake. See README for instructions on use and planned imp…☆10Jun 24, 2020Updated 5 years ago
- Lua binding for the lol-HTML rewriter/parser☆19Nov 14, 2020Updated 5 years ago
- Fix making Nexus 4 navigation buttons working after enigmatic hardware issue☆10Nov 3, 2015Updated 10 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Library for reading the Macintosh File System☆11May 7, 2016Updated 10 years ago
- Vue-based Web Component for creating narrative presentations of images and maps☆15May 1, 2025Updated last year
- Tools for manipulating and evaluating the hOCR format for representing multi-lingual OCR results by embedding them into HTML.☆413Aug 10, 2024Updated last year
- Validate and transform various OCR file formats (hOCR, ALTO, PAGE, FineReader)☆202May 21, 2025Updated last year
- Gamera 3 for Python 2 (deprecated)☆39Aug 15, 2022Updated 3 years ago
- Tools for working with book data☆20Nov 25, 2025Updated 6 months ago
- Document Layout Analysis☆405Updated this week