trioptimum / pdf-ocr-scoreLinks
naive shell script to gauge the OCR quality of a PDF
☆18Updated 5 months ago
Alternatives and similar repositories for pdf-ocr-score
Users that are interested in pdf-ocr-score are comparing it to the libraries listed below
Sorting:
- Simple utilities to analyze and manipulate MARC files☆10Updated 2 weeks ago
- Test cases for validating BagIt implementations☆11Updated 2 years ago
- Find possible host names in a source text☆53Updated 2 years ago
- Illustrations☆26Updated last year
- A command line utility for listing and searching snapshots in web archives☆16Updated last year
- ☆24Updated 3 months ago
- Static Site Generator for Viewing Web Archives (in WACZ) format☆27Updated 2 years ago
- A tool for creating and managing Mailbags, a package for preserving email using multiple preservation formats☆48Updated last week
- Collection of resources, papers, blog posts, and other documentation around working on and with Archivematica.☆21Updated last year
- Homebrew formulae for digital preservation tools☆34Updated 5 months ago
- Experimental proxy and wrapper for safely embedding Web Archives (warc, warc.gz, wacz) into web pages.☆35Updated 3 months ago
- Command line tool for digging into WARC files☆45Updated 3 weeks ago
- Web Archiving Course☆23Updated last year
- A framework for creating digital exhibits from a folder of files and a spreadsheet. See Readme below for instructions to get started!☆104Updated 2 months ago
- A font set based on the 10th-century Exeter Book script☆29Updated 3 years ago
- NARA digital preservation file format risk analysis and preservation plans☆228Updated last month
- Collaborative bibliography on 'experimental writing' and 'financial crisis' from the Mute archive - http://metamute.org/archive☆10Updated 2 years ago
- mirror a website, put it in a bag☆25Updated 2 years ago
- Archives for Black Lives in Philly was inspired by Jarrett Drake, Digital Archivist at Princeton University, and his work to end archives…☆18Updated 3 years ago
- ☆24Updated 2 years ago
- POD Aggregator, f.k.a. the POD Data Lake☆14Updated this week
- A static site generator for Mastodon exports☆50Updated 10 months ago
- Delightful Static Digital Library projects and resources☆34Updated 2 years ago
- A codebase to support a pure JSON search engine requiring no backend for any XHTML5 document collection☆59Updated last month
- Siegfried-based characterization tool for directories and disk images☆83Updated 7 months ago
- Download digitized books from Internet Archive and view with IIIF, locally and offline.☆40Updated last year
- A tool for collection archival slivers of the web and web archives☆14Updated 5 months ago
- 🗃️ Managing the safe, long-term storage of our digital collections in the cloud☆35Updated 3 months ago
- A browser extension to detect IIIF resources for Chrome and Firefox☆17Updated last year
- recursively deduplicate a directory and write its contents to a new directory while remembering the old paths☆49Updated 4 years ago