openpreserve / matchbox
Image comparison QA tool for digital preservation workflows.
☆14 Updated 11 years ago
Alternatives and similar repositories for matchbox
Users who are interested in matchbox are comparing it to the repositories listed below:
- Format Identification for Digital Objects (FIDO) is a Python command-line tool to identify the file formats of digital objects. It is des… ☆159 Updated 2 weeks ago
- code to remove "noise" from hOCR output of Tesseract OCR. ☆14 Updated 9 years ago
- The International Image Interoperability Framework (IIIF) Audio/Visual (A/V) Technical Specification Group aims to extend to A/V the bene… ☆14 Updated 8 years ago
- NARA File Analyzer and Metadata Harvester ☆112 Updated 9 years ago
- A Data Parsing/Data Manipulation Tool Supporting Digitization Projects and Other Data Analysis Projects ☆45 Updated 6 years ago
- "Old SFM" -- manage rules and streams from social data sources, starting with twitter. ☆86 Updated 2 years ago
- Colors in Library of Congress digital images. ☆32 Updated 8 years ago
- File Information Tool Set ☆101 Updated last month
- Experiments mining image collections using OpenCV ☆64 Updated 10 years ago
- A web application developed by Zepheira for the Library of Congress National Digital Information Infrastructure and Preservation Program … ☆45 Updated last year
- ☆42 Updated 7 years ago
- Manuals, lexica, OCR test data for PoCoTo and the profiler ☆15 Updated 4 years ago
- Convert between Tesseract hOCR and ALTO XML using XSL stylesheets ☆59 Updated 4 months ago
- YAMZ - a crowdsourced metadata dictionary. Latest version at: https://github.com/metadata-research/yamz/ ☆13 Updated 3 years ago
- Validator for the Image API ☆38 Updated 4 months ago
- DPF Manager: Digital Preservation Formats Manager (Image files) ☆33 Updated 2 years ago
- Open ONI (Open Online Newspaper Initiative) Django web app ☆52 Updated 10 months ago
- The Canadian Writing Research Collaboratory (CWRC) is developing an in-browser text markup editor (CWRCWriter) for use by collaborative s… ☆24 Updated 7 years ago
- IIIF Image API reference implementation and Python library ☆57 Updated 4 years ago
- Python package for harvesting records from OAI-PMH provider(s). ☆64 Updated 3 years ago
- A queue-controlled browser automation tool for improving web crawl quality ☆64 Updated 5 months ago
- Part of eMOP: Franken+ tool for creating font training for Tesseract OCR engine from page images. ☆24 Updated 10 years ago
- Sickle: OAI-PMH for Humans ☆115 Updated 2 years ago
- ☆18 Updated 7 years ago
- Docker image for the Archives Unleashed Toolkit ☆12 Updated 3 years ago
- Adds the ability to transcribe items using the Scripto library. ☆17 Updated 5 months ago
- Aliada tool implementation ☆35 Updated 8 years ago
- The Bagger application packages data files according to the BagIt specification. ☆136 Updated 3 years ago
- OAI-PMH plugin for Solr ☆23 Updated 4 years ago
- JP2 (JPEG 2000 Part 1) validator and properties extractor. Jpylyzer was specifically created to check that a JP2 file really conforms to … ☆78 Updated 2 months ago