openpreserve / matchboxLinks
Image comparison QA tool for digital preservation workflows.
☆14Updated 10 years ago
Alternatives and similar repositories for matchbox
Users that are interested in matchbox are comparing it to the libraries listed below
Sorting:
- code to remove "noise" from hOCR output of Tesseract OCR.☆14Updated 8 years ago
- Format Identification for Digital Objects (FIDO) is a Python command-line tool to identify the file formats of digital objects. It is des…☆157Updated 4 months ago
- A Data Parsing/Data Manipulation Tool Supporting Digitization Projects and Other Data Analysis Projects☆46Updated 5 years ago
- WASAPI data transfer APIs☆45Updated 3 years ago
- A web application developed by Zepheira for the Library of Congress National Digital Information Infrastructure and Preservation Program …☆45Updated last year
- File Information Tool Set☆94Updated 4 months ago
- The International Image Interoperability Framework (IIIF) Audio/Visual (A/V) Technical Specification Group aims to extend to A/V the bene…☆14Updated 8 years ago
- Fcrepo4 webapp plus optional fcrepo dependencies☆13Updated 4 years ago
- All that entity matching, resolution, normalization, enhancement and reconciliation madness, but with a focus on data, not platforms.☆24Updated 3 years ago
- Validator for the Image API☆37Updated 7 months ago
- YAMZ - a crowdsourced metadata dictionary. Latest version at: https://github.com/metadata-research/yamz/☆13Updated 2 years ago
- Colors in Library of Congress digital images.☆32Updated 7 years ago
- NARA File Analyzer and Metadata Harvester☆108Updated 8 years ago
- The Archives Unleashed Toolkit is an open-source toolkit for analyzing web archives.☆144Updated last year
- "Old SFM" -- manage rules and streams from social data sources, starting with twitter.☆86Updated last year
- A queue-controlled browser automation tool for improving web crawl quality☆61Updated 4 months ago
- Docker image for the Archives Unleashed Toolkit☆12Updated 2 years ago
- Efficient indexing and retrieval of OCR bounding boxes in Solr☆22Updated 6 years ago
- Z39.50 toolkit for C☆44Updated last month
- Adds the ability to transcribe items using the Scripto library.☆17Updated 11 months ago
- The base class from which to create a CWRC-Writer XML editor.☆14Updated 2 years ago
- ☆41Updated 7 years ago
- a CLI suggestion tool for Wikidata entities☆30Updated 8 years ago
- File validation and characterisation.☆180Updated last week
- Experiments mining image collections using OpenCV☆64Updated 10 years ago
- New generation DH curation and visualization platform☆9Updated 9 months ago
- A Rails engine supporting the discovery of web archives.☆50Updated 2 years ago
- The DPLA Platform☆64Updated 6 years ago
- A desktop wrapper for Mirador and its environment, allowing use of local images.☆14Updated 6 years ago
- The Web Curator Tool is a tool for managing the selective web harvesting process. (moved from SourceForge). https://webcurator.slack.com …☆27Updated 2 years ago