openpreserve / format-corpus
An openly-licensed corpus of small example files, covering a wide range of formats and creation tools.
☆182 Updated 9 months ago
Related projects:
- Format Identification for Digital Objects (FIDO) is a Python command-line tool to identify the file formats of digital objects. It is des… ☆145 Updated last week
- Corpus of Digital Camera files. ☆16 Updated 2 months ago
- veraPDF test corpus for ISO 19005 (PDF/A) and ISO 14289 (PDF/UA) ☆70 Updated 3 weeks ago
- A persistent repository for PRONOM Research Week activities ☆11 Updated 3 years ago
- JP2 (JPEG 2000 Part 1) validator and properties extractor. Jpylyzer was specifically created to check that a JP2 file really conforms to … ☆69 Updated 5 months ago
- Integrate handcrafted binary and documentation ☆37 Updated 5 months ago
- This software (prototype) extracts values of Excel spreadsheet properties and calculates a tentative spreadsheet complexity assessment ba… ☆12 Updated last year
- Engine for analysis of Siegfried export files and DROID CSV. The tool has three purposes, break the export into its components and store … ☆23 Updated 4 months ago
- signature-based file format identification ☆223 Updated last month
- Test files for conformance testing and benchmarking Jpylyzer. ☆13 Updated 5 months ago
- Loader software for automated imaging of optical media with Nimbie disc robot ☆33 Updated 2 weeks ago
- File validation and characterisation. ☆168 Updated last week
- Auto-generated static web site digipres.org ☆26 Updated this week
- An index of PDF-centric corpora ☆97 Updated 2 months ago
- RE-lab is a joint effort of gimp.ru team and developers of various open source projects to do clean-room reverse engineering of various p… ☆76 Updated 3 years ago
- Carefully curated list of awesome digital preservation resources. ☆67 Updated last week
- Siegfried-based characterization tool for directories and disk images ☆82 Updated 10 months ago
- Single server/laptop grade file-observatory ☆10 Updated last year
- Nanite - a friendly swarm of format-identifying robots. ☆15 Updated 5 months ago
- DROID (Digital Record and Object Identification) ☆275 Updated 3 weeks ago
- File Information Tool Set ☆91 Updated last month
- DEPRECATED. Replaced with Electron desktop application: https://github.com/bulk-reviewer/bulk-reviewer ☆13 Updated 5 years ago
- The Next-Generation Architecture for Format-Aware Characterization. ☆13 Updated 2 years ago
- NARA digital preservation file format risk analysis and preservation plans ☆200 Updated last week
- The objective of this script is to allow archivists to find groups of records that may be inactive because of their age. ☆10 Updated 7 years ago
- Test files for the OpenJPEG libraries and utilities ☆43 Updated 3 weeks ago
- Wrapper around hfsutils to generate DFXML for HFS-formatted disk images ☆12 Updated 6 years ago
- A utility for staging files, calculating and validating file checksums, and comparing checksum values between storage locations. ☆15 Updated last year
- Prototype wikidata portal project. ☆11 Updated 4 months ago