openpreserve / format-corpus
An openly-licensed corpus of small example files, covering a wide range of formats and creation tools.
☆189Updated last year
Alternatives and similar repositories for format-corpus:
Users that are interested in format-corpus are comparing it to the libraries listed below
- veraPDF test corpus for ISO 19005 (PDF/A) and ISO 14289 (PDF/UA)☆75Updated 2 weeks ago
- Format Identification for Digital Objects (FIDO) is a Python command-line tool to identify the file formats of digital objects. It is des…☆156Updated 3 months ago
- A persistent repository for PRONOM Research Week activities☆11Updated 3 years ago
- JP2 (JPEG 2000 Part 1) validator and properties extractor. Jpylyzer was specifically created to check that a JP2 file really conforms to …☆73Updated 2 months ago
- An index of PDF-centric corpora☆122Updated 3 weeks ago
- Auto-generated static web site digipres.org☆26Updated this week
- Engine for analysis of Siegfried export files and DROID CSV. The tool has three purposes, break the export into its components and store …☆25Updated 9 months ago
- Integrate handcrafted binary and documentation☆37Updated 10 months ago
- Carefully curated list of awesome digital preservation resources.☆83Updated last week
- ☆30Updated this week
- This software (prototype) extracts values of Excel spreadsheet properties and calculates a tentative spreadsheet complexity assessment ba…☆12Updated 2 years ago
- Corpus of Digital Camera files.☆24Updated 7 months ago
- File validation and characterisation.☆176Updated this week
- Convert Directories, Files and ZIP Files to Web Archives (WARC)☆84Updated 2 months ago
- all my knowledge in one place, for teaching/sharing/learning☆20Updated last week
- Digital preservation policies and strategies☆11Updated 10 months ago
- Loader software for automated imaging of optical media with Nimbie disc robot☆34Updated 5 months ago
- Archive Research Services Workshop☆31Updated 7 years ago
- RE-lab is a joint effort of gimp.ru team and developers of various open source projects to do clean-room reverse engineering of various p…☆78Updated 3 years ago
- List of analog media inspection templates/forms.☆18Updated 3 years ago
- An open source set of decks for learning about digital preservation.☆23Updated 5 years ago
- Prototype wikidata portal project.☆10Updated 9 months ago
- A vendor- and implementation-independent specification-derived, machine-readable model of PDF.☆80Updated 3 weeks ago
- visualizations/charts for media collections, based on mediainfo☆14Updated 2 years ago
- A utility for staging files, calculating and validating file checksums, and comparing checksum values between storage locations.☆15Updated last year
- ☆16Updated 9 months ago
- Single server/laptop grade file-observatory☆10Updated last year
- Prototype SOLR-powered web archive exploration UI.☆43Updated 4 years ago
- CDXJ Indexing of WARC/ARCs☆25Updated 2 months ago
- scripts for automating QCTools actions☆11Updated 2 months ago