openpreserve / format-corpus
An openly-licensed corpus of small example files, covering a wide range of formats and creation tools.
☆201 Updated 7 months ago
Alternatives and similar repositories for format-corpus
Users who are interested in format-corpus are comparing it to the repositories listed below.
- JP2 (JPEG 2000 Part 1) validator and properties extractor. Jpylyzer was specifically created to check that a JP2 file really conforms to … ☆77 Updated last month
- Format Identification for Digital Objects (FIDO) is a Python command-line tool to identify the file formats of digital objects. It is des… ☆158 Updated 9 months ago
- veraPDF test corpus for ISO 19005 (PDF/A) and ISO 14289 (PDF/UA) ☆83 Updated 3 weeks ago
- Integrate handcrafted binary and documentation ☆36 Updated 2 months ago
- Test files for the OpenJPEG libraries and utilities ☆45 Updated last year
- RE-lab is a joint effort of gimp.ru team and developers of various open source projects to do clean-room reverse engineering of various p… ☆82 Updated 4 years ago
- This software (prototype) extracts values of Excel spreadsheet properties and calculates a tentative spreadsheet complexity assessment ba… ☆13 Updated 3 years ago
- A persistent repository for PRONOM Research Week activities ☆12 Updated 4 years ago
- Test files for conformance testing and benchmarking Jpylyzer. ☆17 Updated last year
- Auto-generated static web site digipres.org ☆29 Updated last month
- signature-based file format identification ☆254 Updated 3 months ago
- DROID (Digital Record and Object Identification) ☆353 Updated 2 weeks ago
- Engine for analysis of Siegfried export files and DROID CSV. The tool has three purposes, break the export into its components and store … ☆31 Updated 3 weeks ago
- File validation and characterisation. ☆192 Updated last month
- XML Schema for Digital Forensics XML ☆35 Updated 11 months ago
- Loader software for automated imaging of optical media with Nimbie disc robot ☆36 Updated 9 months ago
- A fork of the disktype disk and disk image format detection tool ☆11 Updated 9 years ago
- ☆35 Updated last month
- Corpus of Digital Camera files. ☆27 Updated last week
- NARA digital preservation file format risk analysis and preservation plans ☆237 Updated 2 weeks ago
- Single server/laptop grade file-observatory ☆10 Updated 2 years ago
- Prototype wikidata portal project. ☆11 Updated last year
- Convert Directories, Files and ZIP Files to Web Archives (WARC) ☆90 Updated 8 months ago
- Digital preservation policies and strategies ☆12 Updated last year
- An Apache Spark framework for easy data processing, extraction as well as derivation for web archives and archival collections, developed… ☆155 Updated 3 months ago
- Carefully curated list of awesome digital preservation resources. ☆115 Updated 5 months ago
- Artifacts from the DARPA-funded SafeDocs research program ☆25 Updated 2 years ago
- A pure Python cleanroom implementation of libmagic, with instrumented parsing from Kaitai struct and an interactive hex viewer ☆376 Updated last month
- DEPRECATED. Replaced with Electron desktop application: https://github.com/bulk-reviewer/bulk-reviewer ☆13 Updated 6 years ago
- Tool for automated processing of disk images in BitCurator ☆25 Updated 9 months ago