An openly-licensed corpus of small example files, covering a wide range of formats and creation tools.
☆203Feb 16, 2026Updated 2 months ago
Alternatives and similar repositories for format-corpus
Users that are interested in format-corpus are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Engine for analysis of Siegfried export files and DROID CSV. The tool has three purposes, break the export into its components and store …☆33Dec 17, 2025Updated 3 months ago
- File validation and characterisation.☆201Dec 4, 2025Updated 4 months ago
- Format Identification for Digital Objects (FIDO) is a Python command-line tool to identify the file formats of digital objects. It is des…☆160Jan 22, 2026Updated 2 months ago
- Siegfried-based characterization tool for directories and disk images☆92Nov 28, 2025Updated 4 months ago
- Community Resource for Archivists and Librarians Scripting☆25Oct 14, 2021Updated 4 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Scripts for performing various tasks with the ArchivesSpace API☆14Jun 27, 2024Updated last year
- Auto-generated static web site digipres.org☆30Mar 27, 2026Updated 2 weeks ago
- Test files for conformance testing and benchmarking Jpylyzer.☆18Apr 2, 2024Updated 2 years ago
- ☆12Jan 13, 2026Updated 3 months ago
- Wrapper around hfsutils to generate DFXML for HFS-formatted disk images☆11Apr 20, 2018Updated 7 years ago
- Collection of resources, papers, blog posts, and other documentation around working on and with Archivematica.☆22Jan 4, 2024Updated 2 years ago
- DBPTK Developer - library and command-line tool for execution of database preservation actions☆53Mar 24, 2026Updated 3 weeks ago
- ☆16Apr 29, 2024Updated last year
- A Github Action for turning Markdown into ReSpec HTML☆15Jun 6, 2024Updated last year
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- A list of resources for setting up an audio digitization workflow☆14Mar 17, 2017Updated 9 years ago
- This project has been archived and is no longer being developed or supported. The Curator's Workbench is an extensible digital collectio…☆24Jun 25, 2020Updated 5 years ago
- The Average Novel☆10Dec 2, 2017Updated 8 years ago
- ☆14Sep 6, 2019Updated 6 years ago
- NARA digital preservation file format risk analysis and preservation plans☆236Mar 26, 2026Updated 3 weeks ago
- A web application for human-friendly exploration of Archivematica METS files☆25Sep 20, 2020Updated 5 years ago
- Useful scripts☆16Apr 8, 2026Updated last week
- all my knowledge in one place, for teaching/sharing/learning☆22Feb 17, 2026Updated last month
- veraPDF test corpus for ISO 19005 (PDF/A) and ISO 14289 (PDF/UA)☆86Feb 9, 2026Updated 2 months ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Suite of tools for automated quality assurance of audio migration processes.☆45May 7, 2018Updated 7 years ago
- The study group Bits and Bots accommodates digital preservation professionals seeking coding abilities. In this repository, you can find …☆42Feb 5, 2026Updated 2 months ago
- File detector, metadata collector and well-formedness checker tool☆18Feb 3, 2026Updated 2 months ago
- Website for America's Public Bible☆11Oct 1, 2020Updated 5 years ago
- ☆14Jan 3, 2024Updated 2 years ago
- tool for calibration and recording of analog audio sources☆30Mar 27, 2025Updated last year
- Add a Solr-backed search interface to Omeka.☆22Feb 6, 2021Updated 5 years ago
- SCOPE: An access interface for DIPs from Archivematica☆24Apr 9, 2026Updated last week
- Work with BagIt packages from Python.☆260Apr 8, 2026Updated last week
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Thoughts toward and tutorial on corpus-driven narrative generation☆25Nov 5, 2020Updated 5 years ago
- Destroy tweets and likes.☆14May 26, 2020Updated 5 years ago
- This is project based on linux-minidisc to extract digital contents from MZ-RH1 to Mac☆11Feb 11, 2021Updated 5 years ago
- Verify size of ISO 9660 image against Volume Descriptor fields☆53Jun 7, 2022Updated 3 years ago
- CCA Digital Archives Processing Manual☆34Jan 7, 2026Updated 3 months ago
- signature-based file format identification☆260Jan 23, 2026Updated 2 months ago
- ☆28Jul 17, 2020Updated 5 years ago