pdf-association / safedocs
Artifacts from the DARPA-funded SafeDocs research program
☆24 Updated 2 years ago
Alternatives and similar repositories for safedocs
Users who are interested in safedocs are comparing it to the repositories listed below.
- PDF Name Registry ☆21 Updated last week
- CDXJ Indexing of WARC/ARCs ☆29 Updated 10 months ago
- A vendor- and implementation-independent specification-derived, machine-readable model of PDF. ☆89 Updated last week
- Targeted PDFs demonstrating commonly seen PDF differentials and interoperability issues ☆13 Updated 5 months ago
- A Github Action for turning Markdown into ReSpec HTML ☆14 Updated last year
- XML Schema for Digital Forensics XML ☆35 Updated 8 months ago
- An openly-licensed corpus of small example files, covering a wide range of formats and creation tools. ☆200 Updated 5 months ago
- A persistent repository for PRONOM Research Week activities ☆12 Updated 4 years ago
- Format Identification for Digital Objects (FIDO) is a Python command-line tool to identify the file formats of digital objects. It is des… ☆158 Updated 7 months ago
- This software (prototype) extracts values of Excel spreadsheet properties and calculates a tentative spreadsheet complexity assessment ba… ☆13 Updated 2 years ago
- A command line utility for listing and searching snapshots in web archives ☆17 Updated last year
- ☆14 Updated last year
- File validation and characterisation. ☆184 Updated last week
- Web archive index server based on RocksDB ☆36 Updated this week
- Command line tool for digging into WARC files ☆46 Updated this week
- Selected code and data for The Online Books Page and related applications ☆11 Updated last month
- Auto-generated static web site digipres.org ☆28 Updated this week
- A Memento Aggregator CLI and Server in Go ☆69 Updated 7 months ago
- A listing of world wide web archives, for humans and machines using Web Archive Manifest (WAM) yaml format ☆53 Updated 2 years ago
- DROID (Digital Record and Object Identification) ☆342 Updated this week
- signature-based file format identification ☆249 Updated last month
- A Python utility for creating PREMIS records from a CSV file ☆13 Updated last year
- PDF 2.0 example files ☆100 Updated 9 months ago
- ESSArch ☆19 Updated last week
- Engine for analysis of Siegfried export files and DROID CSV. The tool has three purposes, break the export into its components and store … ☆29 Updated last year
- BitCurator Environment: Using, building, and maintaining BitCurator ☆61 Updated last year
- A tool for detecting viruses and NSFW material in WARC files ☆17 Updated last year
- Efficient hOCR tooling ☆48 Updated 2 months ago
- Single server/laptop grade file-observatory ☆10 Updated 2 years ago
- A tool for creating and managing Mailbags, a package for preserving email using multiple preservation formats ☆48 Updated 2 months ago