pdf-association / safedocs
Artifacts from the DARPA-funded SafeDocs research program
☆22 Updated last year
Alternatives and similar repositories for safedocs:
Users who are interested in safedocs are comparing it to the repositories listed below
- A vendor- and implementation-independent specification-derived, machine-readable model of PDF. ☆79 Updated this week
- Industry-based resolutions for issues and errata reported against any PDF-related specification ☆68 Updated this week
- XML Schema for Digital Forensics XML ☆36 Updated 6 months ago
- PDF Name Registry ☆18 Updated this week
- veraPDF test corpus for ISO 19005 (PDF/A) and ISO 14289 (PDF/UA) ☆74 Updated 3 months ago
- A persistent repository for PRONOM Research Week activities ☆11 Updated 3 years ago
- CDXJ Indexing of WARC/ARCs ☆23 Updated last month
- A listing of World Wide Web archives, for humans and machines, using the Web Archive Manifest (WAM) YAML format ☆44 Updated 2 years ago
- File validation and characterisation. ☆173 Updated last month
- PDF 2.0 example files ☆87 Updated this week
- Auto-generated static web site digipres.org ☆26 Updated last month
- An openly-licensed corpus of small example files, covering a wide range of formats and creation tools. ☆188 Updated last year
- Signature-based file format identification ☆227 Updated this week
- Open ONI (Open Online Newspaper Initiative) Django web app ☆48 Updated 6 months ago
- This software (prototype) extracts values of Excel spreadsheet properties and calculates a tentative spreadsheet complexity assessment ba… ☆12 Updated 2 years ago
- An interpretation of the content stream structure as described by ISO 32000 ☆11 Updated last year
- Single server/laptop grade file-observatory ☆10 Updated last year
- Engine for analysis of Siegfried export files and DROID CSV. The tool has three purposes, break the export into its components and store … ☆23 Updated 8 months ago
- Siegfried-based characterization tool for directories and disk images ☆84 Updated last month
- Tools for helping you work with web platform archive downloads. ☆17 Updated 4 years ago
- Selected code and data for The Online Books Page and related applications ☆10 Updated 2 weeks ago
- Format Identification for Digital Objects (FIDO) is a Python command-line tool to identify the file formats of digital objects. It is des… ☆153 Updated 2 months ago
- A GitHub Action for turning Markdown into ReSpec HTML ☆13 Updated 7 months ago
- Efficient hOCR tooling ☆42 Updated 4 months ago
- File-tests is a test suite for the File tool. Previous home: https://fedorahosted.org/file-tests/ ☆19 Updated last year
- Docker Compose based system for running remote browsers (including Flash and Java support) connected to web archives ☆14 Updated 3 years ago
- CLI implementation of httpreserve that can test links and retrieve internet archive replacements ☆10 Updated last month
- A command line utility for listing and searching snapshots in web archives ☆15 Updated last year