pdf-association / safedocs
Artifacts from the DARPA-funded SafeDocs research program
☆22Updated last year
Related projects ⓘ
Alternatives and complementary repositories for safedocs
- ☆14Updated 10 months ago
- PDF Name Registry☆18Updated 3 months ago
- A vendor- and implementation-independent specification-derived, machine-readable model of PDF.☆77Updated this week
- A persistent repository for PRONOM Research Week activities☆11Updated 3 years ago
- Industry-based resolutions for issues and errata reported against any PDF-related specification☆66Updated this week
- This software (prototype) extracts values of Excel spreadsheet properties and calculates a tentative spreadsheet complexity assessment ba…☆12Updated last year
- File validation and characterisation.☆171Updated this week
- Auto-generated static web site digipres.org☆26Updated last week
- XML Schema for Digital Forensics XML☆36Updated 5 months ago
- Engine for analysis of Siegfried export files and DROID CSV. The tool has three purposes, break the export into its components and store …☆23Updated 6 months ago
- PDF 2.0 example files☆83Updated 6 months ago
- CDXJ Indexing of WARC/ARCs☆21Updated last week
- Format Identification for Digital Objects (FIDO) is a Python command-line tool to identify the file formats of digital objects. It is des…☆148Updated last week
- DROID (Digital Record and Object Identification)☆284Updated last week
- ☆28Updated last week
- JP2 (JPEG 2000 Part 1) validator and properties extractor. Jpylyzer was specifically created to check that a JP2 file really conforms to …☆69Updated 7 months ago
- Collection of resources, papers, blog posts, and other documentation around working on and with Archivematica.☆19Updated 10 months ago
- Targeted PDFs demonstrating commonly seen PDF differentials and interoperability issues☆9Updated last month
- BitCurator Environment: Using, building, and maintaining BitCurator☆53Updated 10 months ago
- veraPDF GUI, CLI and installer☆65Updated this week
- signature-based file format identification☆224Updated 3 weeks ago
- An index of PDF-centric corpora☆110Updated last month
- Web application for distributed compute analysis of Archive-It web archive collections.☆15Updated 2 months ago
- CLI implementation of httpreserve that can test links and retrieve internet archive replacements☆10Updated this week
- Documentation for Project Electron☆13Updated last year
- A listing of world wide web archives, for humans and machines using Web Archive Manifest (WAM) yaml format☆43Updated last year
- simple script to convert web resources to a single warc file☆18Updated last year
- A Github Action for turning Markdown into ReSpec HTML☆13Updated 5 months ago
- visualizations/charts for media collections, based on mediainfo☆14Updated 2 years ago