pdf-association / safedocsLinks
Artifacts from the DARPA-funded SafeDocs research program
☆25Updated 2 years ago
Alternatives and similar repositories for safedocs
Users that are interested in safedocs are comparing it to the libraries listed below
Sorting:
- PDF Name Registry☆22Updated 2 months ago
- A vendor- and implementation-independent specification-derived, machine-readable model of PDF.☆87Updated last week
- XML Schema for Digital Forensics XML☆36Updated 6 months ago
- File validation and characterisation.☆184Updated last week
- CDXJ Indexing of WARC/ARCs☆28Updated 8 months ago
- DROID (Digital Record and Object Identification)☆331Updated last week
- This software (prototype) extracts values of Excel spreadsheet properties and calculates a tentative spreadsheet complexity assessment ba…☆13Updated 2 years ago
- Selected code and data for The Online Books Page and related applications☆11Updated 3 weeks ago
- A command line utility for listing and searching snapshots in web archives☆16Updated last year
- Format Identification for Digital Objects (FIDO) is a Python command-line tool to identify the file formats of digital objects. It is des…☆157Updated 5 months ago
- signature-based file format identification☆246Updated 4 months ago
- BitCurator Environment: Using, building, and maintaining BitCurator☆58Updated last year
- A listing of world wide web archives, for humans and machines using Web Archive Manifest (WAM) yaml format☆53Updated 2 years ago
- Targeted PDFs demonstrating commonly seen PDF differentials and interoperability issues☆13Updated 3 months ago
- Web archive index server based on RocksDB☆34Updated last month
- A Github Action for turning Markdown into ReSpec HTML☆14Updated last year
- Single server/laptop grade file-observatory☆10Updated 2 years ago
- Powerful Python tool to analyze PDF documents☆26Updated 3 years ago
- Command line tool for digging into WARC files☆45Updated last week
- A Memento Aggregator CLI and Server in Go☆68Updated 5 months ago
- A persistent repository for PRONOM Research Week activities☆12Updated 4 years ago
- Specifications developed and maintained by the Webrecorder community.☆136Updated 7 months ago
- Classic LOCKSS System (LOCKSS 1.x)☆67Updated this week
- Converts WARC files to static HTML☆47Updated last year
- ☆14Updated last year
- An openly-licensed corpus of small example files, covering a wide range of formats and creation tools.☆200Updated 2 months ago
- Documentation for Project Electron☆13Updated 8 months ago
- Fast PDF generation and compression. Deals with millions of pages daily.☆120Updated last week
- A fork of the disktype disk and disk image format detection tool☆10Updated 8 years ago
- Collection of resources, papers, blog posts, and other documentation around working on and with Archivematica.☆21Updated last year