pdf-association / safedocsLinks
Artifacts from the DARPA-funded SafeDocs research program
☆25Updated 2 years ago
Alternatives and similar repositories for safedocs
Users that are interested in safedocs are comparing it to the libraries listed below
Sorting:
- PDF Name Registry☆22Updated last week
- A vendor- and implementation-independent specification-derived, machine-readable model of PDF.☆96Updated last week
- CDXJ Indexing of WARC/ARCs☆31Updated last year
- Auto-generated static web site digipres.org☆29Updated 2 weeks ago
- A command line utility for listing and searching snapshots in web archives☆17Updated 2 years ago
- signature-based file format identification☆255Updated this week
- XML Schema for Digital Forensics XML☆35Updated 11 months ago
- Targeted PDFs demonstrating commonly seen PDF differentials and interoperability issues☆14Updated 8 months ago
- A Github Action for turning Markdown into ReSpec HTML☆15Updated last year
- An openly-licensed corpus of small example files, covering a wide range of formats and creation tools.☆201Updated 7 months ago
- DROID (Digital Record and Object Identification)☆361Updated this week
- A listing of world wide web archives, for humans and machines using Web Archive Manifest (WAM) yaml format☆52Updated 3 years ago
- ☆14Updated 2 years ago
- This software (prototype) extracts values of Excel spreadsheet properties and calculates a tentative spreadsheet complexity assessment ba…☆13Updated 3 years ago
- Selected code and data for The Online Books Page and related applications☆11Updated 3 weeks ago
- Format Identification for Digital Objects (FIDO) is a Python command-line tool to identify the file formats of digital objects. It is des…☆159Updated 10 months ago
- Web archive index server based on RocksDB☆37Updated last week
- A fork of the disktype disk and disk image format detection tool☆11Updated 9 years ago
- File validation and characterisation.☆196Updated last month
- Analyze and help extract older "hidden" versions of a pdf from the current pdf.☆100Updated 3 years ago
- File-tests is test-suite for File tool. Previous home: https://fedorahosted.org/file-tests/☆21Updated last month
- A Python utility for creating PREMIS records from a CSV file☆14Updated last year
- A persistent repository for PRONOM Research Week activities☆12Updated 4 years ago
- Efficient hOCR tooling☆55Updated 5 months ago
- Convert HTTP Archive (HAR) -> Web Archive (WARC) format☆55Updated 7 years ago
- Industry-based resolutions for issues and errata reported against any PDF-related specification☆83Updated last week
- BitCurator Environment: Using, building, and maintaining BitCurator☆63Updated 2 years ago
- Docker Compose based system for running remote browsers (including Flash and Java support) connected to web archives☆16Updated 4 years ago
- search interface for scholarly works☆85Updated last year
- Centralised repository for WARC usage specifications.☆121Updated 3 months ago