pdf-association / safedocs
Artifacts from the DARPA-funded SafeDocs research program
☆24 Updated 2 years ago
Alternatives and similar repositories for safedocs
Users who are interested in safedocs are comparing it to the repositories listed below
- A vendor- and implementation-independent specification-derived, machine-readable model of PDF. ☆85 Updated 3 weeks ago
- Targeted PDFs demonstrating commonly seen PDF differentials and interoperability issues ☆12 Updated last month
- ☆10 Updated 3 years ago
- veraPDF test corpus for ISO 19005 (PDF/A) and ISO 14289 (PDF/UA) ☆79 Updated 3 weeks ago
- CDXJ Indexing of WARC/ARCs ☆26 Updated 6 months ago
- PDF Name Registry ☆21 Updated 2 weeks ago
- Industry-based resolutions for issues and errata reported against any PDF-related specification ☆73 Updated this week
- Collections of individual rules and combined veraPDF validation profiles for various validation flavors ☆16 Updated 2 weeks ago
- XML Schema for Digital Forensics XML ☆35 Updated 4 months ago
- A tool for detecting viruses and NSFW material in WARC files ☆15 Updated 10 months ago
- PDF 2.0 example files ☆94 Updated 5 months ago
- CLI implementation of httpreserve that can test links and retrieve internet archive replacements ☆10 Updated 7 months ago
- Auto-generated static web site digipres.org ☆27 Updated this week
- A persistent repository for PRONOM Research Week activities ☆12 Updated 4 years ago
- Web archive index server based on RocksDB ☆34 Updated last month
- ☆14 Updated last year
- A mirror of the PRONOM file format registry in Linked Open Data format. The Format Registry is a linked (open) data file format repositor… ☆10 Updated 2 years ago
- Command line tool for digging into WARC files ☆40 Updated 2 weeks ago
- simple script to convert web resources to a single warc file ☆21 Updated 2 years ago
- Format Identification for Digital Objects (FIDO) is a Python command-line tool to identify the file formats of digital objects. It is des… ☆157 Updated 3 months ago
- A command line utility for listing and searching snapshots in web archives ☆16 Updated last year
- Efficient hOCR tooling ☆44 Updated last month
- Digital Preservation of HTTP in documentary heritage. ☆22 Updated 2 years ago
- A Github Action for turning Markdown into ReSpec HTML ☆14 Updated last year
- An open source set of decks for learning about digital preservation. ☆23 Updated 5 years ago
- An openly-licensed corpus of small example files, covering a wide range of formats and creation tools. ☆196 Updated 3 weeks ago
- This software (prototype) extracts values of Excel spreadsheet properties and calculates a tentative spreadsheet complexity assessment ba… ☆13 Updated 2 years ago
- Web application for distributed compute analysis of Archive-It web archive collections. ☆18 Updated 3 months ago
- Collection of resources, papers, blog posts, and other documentation around working on and with Archivematica. ☆21 Updated last year
- ☆13 Updated 2 months ago