pdf-association / safedocs
Artifacts from the DARPA-funded SafeDocs research program
☆24 Updated last year
Alternatives and similar repositories for safedocs:
Users who are interested in safedocs are comparing it to the repositories listed below
- A vendor- and implementation-independent specification-derived, machine-readable model of PDF. ☆84 Updated last week
- Industry-based resolutions for issues and errata reported against any PDF-related specification ☆71 Updated 3 weeks ago
- Targeted PDFs demonstrating commonly seen PDF differentials and interoperability issues ☆12 Updated 2 months ago
- CDXJ Indexing of WARC/ARCs ☆25 Updated 4 months ago
- XML Schema for Digital Forensics XML ☆35 Updated 2 months ago
- PDF Name Registry ☆19 Updated 2 weeks ago
- An interpretation of the content stream structure as described by ISO 32000 ☆11 Updated 2 years ago
- A listing of world wide web archives, for humans and machines using Web Archive Manifest (WAM) yaml format ☆53 Updated 2 years ago
- A command line utility for listing and searching snapshots in web archives ☆16 Updated last year
- PDF 2.0 example files ☆91 Updated 3 months ago
- Selected code and data for The Online Books Page and related applications ☆11 Updated 2 weeks ago
- A GitHub Action for turning Markdown into ReSpec HTML ☆14 Updated 10 months ago
- CLI implementation of httpreserve that can test links and retrieve internet archive replacements ☆10 Updated 4 months ago
- Efficient hOCR tooling ☆44 Updated last month
- Docker Compose based system for running remote browsers (including Flash and Java support) connected to web archives ☆15 Updated 3 years ago
- Command line tool for digging into WARC files ☆39 Updated 2 weeks ago
- This software (prototype) extracts values of Excel spreadsheet properties and calculates a tentative spreadsheet complexity assessment ba… ☆13 Updated 2 years ago
- Auto-generated static web site digipres.org ☆26 Updated this week
- Digital Preservation of HTTP in documentary heritage. ☆22 Updated last year
- Fast PDF generation and compression. Deals with millions of pages daily. ☆115 Updated 8 months ago
- veraPDF GUI, CLI and installer ☆83 Updated 2 weeks ago
- Collections of individual rules and combined veraPDF validation profiles for various validation flavors ☆15 Updated 3 weeks ago
- A mirror of the PRONOM file format registry in Linked Open Data format. The Format Registry is a linked (open) data file format repositor… ☆10 Updated last year
- A tool for detecting viruses and NSFW material in WARC files ☆13 Updated 8 months ago
- Web archive index server based on RocksDB ☆34 Updated 4 months ago
- A tool for collecting archival slivers of the web and web archives ☆13 Updated 2 months ago
- An open source set of decks for learning about digital preservation. ☆23 Updated 5 years ago
- A persistent repository for PRONOM Research Week activities ☆12 Updated 3 years ago