pdf-association / arlington-pdf-modelLinks
A vendor- and implementation-independent specification-derived, machine-readable model of PDF.
☆95Updated this week
Alternatives and similar repositories for arlington-pdf-model
Users that are interested in arlington-pdf-model are comparing it to the libraries listed below
Sorting:
- PDF Name Registry☆21Updated this week
- An index of PDF-centric corpora☆154Updated 6 months ago
- Industry-based resolutions for issues and errata reported against any PDF-related specification☆83Updated last month
- PDF 2.0 example files☆104Updated last year
- Artifacts from the DARPA-funded SafeDocs research program☆25Updated 2 years ago
- File validation and characterisation.☆195Updated last month
- An openly-licensed corpus of small example files, covering a wide range of formats and creation tools.☆201Updated 7 months ago
- Industry supported, open source PDF/A validation library☆314Updated last week
- Targeted PDFs demonstrating commonly seen PDF differentials and interoperability issues☆14Updated 8 months ago
- Fast PDF generation and compression. Deals with millions of pages daily.☆132Updated 2 weeks ago
- JP2 (JPEG 2000 Part 1) validator and properties extractor. Jpylyzer was specifically created to check that a JP2 file really conforms to …☆78Updated last month
- ALTO XML schema - latest and all former versions☆55Updated last month
- Format Identification for Digital Objects (FIDO) is a Python command-line tool to identify the file formats of digital objects. It is des…☆158Updated 10 months ago
- Documentation and use cases for ALTO XML☆41Updated 7 years ago
- METS 1.x and METS 2 schemas☆25Updated 7 months ago
- veraPDF GUI, CLI and installer☆96Updated last week
- The hOCR Embedded OCR Workflow and Output Format☆75Updated last year
- Efficient hOCR tooling☆55Updated 5 months ago
- This software (prototype) extracts values of Excel spreadsheet properties and calculates a tentative spreadsheet complexity assessment ba…☆13Updated 3 years ago
- Convert between Tesseract hOCR and ALTO XML using XSL stylesheets☆59Updated 3 months ago
- JBIG2 Encoder☆47Updated 3 weeks ago
- XSLT and XQuery Specifications - the source used to build the specs, and the errata☆39Updated 2 years ago
- Saxon XInclude processor☆12Updated 5 months ago
- Collections of individual rules and combined veraPDF validation profiles for various validation flavors☆19Updated last week
- Validate and transform various OCR file formats (hOCR, ALTO, PAGE, FineReader)☆198Updated 8 months ago
- signature-based file format identification☆254Updated 4 months ago
- Crop And Splice Segments (of scanned pages)☆14Updated 6 years ago
- VSCode extension for highlighting XSLT and XPath (upto 3.0/3.1)☆46Updated 2 months ago
- Automatically exported from code.google.com/p/xspec☆40Updated 7 years ago
- DEPRECATED eXist code for Syriaca.org: The Syriac Reference Portal☆10Updated last year