internetarchive / Sparkling
Internet Archive's Sparkling Data Processing Library
☆13Updated this week
Alternatives and similar repositories for Sparkling:
Users that are interested in Sparkling are comparing it to the libraries listed below
- Command line tool for digging into WARC files☆39Updated this week
- Web application for distributed compute analysis of Archive-It web archive collections.☆18Updated last month
- ☆13Updated 2 weeks ago
- ☆14Updated last year
- Efficient indexing and retrieval of OCR bounding boxes in Solr☆22Updated 6 years ago
- The Oxford Common File Layout (OCFL) specifications and website☆59Updated 5 months ago
- A Github Action for turning Markdown into ReSpec HTML☆14Updated 11 months ago
- Documentation for Project Electron☆13Updated 5 months ago
- An open source set of decks for learning about digital preservation.☆23Updated 5 years ago
- DC Tabular Application Profile☆34Updated 6 months ago
- ☆41Updated 6 years ago
- Web application to try out reconciliation services interactively☆13Updated 2 weeks ago
- Repository for versions of BIBFRAME ontology.☆53Updated 2 months ago
- Open-source tools for working with BIBFRAME (see: http://bibframe.org), by default BIBFRAME Lite (see: http://bibfra.me) and more general…☆24Updated 3 years ago
- VuFind Harvest Tools☆21Updated 6 months ago
- Prototype wikidata portal project.☆10Updated last year
- A LDP Implementation backed by BlazeGraph☆27Updated 7 years ago
- WASAPI data transfer APIs☆44Updated 3 years ago
- Siegfried-based characterization tool for directories and disk images☆84Updated 4 months ago
- Sinopia Linked Data Editor☆36Updated this week
- CDXJ Indexing of WARC/ARCs☆25Updated 4 months ago
- Engine for analysis of Siegfried export files and DROID CSV. The tool has three purposes, break the export into its components and store …☆25Updated last year
- LD4P Sinopia Project repo to hold docs, general issues, schemas, and related spec docs.☆21Updated 2 years ago
- QA Catalogue – a metadata quality assessment tool for library catalogue records (MARC, PICA, UNIMARC)☆83Updated this week
- Automating description for Web Archives in ArchivesSpace using the Archive-It CDX and Partner Data APIs☆11Updated 6 years ago
- A tool for creating and managing Mailbags, a package for preserving email using multiple preservation formats☆47Updated 9 months ago
- ☆61Updated 2 years ago
- MARC to RDF toolkit - converter and harvester through json mapping☆36Updated 9 years ago
- ☆35Updated last year
- Experimental continouous web crawler for web archiving☆9Updated 2 years ago