internetarchive / arch
Web application for distributed compute analysis of Archive-It web archive collections.
☆15Updated 5 months ago
Alternatives and similar repositories for arch:
Users who are interested in arch are comparing it to the repositories listed below
- Command line tool for digging into WARC files☆38Updated this week
- Web Archiving Course☆20Updated 11 months ago
- Automating description for Web Archives in ArchivesSpace using the Archive-It CDX and Partner Data APIs☆11Updated 6 years ago
- A command line utility for converting MARC to CSV (and Parquet, etc)☆28Updated last week
- Prototype SOLR-powered web archive exploration UI.☆43Updated 4 years ago
- No longer maintained. Please use conciliator instead.☆26Updated 4 years ago
- Simple command line oai-pmh harvester written in Python.☆41Updated 2 years ago
- Django app for managing PREMIS Events☆14Updated last week
- BookOps WorldCat Metadata API wrapper☆38Updated this week
- WASAPI data transfer APIs☆43Updated 2 years ago
- Documentation for Project Electron☆13Updated 2 months ago
- An open source set of decks for learning about digital preservation.☆23Updated 5 years ago
- Pymarc Utilities is a set of functions intended to help manipulate large MARC files. Pymarc Utilities works with the Pymarc library for wo…☆21Updated 6 months ago
- Download digitized books from Internet Archive and view with IIIF, locally and offline.☆36Updated 10 months ago
- A tool for creating and managing Mailbags, a package for preserving email using multiple preservation formats☆47Updated 6 months ago
- Open ONI (Open Online Newspaper Initiative) Django web app☆48Updated 7 months ago
- MARC to RDF toolkit - converter and harvester through json mapping☆36Updated 9 years ago
- Rails application for the Archives Unleashed Cloud.☆11Updated 3 years ago
- Shepherding our web archives from crawl to access.☆10Updated last year
- Identify, review, and remove sensitive files☆29Updated last year
- Various examples of notebooks for working with web archives with the Archives Unleashed Toolkit, and derivatives generated by the Archive…☆24Updated 2 years ago
- Tools used for harmful language description audit in Duke's Rubenstein Library, including binaries, documentation, and source code for pu…☆19Updated 2 years ago
- Provides an analytics capability for FOLIO libraries☆15Updated 5 months ago
- Open-source tools for working with BIBFRAME (see: http://bibframe.org), by default BIBFRAME Lite (see: http://bibfra.me) and more general…☆23Updated 3 years ago
- Siegfried-based characterization tool for directories and disk images☆84Updated 2 months ago
- Public-facing data for the US Archives RepoData project☆17Updated last year