internetarchive / arch
Web application for distributed compute analysis of Archive-It web archive collections.
☆16Updated 6 months ago
Alternatives and similar repositories for arch:
Users that are interested in arch are comparing it to the libraries listed below
- Command line tool for digging into WARC files☆38Updated this week
- Web Archiving Course☆20Updated last year
- Automating description for Web Archives in ArchivesSpace using the Archive-It CDX and Partner Data APIs☆11Updated 6 years ago
- A command line utility for listing and searching snapshots in web archives☆15Updated last year
- Prototype SOLR-powered web archive exploration UI.☆43Updated 4 years ago
- A tool for creating and managing Mailbags, a package for preserving email using multiple preservation formats☆47Updated 7 months ago
- Download digitized books from Internet Archive and view with IIIF, locally and offline.☆37Updated 10 months ago
- Rails application for the Archives Unleashed Cloud.☆11Updated 3 years ago
- ☆10Updated 3 years ago
- An open source set of decks for learning about digital preservation.☆23Updated 5 years ago
- WASAPI data transfer APIs☆44Updated 2 years ago
- Archive Research Services Workshop☆31Updated 7 years ago
- Carefully curated list of awesome digital preservation resources.☆85Updated last month
- Shepherding our web archives from crawl to access.☆10Updated last year
- ☆14Updated last year
- Siegfried-based characterization tool for directories and disk images☆84Updated 2 months ago
- Documentation for Project Electron☆13Updated 3 months ago
- A client for the Archive-It And Webrecorder WASAPI Data Transfer API☆16Updated 5 years ago
- A command line utility for converting MARC to CSV (and Parquet, etc)☆28Updated last month
- Django app for managing PREMIS Events☆14Updated last month
- Collection of resources, papers, blog posts, and other documentation around working on and with Archivematica.☆19Updated last year
- utility to fetch provenance information from Internet Archive's Wayback Machine☆13Updated 2 years ago
- Experimental continouous web crawler for web archiving☆9Updated 2 years ago
- CDXJ Indexing of WARC/ARCs☆25Updated 3 months ago
- A repository to organize materials from the AI4LAM Teach and Learning Working Group☆14Updated last year
- Collaborative collection development for web archives☆18Updated 5 years ago
- No longer maintained. Please use conciliator instead.☆26Updated 4 years ago
- Various examples of notebooks for working with web archives with the Archives Unleashed Toolkit, and derivatives generated by the Archive…☆24Updated 2 years ago