nla / chronicrawl
Experimental continouous web crawler for web archiving
☆9Updated 2 years ago
Alternatives and similar repositories for chronicrawl:
Users that are interested in chronicrawl are comparing it to the libraries listed below
- Shepherding our web archives from crawl to access.☆10Updated last year
- Automating description for Web Archives in ArchivesSpace using the Archive-It CDX and Partner Data APIs☆11Updated 6 years ago
- ☆12Updated this week
- A command line utility for listing and searching snapshots in web archives☆16Updated last year
- Web application for distributed compute analysis of Archive-It web archive collections.☆17Updated last month
- A CLI tool that generates IIIF Presentation 2.1 Manifests from METS/MODS☆23Updated 4 months ago
- Command line tool for digging into WARC files☆39Updated 2 weeks ago
- Rails application for the Archives Unleashed Cloud.☆11Updated 3 years ago
- A client for the Archive-It And Webrecorder WASAPI Data Transfer API☆16Updated 5 years ago
- Web archive index server based on RocksDB☆34Updated 5 months ago
- ☆25Updated last year
- Fedora API Specification☆17Updated 3 years ago
- Web service for creating and hosting IIIF manifests from METS/MODS documents☆36Updated 2 years ago
- Download digitized books from Internet Archive and view with IIIF, locally and offline.☆38Updated last year
- Django app for managing PREMIS Events☆14Updated 2 months ago
- OCFL tools in Python☆21Updated last week
- Ansible deployment of fedora 4, single or clustered on ubuntu 14.04☆10Updated 9 years ago
- Web Archiving Course☆21Updated last year
- ☆13Updated 5 years ago
- Scripts and configuration for converting MARC bibliographic records into RDF☆30Updated 5 years ago
- IIIF Examples and useful code☆18Updated 2 months ago
- Metadata Quality Assessment Framework API☆17Updated this week
- chrome extension to detect IIIF content in web pages☆20Updated 2 years ago
- Some ideas on making Bags into Git repositories☆16Updated 10 years ago
- utility to fetch provenance information from Internet Archive's Wayback Machine☆13Updated 2 years ago
- Rails application supporting the creation of OCR and the IIIF Content Search API☆34Updated 2 years ago
- rightsstatements.org data model☆12Updated 2 years ago
- ☆16Updated last month
- A tool for creating and managing Mailbags, a package for preserving email using multiple preservation formats☆47Updated 8 months ago
- Efficient indexing and retrieval of OCR bounding boxes in Solr☆22Updated 6 years ago