archivesunleashed / docker-autLinks
Docker image for the Archives Unleashed Toolkit
☆12Updated 2 years ago
Alternatives and similar repositories for docker-aut
Users that are interested in docker-aut are comparing it to the libraries listed below
Sorting:
- WASAPI data transfer APIs☆47Updated 3 years ago
- utility to fetch provenance information from Internet Archive's Wayback Machine☆13Updated 3 years ago
- Prototype SOLR-powered web archive exploration UI.☆43Updated 5 years ago
- A Data Parsing/Data Manipulation Tool Supporting Digitization Projects and Other Data Analysis Projects☆46Updated 6 years ago
- No longer maintained. Please use conciliator instead.☆26Updated 5 years ago
- The Archives Unleashed Toolkit is an open-source toolkit for analyzing web archives.☆148Updated last year
- A Twitter data collection and appraisal application.☆51Updated 2 years ago
- Lakesuperior, an alternative Fedora Repository implementation☆32Updated 3 years ago
- All that entity matching, resolution, normalization, enhancement and reconciliation madness, but with a focus on data, not platforms.☆24Updated 3 years ago
- Open ONI (Open Online Newspaper Initiative) Django web app☆51Updated 6 months ago
- Research Object BagIt archive☆19Updated 2 years ago
- Internet Research Agency Facebook ads as structured data☆22Updated 5 years ago
- Some ideas on making Bags into Git repositories☆16Updated 10 years ago
- Various examples of notebooks for working with web archives with the Archives Unleashed Toolkit, and derivatives generated by the Archive…☆26Updated 2 years ago
- Documents for the project Libraccess☆13Updated 10 years ago
- Distant Reader, a tool for using & understanding a corpus☆20Updated 2 years ago
- Crawl Archivematica's Archival Information Packages (AIP) and provide repository-wide reporting.☆13Updated this week
- Rails application with Blazegraph for managing controlled vocabularies in RDF.☆22Updated 2 years ago
- A curated list of awesome Jupyter projects and guides from the GLAM community.☆19Updated 4 years ago
- Mario is a metadata processing pipeline that will process data from various sources and write to Elasticsearch☆13Updated 2 years ago
- Automating description for Web Archives in ArchivesSpace using the Archive-It CDX and Partner Data APIs☆11Updated 7 years ago
- Web application for distributed compute analysis of Archive-It web archive collections.☆20Updated 2 weeks ago
- ☆28Updated 7 years ago
- ☆29Updated 7 years ago
- rightsstatements.org data model☆13Updated 3 years ago
- A Python library and client implementing the ResourceSync web synchronization framework☆33Updated 3 years ago
- a CLI suggestion tool for Wikidata entities☆30Updated 9 years ago
- Web Archiving Course☆23Updated last year
- Scripts to create git repositories for ALTO XML texts, like those from the British Library's scanned documents.☆31Updated 7 years ago
- Metadata ingestion system for Digital Public Library of America☆31Updated 3 weeks ago