edsu / dedoop
recursively deduplicate a directory and write its contents to a new directory while remembering the old paths
☆49Updated 4 years ago
Alternatives and similar repositories for dedoop
Users that are interested in dedoop are comparing it to the libraries listed below
- Test cases for validating BagIt implementations☆11Updated 2 years ago
- a CLI suggestion tool for Wikidata entities☆30Updated 8 years ago
- A Lit web-component for viewing a Whisper JSON transcript file☆14Updated 9 months ago
- (Note: This repository is obsolete, please see the new Browsertrix webrecorder/browsertrix) Browser-Based On-Demand Web Archiving Automat…☆39Updated 6 years ago
- A tool for creating and managing Mailbags, a package for preserving email using multiple preservation formats☆48Updated last month
- Download digitized books from Internet Archive and view with IIIF, locally and offline.☆40Updated last year
- Some ideas on making Bags into Git repositories☆16Updated 10 years ago
- Collaborative bibliography on 'experimental writing' and 'financial crisis' from the Mute archive - http://metamute.org/archive☆10Updated 2 years ago
- Pymarc Utilities is a set of functions aimed to help manipulating large MARC files. Pymarc Utilities works with Pymarc library for wo…☆22Updated 4 months ago
- A fast, responsive HTML5 viewer for scanned items, developed for the World Digital Library. A project of the Library of Congress. Note: p…☆22Updated 10 years ago
- WASAPI data transfer APIs☆47Updated 3 years ago
- ☆14Updated last year
- MARC to RDF toolkit - converter and harvester through json mapping☆36Updated 10 years ago
- Web Archiving Course☆23Updated last year
- A MongoDB implementation of the W3C Web Annotation Protocol☆17Updated 3 years ago
- A command line utility for listing and searching snapshots in web archives☆16Updated last year
- Web application for distributed compute analysis of Archive-It web archive collections.☆20Updated 5 months ago
- command line resource for working with digital primary sources☆28Updated 7 years ago
- mirror a website, put it in a bag☆25Updated 2 years ago
- Web service for creating and hosting IIIF manifests from METS/MODS documents☆36Updated 2 years ago
- A Rails engine for metadata aggregation, enhancement, and quality control.☆29Updated 8 years ago
- Format Identification for Digital Objects (FIDO) is a Python command-line tool to identify the file formats of digital objects. It is des…☆157Updated 6 months ago
- Open ONI (Open Online Newspaper Initiative) Django web app☆51Updated 5 months ago
- List of all awesome Trusted Digital Repositories☆17Updated 3 years ago
- Shepherding our web archives from crawl to access.☆10Updated last year
- A Rails engine supporting discovery of archival material☆45Updated last month
- A commandline tool and Python library for archiving data from Facebook using the Graph API.☆78Updated 7 years ago
- a webapp for code4lib jobs☆40Updated 3 years ago
- A Jekyll-based static site generator for archival description in JSON.☆33Updated 10 months ago
- Public-facing data for the US Archives RepoData project☆17Updated last year