Living-with-machines / deduplify
A Python tool to search for and remove duplicated files in messy datasets
☆16 Updated 7 months ago
Alternatives and similar repositories for deduplify
Users who are interested in deduplify are comparing it to the repositories listed below.
- ☆29 Updated 7 years ago
- Library Carpentry: OpenRefine ☆54 Updated this week
- LD4P Sinopia Project repo to hold docs, general issues, schemas, and related spec docs. ☆22 Updated 2 years ago
- Oral History/Qualitative Interview Data Analysis and Publication Tool ☆20 Updated 2 years ago
- An open source online storytelling platform for everyone. Built by Cogapp. ☆30 Updated 5 months ago
- For working on the recipes ☆41 Updated this week
- Instructions, exercises and example data sets for Annif hands-on tutorial ☆42 Updated 2 weeks ago
- A curated list of awesome Jupyter projects and guides from the GLAM community. ☆19 Updated 3 years ago
- Documentation and Data related to the Linked Data and Wikidata Working Groups ☆27 Updated last year
- ☆63 Updated 2 years ago
- OpenRefine reconciler for Research Organization Registry ☆13 Updated 4 months ago
- Files for the On The Books project ☆35 Updated 9 months ago
- Heritage Connector: Transforming text into data to extract meaning and make connections ☆24 Updated 2 years ago
- ☆15 Updated last year
- Python for Librarians ☆21 Updated 7 years ago
- A Python client for the DPLA API ☆44 Updated 2 years ago
- Taxonomy of Digital Research Activities in the Humanities ☆109 Updated this week
- A framework for creating and displaying visual essays ☆54 Updated last year
- Public-facing data for the US Archives RepoData project ☆18 Updated last year
- Simple command line OAI-PMH harvester written in Python. ☆41 Updated 2 years ago
- Development of a specification for linked data in museums, using existing ontologies and frameworks to build usable, understandable APIs ☆107 Updated 2 months ago
- A platform-agnostic, configurable, and brandable SPARQL editor and visualization interface. ☆15 Updated 2 weeks ago
- Grateful Data isn't programming code, but an online tutorial about data acquisition, cleaning and enriching, using publicly accessible da… ☆59 Updated 5 years ago
- Source code of BARTOC.org user interface ☆26 Updated 2 weeks ago
- A data validation tool for MARC records ☆23 Updated 3 months ago
- Mutual Muses is a crowdsourced transcription project undertaken by the Digital Art History program at the Getty Research Institute ☆16 Updated 7 years ago
- Tidy data for librarians ☆24 Updated this week
- ☆28 Updated 7 years ago
- Repository for code developed for Saving Ukrainian Cultural Heritage Online (SUCHO) project ☆7 Updated 3 years ago
- A crowd-sourcing project by Cambridge Digital Library undertaken during the University of Cambridge's closure period due to the Coronavir… ☆11 Updated 9 months ago