Living-with-machines / deduplify
A Python tool to search for and remove duplicated files in messy datasets
☆16 Updated 10 months ago
Alternatives and similar repositories for deduplify
Users that are interested in deduplify are comparing it to the libraries listed below
- ☆29 Updated 7 years ago
- Mutual Muses is a crowdsourced transcription project undertaken by the Digital Art History program at the Getty Research Institute ☆16 Updated 7 years ago
- For working on the recipes ☆42 Updated this week
- A platform-agnostic, configurable, and brandable SPARQL editor and visualization interface. ☆15 Updated 2 weeks ago
- Library Carpentry: OpenRefine ☆54 Updated this week
- Simple command line oai-pmh harvester written in Python. ☆41 Updated 3 years ago
- Public-facing data for the US Archives RepoData project ☆17 Updated 2 years ago
- An open source online storytelling platform for everyone. Built by Cogapp. ☆35 Updated last week
- Heritage Connector: Transforming text into data to extract meaning and make connections ☆24 Updated 2 years ago
- Files for the On The Books project ☆35 Updated last year
- Oral History/Qualitative Interview Data Analysis and Publication Tool ☆20 Updated 2 years ago
- LD4P Sinopia Project repo to hold docs, general issues, schemas, and related spec docs. ☆22 Updated 2 years ago
- A curated list of awesome Jupyter projects and guides from the GLAM community. ☆19 Updated 4 years ago
- ☆16 Updated last year
- The Canadian Writing Research Collaboratory (CWRC) is developing an in-browser text markup editor (CWRCWriter) for use by collaborative s… ☆24 Updated 7 years ago
- Grateful Data isn't programming code, but an online tutorial about data acquisition, cleaning and enriching, using publicly accessible da… ☆60 Updated 6 years ago
- ☆60 Updated 2 years ago
- OpenRefine reconciler for Research Organization Registry ☆13 Updated 7 months ago
- Locolligo is a single-page, browser-based javascript application to facilitate the formatting, linking, and geolocation of datasets, with… ☆14 Updated last year
- Bagit-based data packaging specification for dissemination of research data with useful human and machine readable metadata: "Make Data C… ☆39 Updated 6 years ago
- Tidy data for librarians ☆26 Updated this week
- Instructions, exercises and example data sets for Annif hands-on tutorial ☆42 Updated this week
- A curated list of various semantic web and linked data resources for heritage, humanities and art history practitioners. ☆114 Updated 2 years ago
- IIIF Presentation API 3 Python Library ☆35 Updated last month
- Python library to make creation of CIDOC CRM easier by mapping classes/predicates to python objects ☆52 Updated 2 years ago
- Documentation and Data related to the Linked Data and Wikidata Working Groups ☆24 Updated last year
- A framework for creating and displaying visual essays ☆55 Updated last year
- ☆47 Updated this week
- A Jekyll-based static site generator for archival description in JSON. ☆33 Updated 2 months ago
- Taxonomy of Digital Research Activities in the Humanities ☆111 Updated 2 weeks ago