Living-with-machines / deduplifyLinks
A Python tool to search for and remove duplicated files in messy datasets
☆16Updated 6 months ago
Alternatives and similar repositories for deduplify
Users that are interested in deduplify are comparing it to the libraries listed below
Sorting:
- Bagit-based data packaging specification for dissemination of research data with useful human and machine readable metadata: "Make Data C…☆39Updated 6 years ago
- OpenAIRE Guidelines for Literature Repository Managers based on Dublin Core and DataCite Metadata Kernel☆13Updated last year
- Web application for distributed compute analysis of Archive-It web archive collections.☆19Updated 3 months ago
- Library Carpentry: OpenRefine☆53Updated last week
- Heritage Connector: Transforming text into data to extract meaning and make connections☆24Updated 2 years ago
- Named-Entity Recognition extension for OpenRefine☆29Updated 2 years ago
- A curated list of software, tools, resources and projects by and for libraries.☆16Updated 5 years ago
- All that entity matching, resolution, normalization, enhancement and reconciliation madness, but with a focus on data, not platforms.☆24Updated 3 years ago
- DEPRECATED - no longer actively maintained. Automated workflow for harvesting, transforming and indexing of metadata using metha, OpenRef…☆19Updated 5 years ago
- An open source online storytelling platform for everyone. Built by Cogapp.☆28Updated 4 months ago
- A platform-agnostic, configurable, and brandable SPARQL editor and visualization interface.☆14Updated 2 weeks ago
- Data Mining Historical Newspaper Metadata (METS/ALTO formats)☆25Updated 2 years ago
- Python library to make creation of CIDOC CRM easier by mapping classes/predicates to python objects☆52Updated last year
- Small Python library to validate persistent identifiers used in scholarly communication.☆29Updated this week
- ☆63Updated 2 years ago
- Provides an analytics capability for FOLIO libraries☆15Updated 10 months ago
- OpenRefine reconciler for Research Organization Registry☆13Updated 3 months ago
- OpenRefine for Social Science Data☆25Updated last week
- Simple command line oai-pmh harvester written in Python.☆41Updated 2 years ago
- A curated list of various semantic web and linked data resources for heritage, humanities and art history practitioners.☆114Updated 2 years ago
- Library Carpentry: Introduction to Working with Data (Regular Expressions)☆32Updated last week
- Carnegie Hall Rose Archives maintains a series of scripts to transform its historical performance history data from its source in a SQL d…☆24Updated 8 months ago
- Metadata ingestion system for Digital Public Library of America☆31Updated last week
- Service for creating Twitter datasets for research and archiving.☆26Updated 2 years ago
- A framework for creating and displaying visual essays☆54Updated last year
- ☆29Updated 7 years ago
- Web application to try out reconciliation services interactively☆13Updated 3 weeks ago
- A deep learning architecture for reference mining from literature in the arts and humanities.☆16Updated 5 years ago
- Library Carpentry Wikidata☆25Updated last week
- DatAasee - A Metadata-Lake for Libraries☆21Updated 2 months ago