Living-with-machines / deduplify
A Python tool to search for and remove duplicated files in messy datasets
☆16 Updated 5 months ago
Alternatives and similar repositories for deduplify
Users who are interested in deduplify are comparing it to the repositories listed below.
- OpenAIRE Guidelines for Literature Repository Managers based on Dublin Core and DataCite Metadata Kernel ☆13 Updated last year
- Public-facing data for the US Archives RepoData project ☆18 Updated last year
- Heritage Connector: Transforming text into data to extract meaning and make connections ☆24 Updated 2 years ago
- Library Carpentry: OpenRefine ☆53 Updated this week
- Bagit-based data packaging specification for dissemination of research data with useful human and machine readable metadata: "Make Data C… ☆39 Updated 6 years ago
- ☆28 Updated 7 years ago
- Web application for distributed compute analysis of Archive-It web archive collections. ☆18 Updated 2 months ago
- Simple command line oai-pmh harvester written in Python. ☆41 Updated 2 years ago
- Jupyter notebooks with examples of querying different PID graphs and providers like OpenAlex, FREYA PID Graph, OpenAIRE, ORCID, ROR, Cros… ☆24 Updated 2 years ago
- OpenRefine reconciler for Research Organization Registry ☆13 Updated 2 months ago
- Tidy data for librarians ☆24 Updated this week
- Metadata Quality Assessment Framework API ☆18 Updated this week
- ☆29 Updated 7 years ago
- ☆62 Updated 2 years ago
- A platform-agnostic, configurable, and brandable SPARQL editor and visualization interface. ☆13 Updated last month
- DC Tabular Application Profile ☆34 Updated 7 months ago
- curation workflow automation and coordination ☆42 Updated 4 months ago
- Python for Librarians ☆21 Updated 6 years ago
- Source code of BARTOC.org user interface ☆25 Updated last month
- DEPRECATED - no longer actively maintained. Automated workflow for harvesting, transforming and indexing of metadata using metha, OpenRef… ☆19 Updated 5 years ago
- ☆12 Updated last year
- LD4P Sinopia Project repo to hold docs, general issues, schemas, and related spec docs. ☆22 Updated 2 years ago
- Library Carpentry: Introduction to Working with Data (Regular Expressions) ☆32 Updated this week
- For working on the recipes ☆40 Updated this week
- All that entity matching, resolution, normalization, enhancement and reconciliation madness, but with a focus on data, not platforms. ☆24 Updated 3 years ago
- Small Python library to validate persistent identifiers used in scholarly communication. ☆29 Updated 3 weeks ago
- A curated list of awesome Jupyter projects and guides from the GLAM community. ☆19 Updated 3 years ago
- Sinopia Linked Data Editor ☆38 Updated this week
- Light-weight Linked Open Data native cataloguing and crowdsourcing platform ☆18 Updated 3 months ago
- Python library to make creation of CIDOC CRM easier by mapping classes/predicates to python objects ☆52 Updated last year