Living-with-machines / deduplify
A Python tool to search for and remove duplicated files in messy datasets
☆16Updated 3 months ago
Alternatives and similar repositories for deduplify:
Users that are interested in deduplify are comparing it to the libraries listed below
- Public-facing data for the US Archives RepoData project☆17Updated last year
- OpenRefine reconciler for Research Organization Registry☆13Updated last week
- Source code of BARTOC.org user interface☆26Updated 2 weeks ago
- Web application to try out reconciliation services interactively☆13Updated last month
- OpenAIRE Guidelines for Literature Repository Managers based on Dublin Core and DataCite Metadata Kernel☆12Updated last year
- MediaScape project researching the utility of Generous Interfaces for audiovisual archives☆10Updated 2 months ago
- Data Mining Historical Newspaper Metadata (METS/ALTO formats)☆25Updated 2 years ago
- Bagit-based data packaging specification for dissemination of research data with useful human and machine readable metadata: "Make Data C…☆38Updated 5 years ago
- Instructions, exercises and example data sets for Annif hands-on tutorial☆40Updated 2 months ago
- OpenRefine for Social Science Data☆24Updated this week
- OpenCitations Meta Software is the software that manages OpenCitations Meta. OpenCitations Meta is the bibliographic database containing …☆9Updated 2 months ago
- An open source online storytelling platform for everyone. Built by Cogapp.☆27Updated last month
- Provides an analytics capability for FOLIO libraries☆16Updated 7 months ago
- The DDI Discovery Vocabulary, an RDF vocabulary for data description and discovery based on DDI☆25Updated last year
- Awesome AI in Libraries☆16Updated last year
- Small Python library to validate persistent identifiers used in scholarly communication.☆29Updated 2 weeks ago
- Named-Entity Recognition extension for OpenRefine☆27Updated 2 years ago
- ShEx interpreter for ShEx 2.0☆25Updated 2 years ago
- DEPRECATED - no longer actively maintained. Automated workflow for harvesting, transforming and indexing of metadata using metha, OpenRef…☆19Updated 5 years ago
- Python library to make creation of CIDOC CRM easier by mapping classes/predicates to python objects☆52Updated last year
- A curated list of software, tools, resources and projects by and for libraries.☆16Updated 4 years ago
- 💠 An index for linked open data & standard knowledge descriptions (ontologies, vocabularies, shapes, queries, mappings)☆42Updated last year
- Simple command line oai-pmh harvester written in Python.☆41Updated 2 years ago
- Documents for the project Libraccess☆13Updated 10 years ago
- Integrated CSV to RDF converter, using CSVW and nanopublications☆47Updated 11 months ago
- Jupyter notebooks with examples of querying different PID graphs and providers like OpenAlex, FREYA PID Graph, OpenAIRE, ORCID, ROR, Cros…☆23Updated 2 years ago
- Heritage Connector: Transforming text into data to extract meaning and make connections☆24Updated 2 years ago
- Visual Studio Code SPARQL Notebook Extension☆28Updated last month
- ☆28Updated 7 years ago
- RDF describing Creative Commons licenses☆13Updated last year