Living-with-machines / deduplify
A Python tool to search for and remove duplicated files in messy datasets
☆15Updated 3 weeks ago
Related projects
Alternatives and complementary repositories for deduplify
- OpenAIRE Guidelines for Literature Repository Managers based on Dublin Core and DataCite Metadata Kernel☆12Updated last year
- Public-facing data for the US Archives RepoData project☆17Updated last year
- OpenRefine reconciler for Research Organization Registry☆13Updated last year
- ☆12Updated 7 months ago
- ☆28Updated 6 years ago
- Awesome AI in Libraries☆16Updated last year
- An open source online storytelling platform for everyone. Built by Cogapp.☆25Updated last week
- The International Image Interoperability Framework (IIIF) Audio/Visual (A/V) Technical Specification Group aims to extend to A/V the bene…☆13Updated 7 years ago
- A standalone React/Redux web application for presenting unique printed books and manuscripts in digital facsimile.☆32Updated last year
- ☆29Updated 6 years ago
- Humanities Data Curation Record☆11Updated 7 years ago
- Data Mining Historical Newspaper Metadata (METS/ALTO formats)☆24Updated 2 years ago
- Service for creating Twitter datasets for research and archiving.☆26Updated last year
- Simple command-line OAI-PMH harvester written in Python.☆41Updated 2 years ago
- command line resource for working with digital primary sources☆27Updated 6 years ago
- Patterns based on the W3C Web Annotation Model, primarily for use in linking resources describing historical phenomena with the places re…☆11Updated 4 years ago
- Jupyter notebooks with examples of querying different PID graphs and providers like OpenAlex, FREYA PID Graph, OpenAIRE, ORCID, ROR, Cros…☆22Updated last year
- No longer maintained. Please use conciliator instead.☆26Updated 4 years ago
- OpenRefine for Social Science Data☆23Updated this week
- World Historical Gazetteer platform☆18Updated 2 months ago
- Library Carpentry: OpenRefine☆52Updated this week
- Bagit-based data packaging specification for dissemination of research data with useful human and machine readable metadata: "Make Data C…☆38Updated 5 years ago
- Tidy data for librarians☆22Updated this week
- The Pelagios Exploration Engine☆21Updated 3 years ago
- A Data Parsing/Data Manipulation Tool Supporting Digitization Projects and Other Data Analysis Projects☆47Updated 5 years ago
- Named-Entity Recognition extension for OpenRefine☆24Updated last year
- A framework for creating and displaying visual essays☆52Updated 8 months ago
- Locolligo is a single-page, browser-based JavaScript application to facilitate the formatting, linking, and geolocation of datasets, with…☆14Updated 9 months ago
- A curated list of awesome Jupyter projects and guides from the GLAM community.☆19Updated 3 years ago