Living-with-machines / deduplify
A Python tool to search for and remove duplicated files in messy datasets
☆16Updated 6 months ago
Alternatives and similar repositories for deduplify
Users who are interested in deduplify are comparing it to the libraries listed below.
- OpenAIRE Guidelines for Literature Repository Managers based on Dublin Core and DataCite Metadata Kernel☆13Updated last year
- Heritage Connector: Transforming text into data to extract meaning and make connections☆24Updated 2 years ago
- A platform-agnostic, configurable, and brandable SPARQL editor and visualization interface.☆13Updated 2 months ago
- Library Carpentry: OpenRefine☆53Updated last week
- OpenRefine reconciler for Research Organization Registry☆13Updated 2 months ago
- ☆28Updated 7 years ago
- MediaScape project researching the utility of Generous Interfaces for audiovisual archives☆10Updated 4 months ago
- Light-weight Linked Open Data native cataloguing and crowdsourcing platform☆18Updated 4 months ago
- Tidy data for librarians☆24Updated last week
- Bagit-based data packaging specification for dissemination of research data with useful human and machine readable metadata: "Make Data C…☆39Updated 6 years ago
- Jupyter notebooks with examples of querying different PID graphs and providers like OpenAlex, FREYA PID Graph, OpenAIRE, ORCID, ROR, Cros…☆24Updated 2 years ago
- A curated list of awesome Jupyter projects and guides from the GLAM community.☆19Updated 3 years ago
- Source code of BARTOC.org user interface☆25Updated last week
- An open source set of decks for learning about digital preservation.☆23Updated 5 years ago
- Data Mining Historical Newspaper Metadata (METS/ALTO formats)☆25Updated 2 years ago
- ☆12Updated last year
- LD4P Sinopia Project repo to hold docs, general issues, schemas, and related spec docs.☆22Updated 2 years ago
- Python for Librarians☆21Updated 6 years ago
- DatAasee - A Metadata-Lake for Libraries☆21Updated last month
- command line resource for working with digital primary sources☆27Updated 6 years ago
- ☆63Updated 2 years ago
- Web application for distributed compute analysis of Archive-It web archive collections.☆18Updated 3 months ago
- Python library to make creation of CIDOC CRM easier by mapping classes/predicates to python objects☆52Updated last year
- Provides an analytics capability for FOLIO libraries☆15Updated 10 months ago
- Oral History/Qualitative Interview Data Analysis and Publication Tool☆19Updated 2 years ago
- nnanno is a collection of tools that sample, annotate and apply computer vision to the Newspaper Navigator dataset☆17Updated 8 months ago
- Awesome AI in Libraries☆16Updated last year
- Public-facing data for the US Archives RepoData project☆18Updated last year
- IIIF Audio/Video Player☆14Updated last year
- DEPRECATED - no longer actively maintained. Automated workflow for harvesting, transforming and indexing of metadata using metha, OpenRef…☆19Updated 5 years ago