A Python tool to search for and remove duplicated files in messy datasets
☆16Dec 23, 2024Updated last year
Alternatives and similar repositories for deduplify
Users that are interested in deduplify are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆16Jul 10, 2019Updated 6 years ago
- The main repository for the Turing Commons platform☆20Mar 20, 2025Updated last year
- DaSCH application suite for the DaSCH Service Platform☆13Updated this week
- Materials for Turing's Research Data Science course☆31Dec 4, 2025Updated 3 months ago
- Establishing cross-community collaborations and promoting open research in data science☆24Aug 13, 2025Updated 7 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- A merged read deduplication tool capable to perform merged read deduplication on single end data.☆12Sep 4, 2024Updated last year
- The International Image Interoperability Framework (IIIF) Audio/Visual (A/V) Technical Specification Group aims to extend to A/V the bene…☆14Jun 5, 2017Updated 8 years ago
- annonatate☆12Nov 27, 2023Updated 2 years ago
- Deduplication for cfDNA sequencing data☆11Jul 5, 2017Updated 8 years ago
- a Mirador plugin that adds image manipulation tools to the user interface☆12Oct 13, 2025Updated 5 months ago
- Get a list of deduped files on a ZFS filesystem☆13Oct 14, 2020Updated 5 years ago
- Generate topic models from open text extracted from files in disk images☆10Apr 11, 2023Updated 2 years ago
- Create knowledge graphs with Markdown☆73Jan 9, 2025Updated last year
- The Data Science and AI Educators' Programme☆33Jul 3, 2024Updated last year
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- A paper-based game tailored addressing data literacy needs of the advocacy community.☆10Dec 12, 2025Updated 3 months ago
- Policy Skills Award project with TPS and Skills team - Professionalising traditional and infrastructure research roles in data science☆11Oct 13, 2024Updated last year
- Repo of the Turing's Humanities & Data Science Discussion Group☆13Jul 21, 2022Updated 3 years ago
- Turn a folder of images into a working IIIF setup – in a minute or less!☆60Mar 3, 2026Updated 3 weeks ago
- Repository for revision of PREMIS OWL ontology group☆13May 12, 2022Updated 3 years ago
- Documentation for Project Electron☆14Dec 2, 2024Updated last year
- PHP package for the IIIF Image API 3☆20Jan 12, 2026Updated 2 months ago
- ☆11Oct 6, 2020Updated 5 years ago
- Python language binding for the Preservica API☆21Mar 2, 2026Updated 3 weeks ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Creates MPEG-DASH and HLS streams from a video file.☆19Jun 25, 2019Updated 6 years ago
- ☆10Jul 15, 2022Updated 3 years ago
- String deduplication package for Go☆19Jan 10, 2024Updated 2 years ago
- 文档去重功能是为了解决搜索引擎的文档语义重复的问题,方法是多重哈希下的语义指纹算法。☆12Aug 17, 2013Updated 12 years ago
- Marble is the design system of The Metropolitan Museum of Art 🏛☆21Mar 6, 2023Updated 3 years ago
- ☆14Jul 8, 2025Updated 8 months ago
- Scripts using NLP for processing born-digital archival collections☆10Feb 18, 2019Updated 7 years ago
- ☆16Jun 20, 2024Updated last year
- Dutch Digital Heritage Network virtual research environment set up and provisioning☆16Jan 21, 2026Updated 2 months ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- The text for https://www.tiki-toki.com/timeline/entry/1753034/A-History-of-Research-Ethics/☆19Nov 1, 2023Updated 2 years ago
- ☆16Feb 6, 2025Updated last year
- Find duplicate text files.☆15Jan 14, 2025Updated last year
- A directory of companies, people, and projects that are Open Source and from Berlin☆11May 3, 2017Updated 8 years ago
- Command-line tools to support meta-analysis using a library managed in Zotero☆11Feb 9, 2023Updated 3 years ago
- The Collection Services Manual for the Stuart A. Rose Manuscript, Archives, and Rare Book Library at Emory University☆11Mar 9, 2026Updated 2 weeks ago
- Listen to the weather using Sonic Pi and data from Mathematica☆11Dec 6, 2018Updated 7 years ago