Command line tool for deduplicating CSV files
☆434Mar 31, 2020Updated 6 years ago
Alternatives and similar repositories for csvdedupe
Users that are interested in csvdedupe are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A python library for accurate and scalable fuzzy matching, record deduplication and entity-resolution.☆4,485Jul 29, 2025Updated 11 months ago
- Examples for using the dedupe library☆417Aug 10, 2024Updated last year
- A powerful and modular toolkit for record linkage and duplicate detection in Python☆1,055Feb 21, 2024Updated 2 years ago
- Demonstration of how dedupe might be used as geocoder☆17Jun 21, 2022Updated 4 years ago
- Pipeline for image classification at The Norwegian National Museum and zooming display mechanism.☆14Nov 3, 2017Updated 8 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- A list of free data matching and record linkage software.☆406Feb 21, 2024Updated 2 years ago
- Investigative tool for extracting relevant areas from many documents☆14Nov 17, 2015Updated 10 years ago
- Notebook and companion R script for the "R Basics: Stats" session at NICAR 2016.☆11Mar 13, 2016Updated 10 years ago
- Making Data, the DataMade Way☆290Feb 3, 2021Updated 5 years ago
- Perform Bayesian record linkage with a one-to-one matching assumption.☆11Jul 9, 2020Updated 5 years ago
- For watching a set of URLs and notifying someone when something has changed.☆32Jun 12, 2017Updated 9 years ago
- A toolkit for making domain-specific probabilistic parsers