A simple command line interface to the datamade/dedupe library.
☆43Dec 26, 2022Updated 3 years ago
Alternatives and similar repositories for pgdedupe
Users that are interested in pgdedupe are comparing it to the libraries listed below
Sorting:
- Demonstration of how dedupe might be used as geocoder☆17Jun 21, 2022Updated 3 years ago
- Discover and parse results for jurisdictions that use Clarity-based election systems.☆38Nov 12, 2025Updated 4 months ago
- A Django 2.1 project to reproduce WebKit Bug 188165 and Django Ticket #30250☆15Mar 29, 2019Updated 6 years ago
- A demonstration application showcasing SF's updated building footprints☆19Oct 14, 2019Updated 6 years ago
- GPTBundle, a React application toolkit, harnesses AI to convert textual content into structured forms and delivers advanced autofill sugg…☆22Mar 27, 2024Updated last year
- A simple app to add OAuth-based authentication in front of an S3 bucket-based static website.☆11Dec 8, 2022Updated 3 years ago
- Learning String Alignments for Entity Aliases☆37Mar 21, 2019Updated 7 years ago
- A powerful and modular toolkit for record linkage and duplicate detection in Python☆1,048Feb 21, 2024Updated 2 years ago
- ☆16Jun 7, 2018Updated 7 years ago
- Machine assisted dossiers☆19Oct 12, 2017Updated 8 years ago
- Checklist para propostas de palestras para Python Brasil☆26Apr 1, 2019Updated 6 years ago
- Template for building a Singer Target☆20Sep 3, 2024Updated last year
- The engine behind Vinta's Lessons Learned page.☆38Dec 26, 2022Updated 3 years ago
- POLITICO's system for managing civic data☆20Dec 7, 2022Updated 3 years ago
- A browser user interface for manual labeling of record pairs.☆48Jun 23, 2023Updated 2 years ago
- Tool to automatically fix some issues reported by flake8 (previously forked from autoflake).☆23Jul 29, 2023Updated 2 years ago
- Python 3+ csv file validation framework☆12Oct 2, 2022Updated 3 years ago
- PyCon 2017 talk about using abstraction to help with Library UX☆12May 20, 2017Updated 8 years ago
- A maximum-strength name parser for record linkage.☆39Sep 3, 2025Updated 6 months ago
- variations of the record linkage model of Steorts et al. AISTATS 2014's "SMERED: A Bayesian Approach to Graphical Record Linkage and De-d…☆26Mar 13, 2017Updated 9 years ago
- A Django storage backend that names files by hash value.☆39May 25, 2023Updated 2 years ago
- Utilities for working with Django's prefetch_related system☆16Jan 12, 2022Updated 4 years ago
- A simple pytest plugin to disable network on socket level.☆15Jan 12, 2021Updated 5 years ago
- A python library for accurate and scalable fuzzy matching, record deduplication and entity-resolution.☆4,445Jul 29, 2025Updated 7 months ago
- Now included in rigour☆151Nov 24, 2025Updated 3 months ago
- FHIR-native live chat mobile app built with React Native and Medplum☆22Feb 17, 2025Updated last year
- Core library for the datakit CLI framework.☆58Dec 12, 2022Updated 3 years ago
- Workflow, visualizations and data services for managing NGO projects and programs☆11Dec 16, 2022Updated 3 years ago
- Geospatial Extensions for Pyramid☆28Mar 13, 2026Updated last week
- DBT and clickhouse test project with dagster☆12Aug 29, 2023Updated 2 years ago
- Repository for performing Blocking using Deep Learning based on the paper "Deep Learning for Blocking in Entity Matching: A Design Space …☆31Apr 5, 2023Updated 2 years ago
- a python library for parsing unstructured western names into name components.☆617May 15, 2025Updated 10 months ago
- Record matching and entity resolution at scale in Spark☆36Oct 31, 2023Updated 2 years ago
- A simple HTTP server for queuing lines of text☆11Mar 27, 2017Updated 8 years ago
- CAL-ACCESS Campaign Power Search☆13Nov 2, 2017Updated 8 years ago
- LSHDB is a parallel and distributed data engine, which relies on Locality-Sensitive Hashing and noSQL systems, for performing record link…☆32Aug 30, 2022Updated 3 years ago
- A collection of CSV/TSV Utilities☆13Jun 2, 2020Updated 5 years ago
- Remotely accessible IPython-enabled debugger☆31Mar 27, 2022Updated 3 years ago
- Services for working with MDS Provider data, built as runnable Docker containers.☆16May 21, 2020Updated 5 years ago