data61 / blocklib
Python implementations of record linkage blocking techniques.
☆20 · Updated last year
Alternatives and similar repositories for blocklib:
Users who are interested in blocklib are comparing it to the libraries listed below.
- CLK hash: hash pii for entity matching ☆47 · Updated last week
- Privacy Preserving Record Linkage Service ☆26 · Updated 2 years ago
- Python implementation of anonymous linkage using cryptographic linkage keys ☆65 · Updated 10 months ago
- ☆13 · Updated 5 years ago
- A maximum-strength name parser for record linkage. ☆36 · Updated last month
- Python wrapper for a C++ Double Metaphone ☆15 · Updated 2 years ago
- Datasette plugin for authenticating access using API tokens ☆11 · Updated 6 months ago
- Curated list of awesome software and resources for Senzing, The First Real-Time AI for Entity Resolution. ☆56 · Updated 3 months ago
- API client for Aleph, supports bulk entity and document upload. ☆28 · Updated 5 months ago
- utilities for filesystem exploration and automated builds ☆21 · Updated last month
- Set-oriented Operations in Pandas ☆24 · Updated 4 years ago
- Collaborative NLP annotation tool supporting enterprise authentication, inter-annotator statistics, active learning ☆13 · Updated 2 years ago
- This repository contains code to build an MVP search engine with google like interface. ☆15 · Updated 4 years ago
- Write Datasette canned queries as plain SQL files ☆13 · Updated 2 years ago
- Code that accompanies the PyData New York (2022) talk: Addressing the sensitivity of Large language models ☆13 · Updated 2 years ago
- https://mimesniff.spec.whatwg.org/ implementation for Python ☆13 · Updated last year
- stemgraphic python package for visualization of data and text ☆18 · Updated 4 years ago
- Exploration of the U.S. rulesets as a network ☆15 · Updated 2 years ago
- Scrapers for US municipal governments. ☆10 · Updated last year
- A simple command line interface to the datamade/dedupe library. ☆42 · Updated 2 years ago
- A browser user interface for manual labeling of record pairs. ☆45 · Updated last year
- Code metrics for Python code. ☆11 · Updated 10 years ago
- ☀️🦶 A lightweight framework for collaborative, open-source feature engineering ☆32 · Updated 3 years ago
- Smart Arguments Suite (smart-arg) is a slim and handy python lib that helps one work safely and conveniently with command line arguments. ☆23 · Updated 3 years ago
- RESTful API around the PETRARCH coding software ☆10 · Updated 3 years ago
- A financial disclosure data extraction tool. ☆14 · Updated last year
- An open source data analysis platform with features for users with a range of technical skills ☆47 · Updated this week
- pyspark-parallelised functions producing graph-theoretical metrics in connected component clusters for use in record-linkage (or other do… ☆10 · Updated last year
- Plugin for Intake to read from SQL servers ☆15 · Updated last year
- Tools for building SQLite databases from files and directories ☆12 · Updated last year