data61 / blocklib
Python implementations of record linkage blocking techniques.
☆20Updated last year
Alternatives and similar repositories for blocklib:
Users that are interested in blocklib are comparing it to the libraries listed below
- Python implementation of anonymous linkage using cryptographic linkage keys☆65Updated 11 months ago
- Privacy Preserving Record Linkage Service☆26Updated 2 years ago
- ☆13Updated 6 years ago
- ☆16Updated 7 years ago
- A maximum-strength name parser for record linkage.☆37Updated this week
- A tool to read CSV files with CSVW metadata and transform them into other formats.☆32Updated 6 years ago
- A few end to end examples that use data-describe☆16Updated 2 years ago
- Datasette plugin for authenticating access using API tokens☆12Updated 8 months ago
- A selection of business datasets☆18Updated 5 years ago
- API client for Aleph, supports bulk entity and document upload.☆28Updated 6 months ago
- ☆14Updated this week
- API to validate CSV files and create schemas for compliance with established norms such as RFC4180☆11Updated 5 years ago
- Collaboration app for sharing and reviewing jupyter notebooks☆16Updated 2 years ago
- Interactive notebooks containing demonstration code of the splink library☆38Updated last year
- ☆15Updated 2 years ago
- Fundamental Accounting Concept Relations validation for International Financial Reporting Standards (IFRS).☆12Updated 6 years ago
- Graphistry admin docs: launch, configure, use, & debug☆26Updated last month
- Collaborative NLP annotation tool supporting enterprise authentication, inter-annotator statistics, active learning☆13Updated 2 years ago
- A financial disclosure data extraction tool.☆16Updated last year
- ☀️🦶 A lightweight framework for collaborative, open-source feature engineering☆33Updated 3 years ago
- Using NLP to find and extract specific information from long, unstructured documents☆15Updated 6 years ago
- Chatlytics is a data query and visualization platform for chat!☆13Updated 8 years ago
- A Python package that simplifies the use of secrets in a Jupyter notebook☆21Updated 3 years ago
- Probabilistic Entity Matching in Python☆13Updated 8 years ago
- Abstractions for feature engineering on large graphs of tabular data.☆21Updated last week
- Privacy preserving synthetic data generation workflows☆20Updated 3 years ago
- A benchmark of globally-optimal anonymization methods for biomedical data☆16Updated 10 years ago
- Record matching and entity resolution at scale in Spark☆34Updated last year
- Tool to cleanse and semantify datasets from CKAN repositories. Based on OpenRefine.☆23Updated 9 years ago
- Scalable String Similarity Joins in Python☆39Updated 9 months ago