data61 / blocklib
Python implementations of record linkage blocking techniques.
☆19 Updated last year
Related projects
Alternatives and complementary repositories for blocklib
- CLK hash: hash PII for entity matching ☆47 Updated last year
- Privacy Preserving Record Linkage Service ☆26 Updated last year
- Python implementation of anonymous linkage using cryptographic linkage keys ☆63 Updated 5 months ago
- ☆13 Updated 5 years ago
- Python wrapper for a C++ Double Metaphone ☆15 Updated last year
- A maximum-strength name parser for record linkage. ☆32 Updated 3 months ago
- A financial disclosure data extraction tool. ☆13 Updated last year
- Burglary prediction for mortals ☆10 Updated 5 months ago
- Performs unique entity estimation corresponding to Chen, Shrivastava, Steorts (2018). ☆14 Updated 5 years ago
- Collaboration app for sharing and reviewing jupyter notebooks ☆16 Updated last year
- JupyterLite as a Datasette plugin ☆11 Updated 3 years ago
- A Python package that simplifies the use of secrets in a Jupyter notebook ☆21 Updated 3 years ago
- Plugin for Intake to read from SQL servers ☆15 Updated last year
- Record matching and entity resolution at scale in Spark ☆31 Updated last year
- Apache Spark based framework for analysis A/B experiments ☆11 Updated last week
- Datasette plugin for authenticating access using API tokens ☆12 Updated 2 months ago
- Graphistry admin docs: launch, configure, use, & debug ☆23 Updated 2 weeks ago
- Curated list of awesome software and resources for Senzing, The First Real-Time AI for Entity Resolution. ☆51 Updated last week
- Set-oriented Operations in Pandas ☆24 Updated 4 years ago
- Reddit Gender Text-Classification. ☆11 Updated last year
- Datasette plugin for searching all searchable tables at once ☆19 Updated 2 months ago
- A tool to read CSV files with CSVW metadata and transform them into other formats. ☆32 Updated 5 years ago
- Stanford Entity-Resolution Framework ☆23 Updated 6 years ago
- Examples of vector DB indexing and query with various vector databases. ☆12 Updated 3 weeks ago
- Advanced data wrangling for python ☆11 Updated last year
- Generate Elasticsearch indexes based on Table Schema descriptors. ☆10 Updated 3 years ago
- Online service for analyzing research profiles of scientists and conferences ☆12 Updated 2 years ago
- JedAI-WebApp is a GUI that facilitates the execution of JedAI. JedAI is an open source, high scalability toolkit that offers out-of-the-b… ☆23 Updated last year
- Jupyterlab extension to publish to Kyso ☆2 Updated last year