ncn-foreigners / BlockingPy
Blocking records for record linkage and data deduplication based on ANN algorithms in Python.
☆12Updated last week
Alternatives and similar repositories for BlockingPy:
Users who are interested in BlockingPy are comparing it to the libraries listed below.
- Prototype search engine for ONS bulletins☆23Updated 11 months ago
- Similarity and distance measures for clustering and record linkage applications in R☆18Updated 3 years ago
- Lightweight validation tool for checking function arguments and data analysis scripts.☆11Updated 3 months ago
- ☆10Updated 3 years ago
- Implements an algorithm for Latent Dirichlet Allocation using style conventions from the [tidyverse](https://style.tidyverse.org/) and […☆41Updated 2 months ago
- A Quarto Extension to run sql examples interactively☆35Updated last year
- pseudopeople is a Python package that generates realistic simulated data about a fictional United States population, designed for use in …☆20Updated this week
- Introduction to DuckDB and Polars☆23Updated 4 months ago
- Clustering and Link Prediction Evaluation in R☆12Updated last year
- Sampling Methods for Big Data☆10Updated 6 years ago
- High-dimensional fixed effect estimation with pytorch☆18Updated 4 years ago
- Perform Bayesian record linkage with a one-to-one matching assumption.☆11Updated 4 years ago
- Fixed-effects estimations☆8Updated 10 months ago
- ☆11Updated 7 months ago
- Every big regression is a small regression with weights.☆41Updated last month
- Source code for spatial analysis website☆17Updated 2 years ago
- In this fork I only work on the r/ directory; please refer to the upstream repo for all of Arrow.☆15Updated 3 years ago
- A repository for nowcasting with signature methods☆25Updated last year
- Transform base maps using log-azimuthal projection☆16Updated 10 months ago
- An R package "rfinterval": Predictive Inference on Random Forests☆13Updated 5 years ago
- An R interface to Rust's h3o library☆23Updated 3 months ago
- Python interface to quarto-cli☆20Updated 9 months ago
- Quickly Extract and Marginalize U.S. Census Tables☆15Updated last month
- A collection of network analytic (helper) functions that do not deserve a package on their own☆14Updated 5 months ago
- Partial content templates for Quarto☆18Updated last week
- A back-end agnostic spatial data frame inspired by rust trait implementations☆27Updated last year
- ☆16Updated last year
- emdi: estimating and mapping regionally disaggregated indicators☆15Updated 9 months ago
- Create and plot the Tissot Indicatrix☆17Updated last year
- Generate assumptions and caveats logs from code comments.☆9Updated 3 years ago