ncn-foreigners / blocking
An R package for blocking records for record linkage / data deduplication based on approximate nearest neighbours algorithms.
☆14 Updated last month
Alternatives and similar repositories for blocking
Users who are interested in blocking are comparing it to the libraries listed below.
- Similarity and distance measures for clustering and record linkage applications in R ☆18 Updated 4 months ago
- ☆11 Updated last month
- Record Linkage Toolkit for R ☆46 Updated 2 weeks ago
- Fast Wild Cluster Bootstrap Inference for Regression Models / OLS in R. Additionally, R port to WildBootTests.jl via the JuliaConnectoR. ☆30 Updated last year
- Quickly Extract and Marginalize U.S. Census Tables ☆17 Updated 3 weeks ago
- autumn: Fast, Modern, and Tidy-Friendly Iterative Raking in R. ☆45 Updated last year
- ☆49 Updated 3 months ago
- ☆17 Updated 4 years ago
- Run Expressions Across Package Versions ☆35 Updated 3 weeks ago
- 📊 R package for computing and visualizing fair ML metrics ☆32 Updated last month
- Build vega-lite specs in R ☆48 Updated 2 years ago
- WIP!! R Package to Write Posts from R Markdown to Wordpress ☆33 Updated 2 years ago
- Allow R developers to have multiple R folders inside an R package ☆24 Updated last year
- Probabilistic Record Linkage in R ☆59 Updated 3 years ago
- 🍎 R wrapper to New York Times APIs ☆16 Updated last year
- Continuous integration ☆22 Updated this week
- Compare run times for various data frame packages ☆20 Updated 2 months ago
- Pretty Console Output for Tables ☆39 Updated 3 years ago
- An easier way to tidy pivoted tables. ☆29 Updated 5 years ago
- Machine learning explanations ☆24 Updated 3 months ago
- Extract named substrings using named capture groups in regular expressions. ☆34 Updated 4 years ago
- An R package for modern methods for non-probability samples ☆52 Updated 2 months ago
- R wrapper for the datengui.de GraphQL API to easily access German regional statistics ☆26 Updated 4 years ago
- SAE Unit/area Models and Methods for Estimation in R ☆25 Updated 4 months ago
- R package for 'Efficient Learning of Word Representations and Sentence Classification' ☆45 Updated last week
- Fast Naive Bayes implementation in R ☆42 Updated 5 years ago
- A data cube dplyr backend ☆40 Updated 3 years ago
- See pacha.dev/capybara for a much better GLM implementation. Efficient Fitting of Linear and Generalized Linear Models by using just base… ☆19 Updated last year
- A Lightweight, Flexible, and Fast Data Validation Package that Can Handle All Sizes of Data ☆28 Updated 2 years ago
- Superlatively-fast fuzzy-joins in R ☆106 Updated 2 months ago