ncn-foreigners / blocking
An R package for blocking records for record linkage / data deduplication based on approximate nearest neighbours algorithms.
☆13 Updated last month
Alternatives and similar repositories for blocking
Users who are interested in blocking are comparing it to the libraries listed below.
- Similarity and distance measures for clustering and record linkage applications in R ☆18 Updated 3 years ago
- Construct natural-language lists with internationalization in R ☆20 Updated 4 months ago
- Record Linkage Toolkit for R ☆43 Updated last year
- A library of functions enabling complex corpus search in context (KWIC), search aggregation, bag-of-words building & keyphrase extraction… ☆20 Updated 6 years ago
- Quickly Extract and Marginalize U.S. Census Tables ☆16 Updated 4 months ago
- Machine learning explanations ☆23 Updated 3 months ago
- An R-package to build nesting or hierarchical structures ☆13 Updated 3 years ago
- Convert alternative country name to simple country names ☆11 Updated 5 years ago
- Tools for Working with ZIP Codes, ZCTAs, and 3-digit ZCTAs (R package) ☆12 Updated 2 months ago
- An easier way to tidying pivoted tables. ☆29 Updated 5 years ago
- See pacha.dev/capybara for a much better GLM implementation. Efficient Fitting of Linear and Generalized Linear Models by using just base… ☆19 Updated 11 months ago
- Visual tools to help machine learning model selection ☆15 Updated 4 years ago
- ☆45 Updated 6 years ago
- Survey statistics in a database ☆12 Updated 10 months ago
- Show Diffs Between Piped Steps ☆20 Updated 3 years ago
- Extract named substrings using named capture groups in regular expressions. ☆34 Updated 4 years ago
- Run Functions Across Package Versions ☆17 Updated this week
- R package for styling graphics for RSS publications. ☆15 Updated last year
- Continuous integration ☆20 Updated last year
- Magical string interpolation ☆17 Updated 2 months ago
- Word Factor Vectors ☆32 Updated 5 years ago
- ☆23 Updated 7 months ago
- Extend tinytest with diffobj ☆22 Updated 5 months ago
- Graphical Display For Data Frame Output ☆17 Updated 3 months ago
- Formatting extension for Quarto ☆12 Updated last year
- Fast Wild Cluster Bootstrap Inference for Regression Models / OLS in R. Additionally, R port to WildBootTests.jl via the JuliaConnectoR. ☆28 Updated 11 months ago
- Hugging face tokenizers for R using extendr ☆11 Updated 2 years ago
- https://rstudio.github.io/connections/ ☆57 Updated last year
- Work-in-progress R package for handling partial datetimes ☆17 Updated last year
- and interface for plotting calendar months with date input in ggplot2 ☆35 Updated 10 months ago