cleanzr / record-linkage-tutorial
A tutorial on entity resolution (record linkage or de-duplication)
☆63Updated 4 years ago
Alternatives and similar repositories for record-linkage-tutorial:
Users that are interested in record-linkage-tutorial are comparing it to the libraries listed below
- Fast, flexible name matching for large datasets☆71Updated last year
- Probabilistic Record Linkage in R☆59Updated 2 years ago
- DEPRECATED - The Concept Mover's Distance Method is now available in the text2map package. Concept Mover's Distance is a way to measure…☆27Updated 3 years ago
- An R package to assess the effects of text preprocessing decisions.☆66Updated 3 years ago
- Inverse regression analysis of text☆29Updated 7 years ago
- Record Linkage Toolkit for R☆43Updated last year
- Methods and thoughts on defining geographic markets for health care services, i.e., a guided tour of a particularly complex rabbit hole.☆42Updated last year
- R package associated with Benoit, Munger and Spirling (2017) paper(s)☆43Updated 3 years ago
- R package fastLink: Fast Probabilistic Record Linkage☆279Updated last year
- Classify names by gender, U.S. ethnicity, or leaf nationality☆19Updated 6 years ago
- Visualizations for the stm package☆59Updated 9 years ago
- R package to Embed All the Things! using StarSpace☆101Updated last year
- R backbone package - Extract the backbone from weighted and unweighted networks☆41Updated 3 months ago
- An R package to gather, munge, and convert event datasets into temporal event-networks.☆11Updated 6 years ago
- NamSor API v2 R SDK - classify personal names accurately by gender, country of origin, or ethnicity.☆12Updated 4 years ago
- Extract effects from estimateEffect in the stm package☆48Updated 4 years ago
- An integrated framework in R for textual sentiment time series aggregation and prediction☆83Updated 3 years ago
- Similarity and distance measures for clustering and record linkage applications in R☆18Updated 3 years ago
- Variable importance through targeted causal inference, with Alan Hubbard☆57Updated 2 years ago
- An R corpus class for tokenized texts☆31Updated 6 months ago
- Text-Based Ideal Points☆44Updated 2 years ago
- A rolling version of the Latent Dirichlet Allocation.☆12Updated last year
- A PhD-level workshop & coding syllabus for teaching Social Network Analysis (SNA) in R☆26Updated 3 years ago
- Implements an algorithim for Latent Dirichlet Allocation using style conventions from the [tidyverse](https://style.tidyverse.org/) and […☆41Updated 2 months ago
- R-package for text mining with the Corpus Workbench (CWB) as backend☆50Updated 6 months ago
- ☆17Updated 2 years ago
- sdcMicro☆84Updated last month
- Paper and related materials for Rodriguez & Spirling (JOP, 2022) word embeddings overview and assessment☆46Updated 3 years ago
- First R package to propose user-friendly functions to compute a series of indices commonly used in Economic Geography.☆42Updated last year
- ☆32Updated 2 years ago