joeornstein / fuzzylink
Probabilistic Record Linkage Using Pretrained Text Embeddings
☆11Updated last month
Alternatives and similar repositories for fuzzylink:
Users that are interested in fuzzylink are comparing it to the libraries listed below
- ☆14Updated 9 months ago
- A library of functions enabling complex corpus search in context (KWIC), search aggregation, bag-of-words building & keyphrase extraction…☆20Updated 6 years ago
- R package to import articles from newspaper databases☆14Updated last year
- Record Linkage Toolkit for R☆43Updated last year
- Tools for Statistical Content Analysis☆16Updated 2 years ago
- Similarity and distance measures for clustering and record linkage applications in R☆18Updated 3 years ago
- An R package for blocking records for record linkage / data deduplication based on approximate nearest neighbours algorithms.☆10Updated this week
- unofficial GESIS Quarto theme for revealjs presentations☆11Updated 2 months ago
- Automatic knowledge classification based on keyword co-occurrrence network☆15Updated last month
- Create interoperable and well described data frames in R☆14Updated 3 weeks ago
- Convert alternative country name to simple country names☆11Updated 4 years ago
- Hierarchical clustering of 2011-2022 Congress Twitter☆29Updated 2 years ago
- DDI with R☆16Updated 4 months ago
- Probabilistic Record Linkage in R☆59Updated 2 years ago
- Read Microsoft Access tables in R☆12Updated 7 months ago
- R package for OpenRefine API☆22Updated 2 years ago
- General Social Survey (GSS) data files packaged for R☆45Updated 5 months ago
- Shiny Application for Qualitative Data Analysis☆26Updated last week
- Implements an algorithim for Latent Dirichlet Allocation using style conventions from the [tidyverse](https://style.tidyverse.org/) and […☆41Updated 3 months ago
- ☑️ U.S. 2020 Democratic Election WSJ Cartogram in R☆9Updated 5 years ago
- Interact with Wikidata and get tidy data frames in response☆26Updated 8 months ago
- 😷Weekly Surveillance Summary of U.S. COVID-19 Activity☆10Updated 4 years ago
- The masterclass "Large Language Models for Data Science" explains what LLMs are, what they can and cannot do, and what they can be used f…☆19Updated 2 months ago
- ☆15Updated 2 weeks ago
- Scale ideological slant of Tweets☆21Updated 5 years ago
- Retrieve and Parse Meta Ad Targeting Data☆40Updated 2 months ago
- Tools to look at xml data. Has functions similar to the `tree` command line tool ( xml_view_tree). Allows one to find paths quickly, incl…☆25Updated 2 years ago
- R Evolved Generalized Software for Sampling Estimates and Errors in Surveys☆12Updated 4 months ago
- quanteda textmodel extensions for classifying documents☆21Updated last year
- API Wrapper for the mediacloud.org API☆16Updated 5 years ago