datasciencecampus / pprl_toolkitLinks
The privacy-preserving record linkage toolkit: a proof-of-concept public demo of next-gen data linkage techniques.
☆11Updated last year
Alternatives and similar repositories for pprl_toolkit
Users that are interested in pprl_toolkit are comparing it to the libraries listed below
Sorting:
- Performs unique entity estimation corresponding to Chen, Shrivastava, Steorts (2018).☆14Updated 6 years ago
- Good Practice Tables - an XlsxWriter wrapper to write consistently formatted statistical tables to Excel.☆38Updated last month
- Blocking records for record linkage and data deduplication based on ANN algorithms in Python.☆13Updated 3 weeks ago
- A maximum-strength name parser for record linkage.☆37Updated last month
- A repository for nowcasting with signature methods☆24Updated 2 years ago
- An R library for working with Table Schema.☆27Updated last month
- ☆17Updated last month
- A Quarto Extension to run sql examples interactively☆38Updated 2 years ago
- R package for Multisource Embeddings for Medical Records☆17Updated 3 years ago
- A text processing pipeline for turning unstructured text data into hierarchical datasets☆14Updated 5 years ago
- Prototype search engine for ONS bulletins☆24Updated last year
- HTTPFS extension for DuckDB. Adds support for an HTTPFileSytem and S3FileSystem.☆18Updated 7 months ago
- The ONS Big Data Team Github pages☆10Updated 4 years ago
- Hierarchical clustering of 2011-2022 Congress Twitter☆29Updated 2 years ago
- Introduction to Econometrics at the University of Oregon (EC421) during Spring quarter, 2020. Taught by Ed Rubin☆15Updated 3 years ago
- Implements an algorithim for Latent Dirichlet Allocation using style conventions from the [tidyverse](https://style.tidyverse.org/) and […☆42Updated 4 months ago
- Tools to look at xml data. Has functions similar to the `tree` command line tool ( xml_view_tree). Allows one to find paths quickly, incl…☆25Updated 2 years ago
- Python wrapper for a C++ Double Metaphone☆15Updated 3 weeks ago
- Classify names by gender, U.S. ethnicity, or leaf nationality☆19Updated 6 years ago
- Collect and combine data for analysis of the Medicare Advantage market from 2008 through 2015.☆10Updated 2 years ago
- Commuting zones are geographic areas where people live and work and are useful for understanding local economies, as well as how they dif…☆40Updated last year
- hettx: Detecting and Measuring Treatment Effect Variation☆10Updated last year
- Data quality reporting for temporal datasets.☆36Updated 2 weeks ago
- qdapTools is an R package that contains tools associated with the qdap package that may be useful outside of the context of text analysis…☆16Updated 2 years ago
- R package for validating sub-national statistical typologies, re-coding across standard typologies of sub-national statistics. Check out …☆12Updated 2 years ago
- An R Package Skeleton Generator☆21Updated 2 years ago
- Command line interface for publishing to Posit Connect☆34Updated last week
- Data package for the data sets from the book "A Handbook of Small Data Sets" by David Hand (1994)☆16Updated 5 months ago
- Slides and resources for flexdashboard talk at UseR! 2016☆11Updated 8 years ago
- A collection of network analytic (helper) functions that do not deserve a package on their own☆14Updated 7 months ago