urban-labs / namematch
Tool for probabilistically linking the records of individual entities (e.g. people) within and across datasets
☆114 · Updated 6 months ago
Alternatives and similar repositories for namematch
Users who are interested in namematch are comparing it to the libraries listed below.
- Fast, flexible name matching for large datasets ☆72 · Updated 2 weeks ago
- This repository contains the raw data, code, and sources used to create an individual level and state municipal incorporation date datase… ☆24 · Updated 3 months ago
- Fuzzy matches and merging of datasets in pandas using csvmatch ☆74 · Updated 5 years ago
- Daily refreshed data on representation certification and unfair labor cases from nlrb.gov ☆19 · Updated last month
- Partition selection, point estimation, pointwise and uniform inference, and graphical procedures using binscatter methods. ☆45 · Updated last week
- ☆80 · Updated 4 years ago
- ☆22 · Updated 2 years ago
- Given a job title and job description, the algorithm assigns a standard occupational classification (SOC) code to the job. ☆73 · Updated 11 months ago
- A convenient way to link, deduplicate, aggregate and cluster data(frames) in Python using deep learning ☆119 · Updated 2 months ago
- Open Source Proxy Demographic module written in Python ☆35 · Updated last year
- Innovation across ages ☆69 · Updated 2 years ago
- Every big regression is a small regression with weights. ☆50 · Updated 3 weeks ago
- Fast sparse regressions with advanced formula syntax. OLS, GLM, Poisson, Maxlike, and more. High-dimensional fixed effects. ☆63 · Updated last year
- Python package for text mining of time-series data ☆73 · Updated last month
- Data, code, and methodology supporting BuzzFeed News' analysis of the 2016 U.S. Census Survey of Income and Program Participation ☆9 · Updated 2 years ago
- A light-weight wrapper for the Datawrapper API. ☆63 · Updated 10 months ago
- Materials to reproduce our findings in our stories, "Amazon Puts Its Own 'Brands' First Above Better-Rated Products" and "When Amazon Tak… ☆70 · Updated 3 years ago
- Crowd-sourced COVID-19 Dataset Tracking Involuntary Government Restrictions (TIGR) ☆29 · Updated 5 years ago
- Unstructured Code with interesting analysis ☆37 · Updated 7 months ago
- Slides for the Seattle University Causal Inference Class ☆135 · Updated 4 years ago
- A maximum-strength name parser for record linkage. ☆37 · Updated last month
- ☆21 · Updated last year
- Code to replicate GAIN application in Athey, Chetty, Imbens and Kang (2019) using simulated employment data ☆32 · Updated 3 years ago
- ☆26 · Updated 5 years ago
- Econometrics and data manipulation functions. ☆114 · Updated 3 years ago
- Hierarchical clustering of 2011-2022 Congress Twitter ☆29 · Updated 2 years ago
- Lots of metrics for quantifying gerrymandering. ☆29 · Updated 2 years ago
- Teenage Driving, Mortality, and Risky Behaviors: Public Use Data Repository ☆31 · Updated 2 years ago
- "How Many Jobs Can be Done at Home?" by Jonathan Dingel and Brent Neiman ☆104 · Updated last month
- Packages of Example Data for The Effect ☆139 · Updated 6 months ago