urban-labs / namematchLinks
Tool for probabilistically linking the records of individual entities (e.g. people) within and across datasets
☆115Updated 6 months ago
Alternatives and similar repositories for namematch
Users that are interested in namematch are comparing it to the libraries listed below
Sorting:
- Fast, flexible name matching for large datasets☆72Updated last month
- Given a job title and job description, the algorithm assigns a standard occupational classification (SOC) code to the job.☆74Updated 11 months ago
- ☆80Updated 4 years ago
- This repository contains the raw data, code, and sources used to create an individual level and state municipal incorporation date datase…☆24Updated 3 months ago
- Partition selection, point estimation, pointwise and uniform inference, and graphical procedures using binscatter methods.☆45Updated 3 weeks ago
- Daily refreshed data on representation certification and unfair labor cases from nlrb.gov☆19Updated 2 months ago
- Text and statistics utilities from Pew Research Center☆84Updated 3 years ago
- A convenient way to link, deduplicate, aggregate and cluster data(frames) in Python using deep learning☆120Updated 2 months ago
- A maximum-strength name parser for record linkage.☆37Updated last week
- ☆22Updated 2 years ago
- Every big regression is a small regression with weights.☆51Updated last month
- Python package for text mining of time-series data☆73Updated last month
- Hierarchical clustering of 2011-2022 Congress Twitter☆29Updated 2 years ago
- ☆21Updated last year
- Open Source Proxy Demographic module written in Python☆35Updated last year
- Innovation across ages☆70Updated 2 years ago
- Fuzzy matches and merging of datasets in pandas using csvmatch☆74Updated 5 years ago
- Data, code, and methodology supporting BuzzFeed News' analysis of the 2016 U.S. Census Survey of Income and Program Participation☆9Updated 2 years ago
- Dataset: BuzzFeed News “Trending” Strip, 2018–2023☆19Updated 2 years ago
- Materials to reproduce our findings in our stories, "Amazon Puts Its Own 'Brands' First Above Better-Rated Products" and "When Amazon Tak…☆69Updated 3 years ago
- Packages of Example Data for The Effect☆140Updated 7 months ago
- Econometrics and data manipulation functions.☆115Updated 3 years ago
- MoodCat😼 classifies the mood of English sentences.☆14Updated 3 years ago
- A light-weight wrapper for the Datawrapper API.☆63Updated 11 months ago
- Classify names by gender, U.S. ethnicity, or leaf nationality☆19Updated 6 years ago
- pytorch implementation of BLP'95☆26Updated 5 years ago
- Natural language processing tools developed by the World Bank's DECAT unit. A suite of text preprocessing and cleaning algorithms for NLP…☆10Updated 3 years ago
- MPEDS Annotation Interface☆18Updated 2 years ago
- Lots of metrics for quantifying gerrymandering.☆29Updated 2 years ago
- Code to replicate GAIN application in Athey, Chetty, Imbens and Kang (2019) using simulated employment data☆32Updated 3 years ago