onyxrev / common_nickname_csv
A simple CSV that maps common first names to their nickname counterparts and vice-versa. Released as public domain.
☆28 · Updated 3 years ago
Related projects
Alternatives and complementary repositories for common_nickname_csv
- Variations of the record linkage model of Steorts et al. AISTATS 2014's "SMERED: A Bayesian Approach to Graphical Record Linkage and De-d… ☆27 · Updated 7 years ago
- Generates a long-form version of every field in the IRS 990 e-file dataset based on the NOPDC "Datathon" concordance ☆33 · Updated 6 years ago
- Classify names by gender, U.S. ethnicity, or leaf nationality ☆19 · Updated 6 years ago
- Download IPEDS complete data files ☆38 · Updated 6 years ago
- Generate a research database from the IRS 990 E-Filer Returns on AWS. ☆14 · Updated 2 years ago
- Predict the Race of a Given Surname Using Census Data ☆12 · Updated last year
- Visualizing Intergenerational Wealth Mobility and Racial Inequality ☆9 · Updated 5 years ago
- Parser and standardizer for politician, individual and organization names. ☆128 · Updated 7 years ago
- Text Thresher crowdsourced text annotator ☆16 · Updated 6 years ago
- ☆10 · Updated last year
- UK Baby Names Data ☆20 · Updated 2 years ago
- Grimmer's Senate Press Releases ☆10 · Updated 10 years ago
- Computing and reproducibility bootcamp for Duke StatSci graduate students. ☆11 · Updated 8 years ago
- R package to help wrangle campaign finance data 💸 ☆17 · Updated last year
- The documentation and scripts for the Local News Dataset ☆23 · Updated 2 years ago
- Demonstration of how dedupe might be used as geocoder ☆17 · Updated 2 years ago
- Add state and county fips codes to data ☆41 · Updated 5 months ago
- Introduction to Topic Modeling for TextXD 2019, 12/3/2019 ☆10 · Updated 4 years ago
- A simple command line interface to the datamade/dedupe library. ☆42 · Updated last year
- R package for using USPS' free address validation API ☆24 · Updated 4 years ago
- Mapping of US zipcodes to Congressional Districts ☆51 · Updated last week
- An awesome list of (large-scale) public datasets on the Internet. (Ongoing collection) ☆24 · Updated 2 years ago
- Project generator for use with the datakit framework. ☆27 · Updated 8 months ago
- A data package for R containing historical datasets about gender ☆23 · Updated 2 years ago
- Data and scripts relating to the publishing of the House expenditure reports, and hopefully the Senate's in future. ☆24 · Updated 3 years ago
- NamSor API v2 R SDK - classify personal names accurately by gender, country of origin, or ethnicity. ☆12 · Updated 3 years ago
- Loads raw FEC filings into a database ☆21 · Updated last year
- SigOpt's public R client ☆14 · Updated last year
- Word-Based Dictionaries for Natural Language ☆10 · Updated 5 years ago
- R tools for GDELT and the Global Knowledge Graph ☆14 · Updated 10 years ago