cleanzr / fasthash
Performs unique entity estimation corresponding to Chen, Shrivastava, Steorts (2018).
☆14Updated 5 years ago
Related projects ⓘ
Alternatives and complementary repositories for fasthash
- Python wrapper for a C++ Double Metaphone☆15Updated last year
- MPEDS Annotation Interface☆18Updated 2 years ago
- A maximum-strength name parser for record linkage.☆34Updated 3 months ago
- Encode Categorical Features (unmaintained)☆32Updated 2 years ago
- ☆13Updated 5 years ago
- Selective Bayesian Forest Classifier - R package for simultaneous feature selection and classification. See paper: http://arxiv.org/abs/1…☆16Updated 2 years ago
- A browser user interface for manual labeling of record pairs.☆41Updated last year
- Compare accuracies of udpipe models and spacy models which can be used for NLP annotation☆14Updated 6 years ago
- Perform Bayesian record linkage with a one-to-one matching assumption.☆11Updated 4 years ago
- A Los Angeles Times analysis found that LAPD officers search blacks and Latinos far more often than whites during traffic stops even thou…☆11Updated 5 years ago
- Regular Expression Counts of Terms and Substrings☆25Updated 2 years ago
- Tidy Simultaneous Confidence Intervals for Multinomial Proportions☆12Updated 4 years ago
- AWS IAM Client Package☆15Updated 4 years ago
- MoodCat😼 classifies the mood of English sentences.☆14Updated 2 years ago
- Summer School: Social Media and Big Data Research☆13Updated 6 years ago
- R code for reading and writing files in libsvm format☆14Updated 9 years ago
- Slides for 3-day forecasting workshop☆19Updated 6 years ago
- R tools for GDELT and the Global Knowledge Graph☆14Updated 10 years ago
- Implements the model described in "Identification, Interpretability, and Bayesian Word Embeddings"☆18Updated 5 years ago
- Lecture Slides for Introduction to Data Science☆25Updated last year
- Library to read a subset of Parquet files☆43Updated 4 years ago
- A visual analysis tool for exploring multiverse outcomes☆31Updated 2 years ago
- Data Scientist code test☆19Updated 4 years ago
- Instant Access to your Favorite Emoji☆16Updated last year
- An R package for out-of-core regressions☆14Updated 6 years ago
- R package to compute and visualize summary trees☆34Updated 8 years ago
- Course materials for Math 241 at Reed College.☆11Updated 6 years ago
- Classify names by gender, U.S. ethnicity, or leaf nationality☆19Updated 6 years ago
- R bindings to apache arrow☆32Updated 6 years ago