cleanzr / fasthashLinks
Performs unique entity estimation corresponding to Chen, Shrivastava, Steorts (2018).
☆14Updated 6 years ago
Alternatives and similar repositories for fasthash
Users that are interested in fasthash are comparing it to the libraries listed below
Sorting:
- Python wrapper for a C++ Double Metaphone☆15Updated this week
- ☆13Updated 6 years ago
- A maximum-strength name parser for record linkage.☆37Updated 3 weeks ago
- A browser user interface for manual labeling of record pairs.☆47Updated 2 years ago
- MPEDS Annotation Interface☆18Updated 2 years ago
- Fork of the Freely Extensible Biomedical Record Linkage program☆24Updated 8 years ago
- Perform Bayesian record linkage with a one-to-one matching assumption.☆11Updated 5 years ago
- Egonet is a program for the collection and analysis of egocentric network data. It helps you create the questionnaire, collect data, and …☆23Updated 3 years ago
- A text processing pipeline for turning unstructured text data into hierarchical datasets☆14Updated 5 years ago
- DreamBank Visualized - An interactive visualization of over 26,000 dream transcriptions☆15Updated 7 years ago
- Predict whether a student will correctly answer a problem based on past performance using automated feature engineering☆32Updated 4 years ago
- Distributed Bayesian Entity Resolution in Apache Spark☆57Updated 4 years ago
- R package for Multisource Embeddings for Medical Records☆17Updated 3 years ago
- Interactive notebooks containing demonstration code of the splink library☆38Updated last year
- A utility tool to automate certain tasks with Jupyter notebooks.☆9Updated last year
- Stanford Entity-Resolution Framework☆24Updated 7 years ago
- Interactive Network Graph Visualization for NDTV-generate graphs using D3 animation☆18Updated 9 years ago
- Do things with words. Scale them, mostly.☆17Updated 4 years ago
- Data Scientist code test☆19Updated 5 years ago
- Compare accuracies of udpipe models and spacy models which can be used for NLP annotation☆14Updated 7 years ago
- ☆11Updated 6 years ago
- A Python library to generate static data catalog sites. Carte scrapes metadata from your data assets and generates a fully searchable fro…☆27Updated 3 years ago
- Various functions to make bag-of-words approaches to text analysis more user-friendly☆24Updated 8 years ago
- This code is to demonstrate the use of esquisse to generate ggplot2 with drag and drop☆9Updated 6 years ago
- R wrapper for ArrayFire☆22Updated 7 years ago
- A method for estimating causal effects in time-series data. Uses available data to automatically find natural experiments for identifying…☆15Updated 5 years ago
- HTTP interface to Stan, a package for Bayesian inference.☆40Updated 5 months ago
- Dexter document monitor for MMA☆16Updated last year
- TopicScan: Visualization and validation interface for NMF Topic Modeling☆23Updated 4 years ago
- Easy, fast clustering of texts☆18Updated 8 years ago