cleanzr / fasthash
Performs unique entity estimation following Chen, Shrivastava, and Steorts (2018).
☆15 Updated 6 years ago
Alternatives and similar repositories for fasthash
Users who are interested in fasthash are comparing it to the libraries listed below.
- Compare accuracies of udpipe models and spacy models which can be used for NLP annotation ☆14 Updated 7 years ago
- Classify names by gender, U.S. ethnicity, or leaf nationality ☆19 Updated 7 years ago
- MPEDS Annotation Interface ☆18 Updated 3 years ago
- TopicScan: Visualization and validation interface for NMF Topic Modeling ☆23 Updated 5 years ago
- Text Thresher crowd sourced text annotator ☆17 Updated 7 years ago
- Regular Expression Counts of Terms and Substrings ☆25 Updated 3 years ago
- Active Learning in R ☆47 Updated 8 years ago
- Lecture Slides for Introduction to Data Science ☆25 Updated 2 years ago
- Python wrapper for a C++ Double Metaphone ☆15 Updated last month
- A convenience R package for getting Wikipedia article access statistics (and more). ☆77 Updated 5 years ago
- A browser user interface for manual labeling of record pairs. ☆47 Updated 2 years ago
- R tools for GDELT and the Global Knowledge Graph ☆14 Updated 11 years ago
- An R Package for Text Analysis ☆46 Updated 2 years ago
- Project Dense Vectors Text Representation on 2D Plan ☆16 Updated 6 years ago
- RECSM-UPF Summer School: Social Media and Big Data Research ☆22 Updated 8 years ago
- R package to Embed All the Things! using StarSpace ☆103 Updated last year
- A maximum-strength name parser for record linkage. ☆39 Updated 2 months ago
- Interactive Network Graph Visualization for NDTV-generate graphs using D3 animation ☆18 Updated 10 years ago
- Amazon Web Services Bundle Package ☆15 Updated 5 years ago
- Fast, flexible name matching for large datasets ☆72 Updated 2 months ago
- Do things with words. Scale them, mostly. ☆17 Updated 4 years ago
- Selective Bayesian Forest Classifier - R package for simultaneous feature selection and classification. See paper: http://arxiv.org/abs/1… ☆16 Updated 3 years ago
- Model verification, validation, and error analysis ☆59 Updated last year
- A containerized demo of Airflow using gusty ☆39 Updated last year
- Library to read a subset of Parquet files ☆45 Updated 5 years ago
- Tool for probabilistically linking the records of individual entities (e.g. people) within and across datasets ☆118 Updated 3 weeks ago
- Encode Categorical Features (unmaintained) ☆32 Updated 3 years ago
- This is a read-only mirror of the CRAN R package repository. bsts — Bayesian Structural Time Series ☆34 Updated 2 months ago
- Distributed Bayesian Entity Resolution in Apache Spark ☆58 Updated 4 years ago
- Graphical User Interface for Seasonal Adjustment ☆23 Updated last year